
BLOG · 18/3/2026

To familiarize with Jupyter Notebook as a tool for both coding and communication. This task is designed to build confidence in writing clean, readable, and well-structured notebooks using both code and Markdown.
I learnt how to use Jupyter Notebook along with its working. Also completed the given quests of making a report using markdown language and also displayed python code and visualization. My-Markdown-repo
Understand foundational ML concepts and data preparation techniques by watching two beginner-friendly videos and writing an article.
As per the given refernce video, I got introduced to basic Machine Learning concepts. From the first video I learnt the following things:
A method that fits a straight line to data to show a trend and make predictions.
A more complex model that curves to fit every individual point in the training data.
The ability of a model to make accurate predictions on new data (testing data) rather than just memorizing the training data.
When a model (like the "green squiggle") fits the training data too perfectly, it loses the ability to generalize and performs poorly on testing data.
The concept that a model that fits training data too closely (high variance) may perform poorly on new data is introduced as the bias-variance tradeoff.
The second video is about key concepts used in Data preparation for Machine Learning;
The amount of data required depends on the problem complexity and the learning algorithm.
The principle of “garbage in, garbage out” emphasises that poor-quality or inaccurate data will produce poor results regardless of model sophistication or resources. Data must be accurate, complete, and relevant.
In supervised learning, data must be labelled to provide correct answers during training.
Although large datasets are valuable, not all data points contribute to model accuracy. Dimensionality reduction removes irrelevant or redundant features to improve model performance.
This transforms raw data into a consistent, usable format. It includes: