Supervised Learning Fundamentals

Supervised Learning is like teaching a child with flashcards. You show the model an input (image of a cat) and tell it the correct answer ("Cat"). The model learns to map inputs to outputs.

1. Regression vs. Classification 🎯

There are two main types of Supervised Learning problems:

Regression (Predicting a Number)

Goal: Predict a continuous value.
Examples:
- Predicting house prices ($250,000, $500,000).
- Predicting temperature (72°F, 85°F).
- Predicting stock prices.

Classification (Predicting a Label)

Goal: Predict a category (class).
Examples:
- Is this email Spam or Not Spam? (Binary Classification)
- Is this image a Cat, Dog, or Bird? (Multi-class Classification)
- Will this customer churn? (Yes/No)

2. The Training Pipeline 🚂

How does a machine actually learn?

Data Collection: Gather features ($X$) and labels ($y$).
Split Data: Divide into Training Set (80%) and Test Set (20%).
Model Selection: Choose an algorithm (e.g., Linear Regression).
Training: The model looks at the Training Set and tries to minimize error.
Evaluation: Test the model on the Test Set to see how well it generalizes.

3. Loss Functions 📉

How does the model know if it's wrong? Loss Functions.

A Loss Function measures the distance between the model's prediction (\hat{y}) and the actual value (y). The goal of training is to minimize this loss.

MSE (Mean Squared Error): Used for Regression.
- Loss = (y - \hat{y})^2
Cross-Entropy Loss: Used for Classification.
- Penalizes confident wrong answers.

Interactive Challenge: Regression vs Classification

Decide which type of problem each scenario is.

Quiz

Question 1 of 3

Predicting the exact price of a used car.

Regression

Classification

Clustering

Interactive Demo: Simple Linear Regression

Let's use scikit-learn (the standard ML library) to predict a number.

PYTHON PLAYGROUND

⏳ Loading editor…

Key Takeaways

✅ Regression predicts numbers; Classification predicts labels.
✅ Training minimizes a Loss Function.
✅ Scikit-Learn is the go-to library for traditional ML.

What's Next?

Now that we know the basics, let's look at the most popular algorithms in the data scientist's toolbox.

Next Chapter: Common ML Algorithms.