Common ML Algorithms

There is no "one size fits all" algorithm. Different problems require different tools.

1. Linear Regression 📏

Type: Regression
Idea: Fit a straight line through the data.
Best for: Predicting continuous numbers (e.g., House Prices).
Pros: Simple, interpretable.
Cons: Assumes a linear relationship (real life is rarely linear).

2. Logistic Regression ⚖️

Type: Classification (despite the name!)
Idea: Uses a "Sigmoid" function to squash output between 0 and 1 (probability).
Best for: Binary classification (Spam/Not Spam).
Pros: Outputs probabilities.
Cons: Struggles with complex, non-linear boundaries.

3. Decision Trees 🌳

Type: Regression & Classification
Idea: A flowchart-like structure. "If Age > 30, go left. Else, go right."
Best for: Categorical data, clear decision rules.
Pros: Easy to visualize and explain to humans.
Cons: Prone to Overfitting (memorizing the data).

4. Random Forest 🌲🌲🌲

Type: Ensemble (Regression & Classification)
Idea: Train 100 Decision Trees on random subsets of data and average their predictions.
Best for: Almost everything! It's a great default algorithm.
Pros: Very accurate, robust to overfitting.
Cons: Slow to train, hard to interpret (Black Box).

5. K-Nearest Neighbors (KNN) 📍

Type: Classification
Idea: "Show me who your friends are, and I'll tell you who you are." Classifies a point based on its nearest neighbors.
Best for: Simple recommendation systems.
Pros: No training phase (Lazy Learner).
Cons: Slow on large datasets.

Interactive Visualization: Decision Tree

Imagine we are classifying fruit based on Size and Color.

Interactive Demo: Random Forest Classifier

Let's use a Random Forest to classify the famous Iris Dataset (flowers).

PYTHON PLAYGROUND

⏳ Loading editor…

Quiz

Question 1 of 3

Which algorithm is best for predicting a Yes/No outcome?

Linear Regression

Logistic Regression

K-Means

Key Takeaways

✅ Linear/Logistic Regression are simple baselines.
✅ Decision Trees are interpretable but prone to overfitting.
✅ Random Forest is a powerful, accurate all-rounder.

What's Next?

We trained a model. But is it any good? How do we measure "good"?

Next Chapter: Model Evaluation.