Train both models on the same dataset, compare metrics (Accuracy/F1), inspect overfitting, and review feature importance.
Use a tabular dataset such as Iris (multi-class) or Breast Cancer (binary). The example below uses Iris.
# Imports
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score, f1_score, classification_report
import numpy as np
# Data & split (same split for both)
# Iris: 150 samples, 4 numeric features, 3 balanced classes.
X, y = load_iris(return_X_y=True)
# stratify=y keeps the class proportions identical in train and test;
# a fixed random_state makes the split reproducible across runs.
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=42, stratify=y)
# Models
# max_depth=None lets the single tree grow to pure leaves — deliberately
# unconstrained so its overfitting is visible in the train/test gap.
dt = DecisionTreeClassifier(max_depth=None, random_state=42)
# 200 trees averaged to reduce variance; n_jobs=-1 uses all CPU cores.
rf = RandomForestClassifier(n_estimators=200, max_depth=None, random_state=42, n_jobs=-1)
# Fit
# Both models train on the identical split for a fair comparison.
dt.fit(X_tr, y_tr)
rf.fit(X_tr, y_tr)
# Train & Test metrics
def metrics(name, model, X_train=None, y_train=None, X_test=None, y_test=None):
    """Print train/test Accuracy and macro-F1 for a fitted classifier.

    A large train-test gap signals overfitting. The data arguments default
    to the module-level split (X_tr/y_tr/X_te/y_te) so existing calls such
    as ``metrics("Decision Tree", dt)`` keep working unchanged; passing
    them explicitly makes the function reusable with any split.

    Parameters
    ----------
    name : str           -- label printed in the header.
    model : estimator    -- fitted object exposing ``predict``.
    X_train, y_train, X_test, y_test : array-like, optional
        Evaluation data; fall back to the global split when omitted.

    Returns
    -------
    dict with keys 'train_acc', 'test_acc', 'train_f1', 'test_f1'
    (unrounded floats), so callers can reuse the numbers programmatically.
    """
    # Fall back to the module-level split so the original call sites still work.
    X_train = X_tr if X_train is None else X_train
    y_train = y_tr if y_train is None else y_train
    X_test = X_te if X_test is None else X_test
    y_test = y_te if y_test is None else y_test

    yhat_tr = model.predict(X_train)
    yhat_te = model.predict(X_test)
    # macro-F1 weights every class equally — appropriate for balanced Iris.
    out = {
        "train_acc": accuracy_score(y_train, yhat_tr),
        "test_acc": accuracy_score(y_test, yhat_te),
        "train_f1": f1_score(y_train, yhat_tr, average='macro'),
        "test_f1": f1_score(y_test, yhat_te, average='macro'),
    }
    print(f"== {name} ==")
    print("Train Acc:", round(out["train_acc"], 3),
          " Test Acc:", round(out["test_acc"], 3))
    print("Train F1:", round(out["train_f1"], 3),
          " Test F1:", round(out["test_f1"], 3))
    print()
    return out
# Evaluate both models on the identical train/test split.
metrics("Decision Tree", dt)
metrics("Random Forest", rf)
# Feature importance (Random Forest)
# Gini-based importances from the fitted forest; they sum to 1.
scores = rf.feature_importances_
# argsort gives ascending order; reversing yields most-important-first.
order = np.argsort(scores)[::-1]
print("Feature importance (RF):")
for feat in order:
    print(f" x{feat} -> {scores[feat]:.3f}")
# Optional: classification report
# Per-class precision/recall/F1 on the held-out test set.
# Fixed: "\\n" printed a literal backslash-n; "\n" emits a real blank line.
print("\nRandom Forest report:")
print(classification_report(y_te, rf.predict(X_te)))
Interpretation: If the Decision Tree shows very high train score but lower test score relative to Random Forest, that’s a sign of overfitting. RF usually generalizes better.
Key hyperparameters to tune:
- Decision Tree: max_depth (limit depth to reduce variance), min_samples_leaf (smooth leaf predictions), criterion (gini vs entropy).
- Random Forest: n_estimators (more trees → stabler scores), max_depth and min_samples_leaf (control overfitting), max_features (randomness per split, e.g. sqrt), oob_score=True (quick internal validation).
Start with max_depth for the Decision Tree and n_estimators/max_features for the Random Forest.