Machine Learning
This course moves from supervised prediction to unsupervised discovery. We begin with linear models, add tree-based methods, then study how to evaluate and tune models before finishing with clustering techniques for unlabeled data.
Linear modeling
- Simple and multiple regression
- Inference, diagnostics, regularization, and logistic regression
- A running house-price example
Decision trees
- Tree structure, feature-space partitioning, and leaf predictions
- Gini, entropy, stopping rules, and pruning
- Practical considerations before moving to ensembles
Model evaluation & feature selection
- Train/validation/test strategy, bias-variance, and learning curves
- Metrics, cross-validation, feature selection, and tuning
- Robust workflows for imbalance, thresholds, and leakage prevention
Clustering techniques
- Hierarchical clustering, K-means, and K-medoids
- Gaussian mixtures, DBSCAN, and OPTICS
- Choosing the number of clusters and assessing cluster quality