Basics of Machine Learning

This chapter covers the core mathematical machinery of machine learning: the objective functions we optimize, the algorithms that optimize them, and the techniques that ensure models generalize.

Loss Functions: Cross entropy, KL divergence, and the mathematics of measuring prediction error
RL Algorithms for LLMs: PPO, DPO, GRPO, and RLOO for reinforcement learning from human feedback
Gradient Descent: Optimization algorithms such as SGD, momentum, Adam, and convergence analysis
Backpropagation: The chain rule on computational graphs, covering the forward pass, backward pass, and gradient flow
Regularization: Controlling model complexity through L1/L2 penalties, dropout, early stopping, and the bias-variance tradeoff