Skip to main content

Basics of Machine Learning

This chapter covers the core mathematical machinery of machine learning: the objective functions we optimize, the algorithms that optimize them, and the techniques that ensure models generalize.

  • Loss Functions: Cross entropy, KL divergence, and the mathematics of measuring prediction error
  • RL Algorithms for LLMs: PPO, DPO, GRPO, and RLOO for reinforcement learning from human feedback
  • Gradient Descent: Optimization algorithms such as SGD, momentum, Adam, and convergence analysis
  • Backpropagation: The chain rule on computational graphs, covering the forward pass, backward pass, and gradient flow
  • Regularization: Controlling model complexity through L1/L2 penalties, dropout, early stopping, and the bias-variance tradeoff