
What is Supervised Learning?
Supervised learning is a machine learning paradigm in which a model is trained on labeled data β input-output pairs where the correct answer is known. The model learns to map inputs to outputs and can then make predictions on new, unseen data.
Why It Matters
Supervised learning is the most widely used form of machine learning in practice. Classification (is this email spam?), regression (what will this house sell for?), and even the supervised fine-tuning step of LLM training all use supervised learning. It's the paradigm most people encounter first and the backbone of countless production AI systems.
How It Works
- Labeled dataset β each training example has an input and a corresponding label (the "ground truth").
- Model selection β choose an algorithm (linear regression, decision tree, neural network, etc.).
- Training β the model processes training examples, makes predictions, computes error using a loss function, and adjusts its parameters via gradient descent to reduce error.
- Evaluation β test on held-out data to measure accuracy, precision, recall, or other metrics.
- Prediction β apply the trained model to new inputs.
Two main tasks:
- Classification β predict a category (spam/not spam, cat/dog, positive/negative sentiment)