Learn · Beginner

Backpropagation: how a neural network learns from its mistakes

Every AI model you've heard of - the chatbots, the image generators, the robot policies - was trained by one core algorithm: backpropagation. It's the engine of modern AI, and the idea behind it is more intuitive than its reputation suggests. This lesson explains how a network learns from being wrong.

Start with what a neural network is: a giant function with millions or billions of adjustable dials, called parameters or weights. You feed in an input (say, the start of a sentence), the numbers flow through the dials, and out comes a prediction (the next word). Training means turning all those dials until the predictions are good. The question is: with billions of dials, which ones do you turn, and which way?

First you need to measure how wrong the model is. That's the job of a loss function - a single number that's large when the prediction is bad and small when it's good. If the model should have said 'cat' and was confident about 'dog,' the loss is high. The entire goal of training is to make this number small.

Now the central idea, and it's worth slowing down for. Suppose you nudge one particular dial a tiny bit. Does the loss go up or down, and by how much? That sensitivity - how much the error changes when you wiggle this one dial - is called the gradient for that dial. If you knew the gradient for every dial, you'd know exactly how to adjust each one to reduce the error: turn each dial a small step in the direction that lowers the loss. Repeating that, over and over on millions of examples, is how the model learns. (The repeated 'take a small step downhill' part is called gradient descent; backpropagation is how you compute the gradients that tell you which way downhill is.)

So how do you get the gradient for billions of dials without separately testing each one (which would be hopeless)? This is backpropagation's clever trick. The network is a chain of operations: input feeds layer one, which feeds layer two, and so on to the output. Backpropagation computes the error at the very end, then works backward through the chain, layer by layer, using the calculus chain rule to figure out how much each layer - and each dial inside it - contributed to the final mistake. Blame flows backward from the output toward the input, which is exactly why it's called back-propagation.

The analogy that makes it click: imagine a factory assembly line that produces a flawed product at the end. To fix the process, you don't randomly tweak every station. You start at the final inspection, see what's wrong, and trace the fault backward - this defect came from the painting station, which got a bad part from the welding station, which was misaligned by the cutting station. By the time you've walked back to the start, you know how much each station contributed to the flaw and how to adjust it. Backpropagation walks that blame backward through every layer of the network, and it does it efficiently - the backward pass costs about the same as the forward pass, no matter how many dials there are.

One pass works like this: run an example forward and get a prediction (the forward pass); compare it to the right answer to get the loss; propagate the error backward to get a gradient for every dial (the backward pass); nudge every dial a small step in its improving direction. Do this across mountains of data, millions of times, and a network that started as random noise becomes one that can write, translate, or recognize images. This is the difference between training and inference: backpropagation happens during training; when you actually use the model, only the forward pass runs.

A few things worth knowing. The size of the step matters enormously - the 'learning rate.' Too big and the model overshoots and never settles; too small and training crawls. Backpropagation is also why scaling laws work: because the backward pass is efficient, you can train networks of almost any size the same way, which is what made today's enormous transformer models possible. And there's a bit of history here - the method was popularized by a 1986 paper from Rumelhart, Hinton, and Williams. It sat relatively quiet for decades until fast hardware and big datasets let it shine, and Geoffrey Hinton later shared a Turing Award largely for this line of work.

The one-sentence takeaway: backpropagation is how a network turns a single 'you were wrong' signal into precise, individualized instructions for every one of its billions of dials - by tracing the blame backward, efficiently, from the mistake to its sources.

Key papers
Learning representations by back-propagating errors (Rumelhart, Hinton & Williams, 1986)
Attention Is All You Need (Vaswani et al., 2017)