The Central Limit Theorem
Here's one of the most surprising ideas in all of statistics: take samples from almost any distribution — lopsided, lumpy, whatever — average each sample, and those averages pile up into a smooth, predictable bell curve. Let's not take that on faith. Let's watch it happen.
The setup
Imagine a population — say, the reaction times of every student in a huge psychology study. The shape of that population can be anything. Most real data is not a tidy bell curve; reaction times are usually skewed, with a long tail of slow responses.
Now we do something simple and repeat it over and over:
- Reach into the population and grab a random sample of size n.
- Compute the mean of just that sample — one number.
- Write that number down, and do it all again.
The collection of all those sample means has its own distribution — the sampling distribution of the mean. The Central Limit Theorem tells us what it looks like.
🎮 Sampling Distribution Playground
Pick a population shape, choose a sample size, then draw samples and watch the averages collect below.
① The population you're sampling from
② Your most recent sample of n values (their average is the orange line)
③ The sampling distribution — every orange average lands here. The dashed curve is the bell curve the CLT predicts.
What you should notice
Play with it for a minute, and three things jump out — these are the Central Limit Theorem:
1. The averages form a bell curve — even when the population is skewed or lumpy. Start with the "Skewed" or "Bimodal" population and you'll still get a symmetric bell in panel ③.
2. Bigger samples → tighter bell. Drag n from 1 up to 50 and watch panel ③ get narrower. Averaging more values per sample makes the averages cluster more tightly around the truth.
3. The bell is centered on the population mean. The averages don't drift — they pile up right over the true mean of the population.
The one formula worth remembering
The spread of the sampling distribution has a name — the standard error — and a tidy formula:
SE = σ / √n
where σ is the population's standard deviation and n is your sample size. Notice the √n on the bottom: to halve the standard error, you need four times the data. That diminishing return is why the playground's bell tightens quickly at first, then more slowly. Compare the "SE" you measured with the "σ/√n" the theorem predicts — they should match closely once you've drawn enough samples.
Why it matters: almost everything that comes next — confidence intervals, t-tests, p-values — leans on the fact that sample means behave predictably like this. The CLT is the bridge from "messy real data" to "clean, usable inference."