Standard deviation is the most commonly used measure of spread in statistics โ but most people only vaguely understand what it means. Here's a plain-English explanation with real examples.
Standard deviation appears on exam results, medical studies, financial reports, and quality control charts. If you've ever seen "mean ยฑ SD" and wondered what the ยฑ part actually means, this is the explanation.
Standard deviation measures how spread out values are around the mean (average). A low standard deviation means values are clustered tightly around the average. A high one means they're scattered widely.
Consider two classes of 30 students who both scored an average of 65% on a test:
Same mean. Completely different distributions. Standard deviation captures this difference; the mean alone doesn't.
Worked example: Test scores: [60, 65, 65, 70, 70]
You'll notice calculators offer two options: population SD (divides by n) and sample SD (divides by n-1).
Sample SD uses n-1 (called Bessel's correction) because a small sample tends to underestimate the spread of the full population. For most everyday purposes with large datasets, the difference is negligible.
If data follows a normal distribution (the classic bell curve), standard deviation has a very useful property:
Practical example: Adult male height in the UK has a mean of ~175cm and SD of ~7cm.
Finance: Volatility in investing is measured as standard deviation of returns. A fund with SD of 15% has more variable returns than one with SD of 5%. Higher SD = higher risk and higher potential reward.
Manufacturing: If bolts need to be 10mm with an SD of 0.1mm, quality control can predict exactly what percentage will fall outside acceptable tolerances.
Medicine: "Normal range" for blood tests is often defined as mean ยฑ 2 SD โ the range that covers 95% of healthy individuals. Values outside this trigger further investigation.
Education: Standardised test scoring often converts raw scores to standard deviations from the mean to compare performance across different test forms.
Variance is simply standard deviation squared (or equivalently, SD is the square root of variance). Both measure spread. SD is preferred in most situations because it's in the same units as the original data โ if you're measuring heights in centimetres, SD is also in centimetres, making it intuitively interpretable. Variance would be in cmยฒ, which is harder to visualise.