Sample SD, population SD, complete descriptive statistics, per-value z-scores, and side-by-side dataset comparison — with full deviation tables and step-by-step solutions.
The most important distinction in any standard deviation calculation is whether your data represents a sample (a subset drawn from a larger group) or a population (every single member of the group you care about). The formulas differ in one critical place: what you divide by.
The sample formula divides by n−1 (called Bessel's correction). Why? When you compute the mean from sample data, you've already used one “degree of freedom” — the deviations from the sample mean are forced to sum to zero, so they carry only n−1 independent pieces of information. Dividing by n−1 instead of n corrects for this bias and makes the sample variance an unbiased estimator of the true population variance. As n grows large, the difference becomes negligible.
When to use which: If your data is the complete population (every student in a single class, all products in one batch, all members of a specific group), use population SD. If your data is a random sample and you want to infer about the broader population (1,000 survey respondents representing all voters, 50 test items representing all possible items), use sample SD.
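The two divisors are easy to see side by side with Python's standard `statistics` module, which implements both formulas (the dataset here is purely illustrative):

```python
import statistics

data = [2, 4, 4, 4, 5, 5, 7, 9]  # illustrative dataset, mean = 5

# Population SD: divides the sum of squared deviations by n
pop_sd = statistics.pstdev(data)    # 2.0

# Sample SD: divides by n - 1 (Bessel's correction)
samp_sd = statistics.stdev(data)    # ~2.138, slightly larger

print(pop_sd, samp_sd)
```

The sample SD is always a bit larger than the population SD for the same data, and the gap shrinks as n grows.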
The deviation table is the most transparent way to see exactly how standard deviation is computed. Each row shows one data point and its contribution to the final result. The columns are: the original value x, the deviation from mean (x − x̅), the squared deviation (x − x̅)², and optionally the z-score. The final row sums the squared deviations — that sum divided by n or n−1, then square-rooted, gives the standard deviation.
Values with large squared deviations have outsized influence on SD. One extreme outlier can dramatically inflate the standard deviation, which is why it's sensitive to outliers in a way that median-based measures (like IQR) are not.
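A quick way to see that sensitivity (the numbers are illustrative):

```python
import statistics

base = [10, 11, 12, 13, 14]
with_outlier = base + [50]             # one extreme value appended

print(statistics.stdev(base))          # ~1.58
print(statistics.stdev(with_outlier))  # ~15.6 -- one point inflates SD roughly tenfold
```

Because the outlier's deviation is squared, its contribution dominates the sum even though it is only one of six values.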
| Statistic | Formula | What it measures |
|---|---|---|
| Mean (μ or x̅) | Σx / n | Average; center of mass of data |
| Median | Middle value when sorted | Center value; robust to outliers |
| Mode | Most frequent value(s) | Most common observation |
| Range | Max − Min | Total spread; sensitive to outliers |
| Variance (s²) | Σ(x − x̅)² / (n−1) | Average squared deviation; in units² |
| Standard Deviation | √Variance | Typical distance from mean; in original units |
| Q1 (25th pct) | Median of lower half | Value below which 25% of data falls |
| Q3 (75th pct) | Median of upper half | Value below which 75% of data falls |
| IQR | Q3 − Q1 | Middle 50% spread; very robust to outliers |
| CV (%) | (s / x̅) × 100 | Relative variability; unitless |
| SEM | s / √n | Precision of mean estimate |
| Skewness | Standardized 3rd moment | >0: right tail; <0: left tail |
| Kurtosis | Excess 4th moment | >0: heavy tails; <0: light tails vs normal |
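Most of the statistics in the table are one call (or one line) away in Python's standard library; this sketch uses an illustrative dataset, and note that quartile conventions vary slightly between implementations:

```python
import math
import statistics

data = [6, 7, 7, 8, 9, 10, 12, 15]   # illustrative dataset
n = len(data)

mean   = statistics.fmean(data)       # 9.25
median = statistics.median(data)      # 8.5
mode   = statistics.mode(data)        # 7
rng    = max(data) - min(data)        # 9
s      = statistics.stdev(data)       # sample SD, ~3.01
var    = s ** 2                       # sample variance

# Quartiles: the default "exclusive" method may differ from other conventions
q1, _, q3 = statistics.quantiles(data, n=4)
iqr = q3 - q1                         # middle 50% spread

cv  = s / mean * 100                  # coefficient of variation, %
sem = s / math.sqrt(n)                # standard error of the mean
```

Skewness and kurtosis are not in the standard library; a manual implementation appears later in this article's skewness/kurtosis section's context.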
For a normally distributed dataset, approximately 68% of values fall within 1 standard deviation of the mean, 95% within 2 standard deviations, and 99.7% within 3 standard deviations. This “empirical rule” is the foundation of quality control (Six Sigma tolerances), grading on a curve, outlier detection, and confidence interval intuition.
A value more than 2 SD from the mean is unusual (under 5% of values expected). A value more than 3 SD away is very rare (under 0.3%). These percentages hold strictly only for normal distributions; skewed data can deviate substantially. The calculator visualizes the distribution for every dataset so you can immediately see whether your data has unusual spread or outliers.
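The empirical rule is easy to verify by simulating normal data; this sketch uses Python's `random.gauss`, and the exact fractions vary slightly from run to run:

```python
import random
import statistics

random.seed(42)                      # reproducible illustration
data = [random.gauss(100, 15) for _ in range(100_000)]

mean = statistics.fmean(data)
sd = statistics.pstdev(data)

for k in (1, 2, 3):
    within = sum(abs(x - mean) <= k * sd for x in data) / len(data)
    print(f"within {k} SD: {within:.1%}")   # ~68.3%, ~95.4%, ~99.7%
```

Running the same check on skewed data would show how far those fractions can drift from 68/95/99.7.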
A z-score measures how many standard deviations a value is from the mean: z = (x − x̅) / s. A z-score of +2.0 means the value is 2 standard deviations above average; −1.5 means 1.5 SDs below. Z-scores let you compare values from different datasets on the same scale — is a score of 85 on one test better or worse than 70 on a harder test?
Z-scores are also used for outlier detection: values with |z| > 2 are mild outliers, |z| > 3 are extreme outliers. The calculator flags these automatically in the z-score table.
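A minimal z-score pass over a dataset, applying the |z| > 2 and |z| > 3 thresholds described above (the data is illustrative):

```python
import statistics

data = [68, 70, 71, 69, 72, 70, 71, 69, 70, 68, 72, 110]  # illustrative
mean = statistics.fmean(data)
s = statistics.stdev(data)

for x in data:
    z = (x - mean) / s
    flag = "extreme outlier" if abs(z) > 3 else "mild outlier" if abs(z) > 2 else ""
    print(f"{x:>5}  z = {z:+.2f}  {flag}")   # 110 is flagged; the rest are not
```

Note that with very small samples a single outlier inflates the SD itself, which caps how large its own z-score can get; z-based flagging works best with at least a dozen or so points.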
Standard deviation is measured in the same units as your data, which makes it hard to compare variability across datasets with different units or scales. The coefficient of variation (CV) solves this: CV = (s / x̅) × 100%. A height dataset with SD = 10 cm and mean = 170 cm has CV = 5.9% (low variability). A price dataset with SD = $500 and mean = $1000 has CV = 50% (high variability). CV works best when data is positive and the mean is meaningfully non-zero.
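As a sketch, the two comparisons from that paragraph:

```python
def cv_percent(sd: float, mean: float) -> float:
    """Coefficient of variation as a percentage (mean must be non-zero)."""
    return sd / mean * 100

print(cv_percent(10, 170))    # ~5.9  (heights: low relative variability)
print(cv_percent(500, 1000))  # 50.0  (prices: high relative variability)
```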
Skewness measures asymmetry. Positive skewness means a long right tail (many low values, a few very high ones) — typical of income distributions. Negative skewness means a long left tail. A perfectly symmetric distribution has skewness = 0. Values beyond ±1 are generally considered notably skewed; beyond ±2, substantially skewed.
Excess kurtosis (what this calculator reports, also called Fisher kurtosis) measures tail heaviness relative to a normal distribution. Kurtosis = 0 means normal tails (mesokurtic). Positive kurtosis = heavier tails, more extreme values than expected (leptokurtic). Negative = lighter tails, fewer extremes (platykurtic). Financial return data typically shows high positive kurtosis — rare but very large gains and losses occur more often than a normal distribution would predict.
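Both measures follow directly from the moment definitions above; this sketch uses the uncorrected population estimators (library routines such as `scipy.stats.skew` and `scipy.stats.kurtosis` offer bias-corrected variants):

```python
import statistics

def skew_and_excess_kurtosis(data):
    n = len(data)
    mean = statistics.fmean(data)
    sd = statistics.pstdev(data)                  # population SD for the moments
    m3 = sum((x - mean) ** 3 for x in data) / n   # 3rd central moment
    m4 = sum((x - mean) ** 4 for x in data) / n   # 4th central moment
    skew = m3 / sd ** 3                           # standardized 3rd moment
    excess_kurt = m4 / sd ** 4 - 3                # minus 3 so a normal scores 0
    return skew, excess_kurt

# Long right tail (income-like shape) -> positive skewness
print(skew_and_excess_kurtosis([1, 2, 2, 3, 3, 3, 4, 20]))

# Symmetric data -> skewness 0; flat shape -> negative excess kurtosis
print(skew_and_excess_kurtosis([1, 2, 3, 4, 5]))
```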