Visual Statistics

Unit 1 · Distributions

The Normal Curve & the Empirical Rule

Almost every quantitative variable that piles up in the middle and tapers at the edges is well-approximated by a normal model. The famous 68–95–99.7 rule says that for any normal distribution, those percentages of data fall within 1, 2, and 3 standard deviations of the mean. Change the spread (σ) — the curve gets wider or narrower, but the percentages never move.

Standard deviation σ 15

Shade left of z 0.00

68.3%Band area

50.0%P(Z < z)

50.0%P(Z > z)

💡 The idea

Mean μ = 100 here. Spread changes the shape width, but a value's z-score — how many σ it sits from the mean — is what determines its percentile. That is why every normal problem reduces to the single standard normal curve.

Unit 1 · Shape & Center

Mean vs. Median: Who Gets Pulled?

A classic exam trap: which is bigger, the mean or the median? The answer lives in the shape. The median sits at the 50% mark and barely flinches, but the mean is a balance point that the long tail drags toward it. Slide from a left skew to a right skew and watch the two lines separate.

Skewness 0.0

—Mean (•)

—Median (┃)

SymmetricShape

🎯 Exam shortcut

The mean chases the tail. Right-skewed → mean > median. Left-skewed → mean < median. Roughly symmetric → they sit on top of each other.

Unit 2 · Two-Variable Data

Correlation & the Least-Squares Line

Click anywhere in the plot to drop a point. The correlation r and the least-squares regression line ŷ = a + bx recompute instantly. Notice how a single far-away point (an influential outlier) can swing the line — and how r only measures the strength of a linear pattern.

Generate cloud · target r 0.80

—Correlation r

—r²

—LSRL ŷ = a + bx

0Points

⚠️ Watch out

A strong r does not mean a line is the right model, and it never proves causation. Always picture the scatter before trusting the number.

Unit 4 · Probability

The Binomial Distribution

Count the successes in n independent trials, each with success probability p. The result is a binomial distribution with mean np and standard deviation √(np(1−p)). Slide p away from 0.5 to see it skew; turn on the normal overlay to see when the approximation is safe (the rule of thumb: np ≥ 10 and n(1−p) ≥ 10).

Trials n 20

Success prob. p 0.50

10.0Mean np

2.24SD √(np(1−p))

✓ OKNormal cond.

Unit 4 · Probability

The Law of Large Numbers

Probability is a long-run idea, not a short-run one. Flip a coin a few times and the proportion of heads bounces wildly; flip it thousands of times and it settles toward the true probability. This is why "I'm due for a win" is a fallacy — the average converges, but the coin has no memory.

True probability p 0.50

—Current proportion

0Flips

0Heads

Unit 5 · Sampling Distributions

The Central Limit Theorem

The crown jewel of the course. Start with any population shape — even a wildly skewed or bimodal one. Take a sample of size n, record its mean, and repeat thousands of times. The distribution of those sample means (right) becomes approximately normal, centered at the population mean, with standard deviation σ/√n. Larger n → tighter, more normal.

Population

Distribution of sample mean x̄

Population shape

Sample size n 10

—Pop. mean μ

—Mean of x̄'s

—σ/√n (theory)

—Observed SD

0Samples

✅ What to notice

The mean of the sample means always lands near the population mean, regardless of shape. And the observed SD closes in on the theoretical σ/√n — cut n in four and the spread halves.

Units 6–7 · Inference

What "95% Confidence" Really Means

A confidence level describes the method, not any single interval. Each horizontal bar below is one sample's interval for the true mean (the gold line). Build hundreds of them: about 95% will capture the truth and about 5% will miss. Raise the confidence level and the intervals get wider — the price of being right more often.

Confidence level

Sample size n 30

0Captured

0Missed

—Capture rate

95%Target

⚠️ The common misread

It is wrong to say "there's a 95% chance the true mean is in this interval." The true mean is fixed; the interval is what's random. 95% refers to how often the procedure works over many samples.

Units 6–7 · Inference

Hypothesis Tests & the p-value

A significance test asks: if the null hypothesis were true, how surprising is my data? The curve below is the distribution of the test statistic assuming H₀ is true. The shaded tail(s) are the p-value — the chance of a result at least this extreme. Slide the observed statistic out toward the tail and watch the p-value shrink past your significance level α.

Observed statistic z 1.80

Alternative Hₐ

Significance α 0.05

—p-value

0.05α

—Decision

💡 Decision rule

If p ≤ α, the data would be too surprising under H₀, so we reject H₀. If p > α, we fail to reject — never "accept" — H₀. The p-value is not the probability that H₀ is true.

Interactive

Simulated

Concept-First

The Normal Curve & the Empirical Rule

Mean vs. Median: Who Gets Pulled?

Correlation & the Least-Squares Line

The Binomial Distribution

The Law of Large Numbers

The Central Limit Theorem

What "95% Confidence" Really Means

Hypothesis Tests & the p-value