Home About AP Statistics 📈 Visual Statistics 🧮 Calculator

Visual Statistics

The same AP Statistics ideas — but seen, not just read. Move the sliders, run the simulations, and watch the theorems come alive in real time.

Statistics is a visual subject hiding behind formulas. Each interactive below targets a concept students find slippery — the Central Limit Theorem, confidence intervals, p-values, correlation — and turns it into something you can poke at. Everything runs live in your browser.

🎚️

Interactive

Drag sliders and watch distributions, lines, and intervals respond instantly.

🎲

Simulated

Thousands of random trials run on demand — see sampling behavior emerge from chaos.

🧠

Concept-First

Each demo pairs the picture with the key idea it is meant to make obvious.

The Normal Curve & the Empirical Rule

Almost every quantitative variable that piles up in the middle and tapers at the edges is well-approximated by a normal model. The famous 68–95–99.7 rule says that for any normal distribution, those percentages of data fall within 1, 2, and 3 standard deviations of the mean. Change the spread (σ) — the curve gets wider or narrower, but the percentages never move.

68.3%Band area
50.0%P(Z < z)
50.0%P(Z > z)
💡 The idea
Mean μ = 100 here. Spread changes the shape width, but a value's z-score — how many σ it sits from the mean — is what determines its percentile. That is why every normal problem reduces to the single standard normal curve.

Mean vs. Median: Who Gets Pulled?

A classic exam trap: which is bigger, the mean or the median? The answer lives in the shape. The median sits at the 50% mark and barely flinches, but the mean is a balance point that the long tail drags toward it. Slide from a left skew to a right skew and watch the two lines separate.

Mean (•)
Median (┃)
SymmetricShape
🎯 Exam shortcut
The mean chases the tail. Right-skewed → mean > median. Left-skewed → mean < median. Roughly symmetric → they sit on top of each other.

Correlation & the Least-Squares Line

Click anywhere in the plot to drop a point. The correlation r and the least-squares regression line ŷ = a + bx recompute instantly. Notice how a single far-away point (an influential outlier) can swing the line — and how r only measures the strength of a linear pattern.

Correlation r
LSRL ŷ = a + bx
0Points
⚠️ Watch out
A strong r does not mean a line is the right model, and it never proves causation. Always picture the scatter before trusting the number.

The Binomial Distribution

Count the successes in n independent trials, each with success probability p. The result is a binomial distribution with mean np and standard deviation √(np(1−p)). Slide p away from 0.5 to see it skew; turn on the normal overlay to see when the approximation is safe (the rule of thumb: np ≥ 10 and n(1−p) ≥ 10).

10.0Mean np
2.24SD √(np(1−p))
✓ OKNormal cond.

The Law of Large Numbers

Probability is a long-run idea, not a short-run one. Flip a coin a few times and the proportion of heads bounces wildly; flip it thousands of times and it settles toward the true probability. This is why "I'm due for a win" is a fallacy — the average converges, but the coin has no memory.

Current proportion
0Flips
0Heads

The Central Limit Theorem

The crown jewel of the course. Start with any population shape — even a wildly skewed or bimodal one. Take a sample of size n, record its mean, and repeat thousands of times. The distribution of those sample means (right) becomes approximately normal, centered at the population mean, with standard deviation σ/√n. Larger n → tighter, more normal.

Population
Distribution of sample mean x̄
Pop. mean μ
Mean of x̄'s
σ/√n (theory)
Observed SD
0Samples
✅ What to notice
The mean of the sample means always lands near the population mean, regardless of shape. And the observed SD closes in on the theoretical σ/√n — cut n in four and the spread halves.

What "95% Confidence" Really Means

A confidence level describes the method, not any single interval. Each horizontal bar below is one sample's interval for the true mean (the gold line). Build hundreds of them: about 95% will capture the truth and about 5% will miss. Raise the confidence level and the intervals get wider — the price of being right more often.

0Captured
0Missed
Capture rate
95%Target
⚠️ The common misread
It is wrong to say "there's a 95% chance the true mean is in this interval." The true mean is fixed; the interval is what's random. 95% refers to how often the procedure works over many samples.

Hypothesis Tests & the p-value

A significance test asks: if the null hypothesis were true, how surprising is my data? The curve below is the distribution of the test statistic assuming H₀ is true. The shaded tail(s) are the p-value — the chance of a result at least this extreme. Slide the observed statistic out toward the tail and watch the p-value shrink past your significance level α.

p-value
0.05α
Decision
💡 Decision rule
If p ≤ α, the data would be too surprising under H₀, so we reject H₀. If p > α, we fail to reject — never "accept" — H₀. The p-value is not the probability that H₀ is true.