What Is Statistical Inference?
Statistical inference uses sample data to draw conclusions about an unknown population parameter. There are two main tools:
Both procedures require the same three conditions to be verified first:
1. Random: Data came from a random sample (or randomized experiment).
2. Normal: The sampling distribution of \(\hat{p}\) is approximately Normal. Check: \(np \geq 10\) AND \(n(1-p) \geq 10\).
3. Independent: Individual observations are independent. Check: \(n \leq 10\%\) of the population (10% condition).
Confidence Intervals for a Population Proportion \(p\)
A confidence interval gives a range of plausible values for the unknown population proportion \(p\), based on what we observed in our sample \(\hat{p}\).
\(z^*\) = critical value (depends on confidence level)
\(\sqrt{\hat{p}(1-\hat{p})/n}\) = estimated standard error of \(\hat{p}\)
In a random sample of 120 voters, 78 say they support a local measure. Construct a 95% confidence interval for the true proportion of all voters who support it.
Step 1 — Check conditions:
Random ✓ | \(n\hat{p} = 78 \geq 10\) ✓ | \(n(1-\hat{p}) = 42 \geq 10\) ✓ | Independent (120 < 10% of all voters) ✓
Step 2 — Calculate: \(\hat{p} = 78/120 = 0.65\)
\(SE = \sqrt{\frac{0.65(0.35)}{120}} = \sqrt{0.001896} \approx 0.04354\)
\(CI = 0.65 \pm 1.960(0.04354) = 0.65 \pm 0.0854\)
Interval: (0.565, 0.735)
✓ Always state conditions, show the formula, and write the final interval as (lower, upper).
Interpreting Confidence Intervals
WRONG: "There is a 95% probability that the true proportion is between 0.565 and 0.735."
CORRECT: "We are 95% confident that the true proportion of all voters who support the measure is between 0.565 and 0.735."
"We are [C%] confident that the true [parameter in context] is between [lower bound] and [upper bound]."
Increase confidence level (e.g., 90% → 99%): Larger z*, wider interval — less precise but more confident.
Increase sample size n: Smaller SE, narrower interval — more precise.
Margin of error formula: \(ME = z^* \sqrt{\hat{p}(1-\hat{p})/n}\). To find required sample size: \(n = \left(\frac{z^*}{ME}\right)^2 \hat{p}(1-\hat{p})\). If no estimate of \(\hat{p}\) is available, use \(\hat{p} = 0.5\) (gives largest/most conservative n).
Hypothesis Tests for a Population Proportion
A hypothesis test evaluates evidence against a specific claim about \(p\). The 4-step AP procedure must be memorized and applied in order every time.
Step 1: State the Hypotheses
Null Hypothesis \(H_0\): \(p = p_0\) (the "nothing is happening" claim — always has an equals sign)
Alternative Hypothesis \(H_a\): Choose based on the research question:
\(H_a: p > p_0\) (one-sided, right — "greater than")
\(H_a: p < p_0\) (one-sided, left — "less than")
\(H_a: p \neq p_0\) (two-sided — "different from")
The null hypothesis always contains an equality (=). Never write H₀: p > p₀ or H₀: p ≠ p₀. Also, hypotheses are always about parameters (\(p\)), never about statistics (\(\hat{p}\)).
Step 3: The Test Statistic
Note: Use \(p_0\) (not \(\hat{p}\)) in the standard error — this is different from the CI formula!
Confidence Interval SE: Uses \(\hat{p}\) — because we're estimating \(p\).
Hypothesis Test SE: Uses \(p_0\) — because we're assuming \(H_0\) is true and \(p = p_0\).
p-values and Drawing Conclusions
The p-value is the probability of obtaining a test statistic as extreme as (or more extreme than) the observed value, assuming \(H_0\) is true.
A small p-value means: "If \(H_0\) were true, this result would be very unlikely." → Evidence against \(H_0\).
Compare the p-value to the significance level \(\alpha\) (usually 0.05):
p-value \(\leq \alpha\): Reject \(H_0\). We have convincing evidence for \(H_a\).
p-value \(> \alpha\): Fail to reject \(H_0\). We do NOT have convincing evidence for \(H_a\).
Never say "we accept \(H_0\)" or "we proved \(H_0\) is true." We simply don't have enough evidence to reject it. The absence of evidence is not evidence of absence.
AP conclusion template:
"Because the p-value [value] is [less than / greater than] α = [value], we [reject / fail to reject] \(H_0\). There [is / is not] convincing evidence that [Hₐ in context]."
A coin is flipped 100 times and lands heads 62 times. Test whether the coin is biased at α = 0.05.
Step 1 — Hypotheses: \(H_0: p = 0.5\) vs \(H_a: p \neq 0.5\) (two-sided — "biased" means either direction)
Step 2 — Conditions: Random (stated) ✓ | \(np_0 = 50 \geq 10\) ✓ | \(n(1-p_0) = 50 \geq 10\) ✓ | Independent ✓
Step 3 — Calculate: \(\hat{p} = 62/100 = 0.62\)
\(z = \frac{0.62 - 0.50}{\sqrt{0.50(0.50)/100}} = \frac{0.12}{0.05} = 2.40\)
Two-sided p-value = \(2 \times P(Z > 2.40) = 2(0.0082) = \mathbf{0.0164}\)
Step 4 — Conclude: Since p-value = 0.0164 < α = 0.05, we reject \(H_0\). There is convincing evidence that the coin is biased (the true proportion of heads differs from 0.5).