Unit 6: Inference for Proportions

Section 6.1

What Is Statistical Inference?

Statistical inference uses sample data to draw conclusions about an unknown population parameter. There are two main tools:

🔑 The Big Picture of Inference

Both procedures require the same three conditions to be verified first:

1. Random: Data came from a random sample (or randomized experiment).

2. Normal: The sampling distribution of \(\hat{p}\) is approximately Normal. Check: \(np \geq 10\) AND \(n(1-p) \geq 10\).

3. Independent: Individual observations are independent. Check: \(n \leq 10\%\) of the population (10% condition).

Section 6.2

Confidence Intervals for a Population Proportion \(p\)

A confidence interval gives a range of plausible values for the unknown population proportion \(p\), based on what we observed in our sample \(\hat{p}\).

One-Sample z-Interval for a Proportion

\[ \hat{p} \pm z^* \sqrt{\frac{\hat{p}(1-\hat{p})}{n}} \]

\(\hat{p}\) = sample proportion | \(n\) = sample size
\(z^*\) = critical value (depends on confidence level)
\(\sqrt{\hat{p}(1-\hat{p})/n}\) = estimated standard error of \(\hat{p}\)

📌 Example: Constructing a CI

In a random sample of 120 voters, 78 say they support a local measure. Construct a 95% confidence interval for the true proportion of all voters who support it.

Step 1 — Check conditions:

Random ✓ | \(n\hat{p} = 78 \geq 10\) ✓ | \(n(1-\hat{p}) = 42 \geq 10\) ✓ | Independent (120 < 10% of all voters) ✓

Step 2 — Calculate: \(\hat{p} = 78/120 = 0.65\)

\(SE = \sqrt{\frac{0.65(0.35)}{120}} = \sqrt{0.001896} \approx 0.04354\)

\(CI = 0.65 \pm 1.960(0.04354) = 0.65 \pm 0.0854\)

Interval: (0.565, 0.735)

✓ Always state conditions, show the formula, and write the final interval as (lower, upper).

Section 6.3

Interpreting Confidence Intervals

⚠️ Most Common AP Mistake: Wrong Interpretation

WRONG: "There is a 95% probability that the true proportion is between 0.565 and 0.735."

CORRECT: "We are 95% confident that the true proportion of all voters who support the measure is between 0.565 and 0.735."

📐 The Correct Interpretation Template

"We are [C%] confident that the true [parameter in context] is between [lower bound] and [upper bound]."

💡 Effect of Confidence Level and Sample Size

Increase confidence level (e.g., 90% → 99%): Larger z*, wider interval — less precise but more confident.

Increase sample size n: Smaller SE, narrower interval — more precise.

Margin of error formula: \(ME = z^* \sqrt{\hat{p}(1-\hat{p})/n}\). To find required sample size: \(n = \left(\frac{z^*}{ME}\right)^2 \hat{p}(1-\hat{p})\). If no estimate of \(\hat{p}\) is available, use \(\hat{p} = 0.5\) (gives largest/most conservative n).

Section 6.4

Hypothesis Tests for a Population Proportion

A hypothesis test evaluates evidence against a specific claim about \(p\). The 4-step AP procedure must be memorized and applied in order every time.

Step 1: State the Hypotheses

📐 Hypotheses for a Proportion Test

Null Hypothesis \(H_0\): \(p = p_0\) (the "nothing is happening" claim — always has an equals sign)

Alternative Hypothesis \(H_a\): Choose based on the research question:

\(H_a: p > p_0\) (one-sided, right — "greater than")

\(H_a: p < p_0\) (one-sided, left — "less than")

\(H_a: p \neq p_0\) (two-sided — "different from")

⚠️ H₀ Always Uses =

The null hypothesis always contains an equality (=). Never write H₀: p > p₀ or H₀: p ≠ p₀. Also, hypotheses are always about parameters (\(p\)), never about statistics (\(\hat{p}\)).

Step 3: The Test Statistic

One-Sample z-Test Statistic for a Proportion

\[ z = \frac{\hat{p} - p_0}{\sqrt{\dfrac{p_0(1-p_0)}{n}}} \]

\(p_0\) = the hypothesized value from \(H_0\)
Note: Use \(p_0\) (not \(\hat{p}\)) in the standard error — this is different from the CI formula!

⚠️ Key Formula Difference: CI vs Hypothesis Test

Confidence Interval SE: Uses \(\hat{p}\) — because we're estimating \(p\).

Hypothesis Test SE: Uses \(p_0\) — because we're assuming \(H_0\) is true and \(p = p_0\).

Section 6.5

p-values and Drawing Conclusions

🔑 What Is a p-value?

The p-value is the probability of obtaining a test statistic as extreme as (or more extreme than) the observed value, assuming \(H_0\) is true.

A small p-value means: "If \(H_0\) were true, this result would be very unlikely." → Evidence against \(H_0\).

📐 Decision Rule

Compare the p-value to the significance level \(\alpha\) (usually 0.05):

p-value \(\leq \alpha\): Reject \(H_0\). We have convincing evidence for \(H_a\).

p-value \(> \alpha\): Fail to reject \(H_0\). We do NOT have convincing evidence for \(H_a\).

💡 "Fail to Reject" ≠ "Accept H₀"

Never say "we accept \(H_0\)" or "we proved \(H_0\) is true." We simply don't have enough evidence to reject it. The absence of evidence is not evidence of absence.

AP conclusion template:
"Because the p-value [value] is [less than / greater than] α = [value], we [reject / fail to reject] \(H_0\). There [is / is not] convincing evidence that [Hₐ in context]."

📌 Full Worked Example: One-Sample z-Test

A coin is flipped 100 times and lands heads 62 times. Test whether the coin is biased at α = 0.05.

Step 1 — Hypotheses: \(H_0: p = 0.5\) vs \(H_a: p \neq 0.5\) (two-sided — "biased" means either direction)

Step 2 — Conditions: Random (stated) ✓ | \(np_0 = 50 \geq 10\) ✓ | \(n(1-p_0) = 50 \geq 10\) ✓ | Independent ✓

Step 3 — Calculate: \(\hat{p} = 62/100 = 0.62\)

\(z = \frac{0.62 - 0.50}{\sqrt{0.50(0.50)/100}} = \frac{0.12}{0.05} = 2.40\)

Two-sided p-value = \(2 \times P(Z > 2.40) = 2(0.0082) = \mathbf{0.0164}\)

Step 4 — Conclude: Since p-value = 0.0164 < α = 0.05, we reject \(H_0\). There is convincing evidence that the coin is biased (the true proportion of heads differs from 0.5).

Error Type	What Happened	Probability	Example
Type I Error (α)	Rejected H₀ when H₀ was actually TRUE	\(\alpha\) (significance level)	Convicting an innocent person; alarm goes off but there's no fire
Type II Error (β)	Failed to reject H₀ when H₀ was actually FALSE	\(\beta\)	Acquitting a guilty person; no alarm when there IS a fire
Power (1 − β)	Correctly rejected a false H₀	\(1 - \beta\)	Correctly detecting a real effect

Unit 6: Inference for Proportions

What Is Statistical Inference?

Confidence Intervals for a Population Proportion \(p\)

Interpreting Confidence Intervals

Hypothesis Tests for a Population Proportion

Step 1: State the Hypotheses

Step 3: The Test Statistic

p-values and Drawing Conclusions

Type I and Type II Errors

Two-Proportion Procedures

Multiple Choice Questions

Free Response Questions

FRQ 1 — One-Proportion z-Test

✓ Model Solution

FRQ 2 — Two-Proportion z-Test & CI

✓ Model Solution