Study Guide – Module 9

Overview of Two-Sample Tests

This module covers hypothesis tests for comparing TWO populations. The key is identifying:

Are you testing means or proportions?
Are the samples independent or paired?

Test	When to Use	Test Statistic	Distribution
Independent Two-Sample t-Test	Compare means from two separate groups	t	t-distribution
Paired t-Test	Compare means for same subjects (or matched pairs)	t	t-distribution
Two-Proportion z-Test	Compare proportions from two groups	z	Normal (z)

1 Independent Two-Sample t-Test

Purpose

Compare means from two independent populations (μ₁ vs. μ₂)

Hypotheses

Two-tailed: H₀: μ₁ = μ₂, Hₐ: μ₁ ≠ μ₂
Right-tailed: H₀: μ₁ = μ₂, Hₐ: μ₁ > μ₂
Left-tailed: H₀: μ₁ = μ₂, Hₐ: μ₁ < μ₂

Conditions

Independent random samples
Both populations normally distributed OR both n ≥ 30
Independence within each sample

Test Statistic (Unpooled - Welch's t-test)

t = (x̄₁ - x̄₂) / √(s₁²/n₁ + s₂²/n₂)

Degrees of freedom: Use Welch's approximation (use technology)

Test Statistic (Pooled Variance)

Pooled variance:

sp² = [(n₁ - 1)s₁² + (n₂ - 1)s₂²] / (n₁ + n₂ - 2)

Test statistic:

t = (x̄₁ - x̄₂) / (sp√(1/n₁ + 1/n₂))

Degrees of freedom: df = n₁ + n₂ - 2

When to Use Which?

Unpooled (Welch's): DEFAULT. Use when variances may be unequal or uncertain.
Pooled: Only when you have strong evidence that σ₁² = σ₂²

Confidence Interval for μ₁ - μ₂ (Unpooled)

(x̄₁ - x̄₂) ± t* × √(s₁²/n₁ + s₂²/n₂)

2 Paired t-Test

Purpose

Compare means when same subjects are measured twice OR matched pairs are used

Key Concept

A paired t-test is a one-sample t-test on the DIFFERENCES!
Calculate d = x₁ - x₂ for each pair, then test if mean difference = 0

When to Use

Before/after measurements on same subjects
Matched pairs (e.g., twins, siblings)
Same subject, two conditions (left vs. right hand)

Hypotheses

Two-tailed: H₀: μd = 0, Hₐ: μd ≠ 0
Right-tailed: H₀: μd = 0, Hₐ: μd > 0
Left-tailed: H₀: μd = 0, Hₐ: μd < 0

Process

Calculate differences: d = x₁ - x₂ (or x₂ - x₁, be consistent)
Find mean of differences: d̄ = Σd / n
Find standard deviation of differences: sd
Calculate test statistic

Test Statistic

t = (d̄ - μd₀) / (sd / √n)

where μd₀ = 0 (usually)
Degrees of freedom: df = n - 1 (n = number of PAIRS)

Conditions

Random sampling of pairs
Pairs are independent of each other
Differences are approximately normal OR n ≥ 30

Confidence Interval for μd

d̄ ± t* × (sd / √n)

Advantages of Paired Design

Controls for individual variability
More powerful (better at detecting real effects)
Requires fewer total subjects for same power

3 Two-Proportion z-Test

Purpose

Compare proportions from two independent populations (p₁ vs. p₂)

Hypotheses

Two-tailed: H₀: p₁ = p₂, Hₐ: p₁ ≠ p₂
Right-tailed: H₀: p₁ = p₂, Hₐ: p₁ > p₂
Left-tailed: H₀: p₁ = p₂, Hₐ: p₁ < p₂

Conditions

Independent random samples
Success-failure condition for BOTH samples:
- n₁p̂₁ ≥ 10 and n₁(1 - p̂₁) ≥ 10
- n₂p̂₂ ≥ 10 and n₂(1 - p̂₂) ≥ 10

Pooled Proportion (for Hypothesis Testing)

p̄ = (x₁ + x₂) / (n₁ + n₂)

or equivalently: p̄ = (n₁p̂₁ + n₂p̂₂) / (n₁ + n₂)

Test Statistic

z = (p̂₁ - p̂₂) / √[p̄(1 - p̄)(1/n₁ + 1/n₂)]

Uses standard normal (z) distribution

Confidence Interval for p₁ - p₂ (NO pooling!)

(p̂₁ - p̂₂) ± z* × √[(p̂₁(1-p̂₁)/n₁) + (p̂₂(1-p̂₂)/n₂)]

Important Note

For hypothesis testing, use pooled proportion p̄
For confidence intervals, DO NOT pool - use individual p̂₁ and p̂₂

Decision Flowchart: Choosing the Right Test

Start Here ↓

QUESTION 1: Are you testing MEANS or PROPORTIONS?

If Testing MEANS
One sample?	One-sample t-test (σ unknown) or z-test (σ known)
Two independent samples?	Independent two-sample t-test
Same subjects twice or matched pairs?	Paired t-test

If Testing PROPORTIONS
One sample?	One-sample z-test for proportion
Two samples?	Two-proportion z-test

Common Mistakes to Avoid

Mistake	Correct Approach
Using independent test for paired data	If same subjects measured twice → MUST use paired test
Confusing n for paired tests	n = number of PAIRS (not total observations) df = n - 1
Using pooled p̄ in confidence intervals	Pooling ONLY for hypothesis tests, NOT for CIs
Forgetting to check conditions	Always verify sample size, normality, independence
Not specifying direction of difference	In paired tests, clearly state if d = Before - After or After - Before

Quick Reference Table

Test	Hypotheses	Test Statistic	df
Independent t-test (unpooled)	H₀: μ₁ = μ₂ Hₐ: μ₁ ≠ μ₂ (or <, >)	t = (x̄₁ - x̄₂) / √(s₁²/n₁ + s₂²/n₂)	Welch's formula (use technology)
Independent t-test (pooled)	H₀: μ₁ = μ₂ Hₐ: μ₁ ≠ μ₂ (or <, >)	t = (x̄₁ - x̄₂) / (sp√(1/n₁ + 1/n₂))	n₁ + n₂ - 2
Paired t-test	H₀: μd = 0 Hₐ: μd ≠ 0 (or <, >)	t = (d̄ - 0) / (sd / √n)	n - 1
Two-proportion z-test	H₀: p₁ = p₂ Hₐ: p₁ ≠ p₂ (or <, >)	z = (p̂₁ - p̂₂) / √[p̄(1-p̄)(1/n₁ + 1/n₂)]	N/A (uses z)

Tips for Success

Identify the design first: Independent or paired? This is the most critical decision.
Check ALL conditions: Don't skip this step! Violating conditions invalidates results.
Use technology for df: Welch's approximation is complex - use calculators or software.
Interpret in context: Always relate statistical conclusions to the research question.
Remember the difference: Hypothesis tests pool proportions; confidence intervals don't.
Practice decision-making: Work through many scenarios until test selection is automatic.
Draw diagrams: Visualize the two distributions or before/after measurements.

Key Formulas Summary

Independent Two-Sample t-Test (Unpooled)

t = (x̄₁ - x̄₂) / √(s₁²/n₁ + s₂²/n₂)

Pooled Variance

sp² = [(n₁-1)s₁² + (n₂-1)s₂²] / (n₁+n₂-2)

Independent Two-Sample t-Test (Pooled)

t = (x̄₁ - x̄₂) / (sp√(1/n₁ + 1/n₂))

Paired t-Test

t = (d̄ - 0) / (sd / √n)

Pooled Proportion

p̄ = (x₁ + x₂) / (n₁ + n₂)

Two-Proportion z-Test

z = (p̂₁ - p̂₂) / √[p̄(1-p̄)(1/n₁ + 1/n₂)]

Module 9: Study Guide

Module 9 Study Guide

Hypothesis Testing for Two Populations

Overview of Two-Sample Tests

1 Independent Two-Sample t-Test

Purpose

Hypotheses

Conditions

Test Statistic (Unpooled - Welch's t-test)

Test Statistic (Pooled Variance)

When to Use Which?

Confidence Interval for μ₁ - μ₂ (Unpooled)

2 Paired t-Test

Purpose

Key Concept

When to Use

Hypotheses

Process

Test Statistic

Conditions

Confidence Interval for μd

Advantages of Paired Design

3 Two-Proportion z-Test

Purpose

Hypotheses

Conditions

Pooled Proportion (for Hypothesis Testing)

Test Statistic

Confidence Interval for p₁ - p₂ (NO pooling!)

Important Note

Decision Flowchart: Choosing the Right Test

Start Here ↓

Common Mistakes to Avoid

Quick Reference Table

Tips for Success

Key Formulas Summary

Independent Two-Sample t-Test (Unpooled)

Pooled Variance

Independent Two-Sample t-Test (Pooled)

Paired t-Test

Pooled Proportion

Two-Proportion z-Test