Module 5 Study Guide

Sampling Distributions

Comprehensive Reference with All Formulas

1. Parameters vs. Statistics

Parameter: A numerical summary of a population. Fixed but usually unknown.
Statistic: A numerical summary of a sample. Calculated from data, varies from sample to sample.

Notation

Measure Population Parameter Sample Statistic
Mean μ (mu) x̄ (x-bar)
Standard Deviation σ (sigma) s
Proportion p p̂ (p-hat)
Variance σ²
Remember: Greek letters (μ, σ, p) = POPULATION parameters
Roman letters (x̄, s, p̂) = SAMPLE statistics

2. Sampling Distributions

Sampling Distribution: The probability distribution of a sample statistic (like x̄ or p̂) based on all possible samples of size n from the population.

Important Distinction

Sampling Variability: Different random samples produce different statistics. This is normal and expected!

3. The Central Limit Theorem (CLT)

Central Limit Theorem: For a population with mean μ and standard deviation σ, the sampling distribution of x̄ from samples of size n will be approximately normal with: This approximation improves as n increases, and is generally good for n ≥ 30.

CLT Conditions

  1. Random sampling: Samples randomly selected from population
  2. Independence: Individual observations independent (10n rule for sampling without replacement)
  3. Sample size: Either n ≥ 30 OR population is already normal
Why CLT is Amazing: Regardless of population shape (skewed, bimodal, uniform), the distribution of sample means will be approximately normal if n is large enough!

Properties of Sampling Distribution of x̄

Shape: Approximately normal (when n ≥ 30 or population is normal)

x̄ ~ N(μ, σ/√n)

Center:

μₓ̄ = μ

Mean of sampling distribution equals population mean

Spread (Standard Error):

σₓ̄ = σ / √n

Standard error decreases as sample size increases

z-score for Sample Mean:

z = (x̄ - μ) / (σ/√n)

4. Standard Error

Standard Error (SE): The standard deviation of a sampling distribution. Measures the typical distance between a sample statistic and the population parameter.

Standard Error vs. Standard Deviation

Aspect Standard Deviation (σ) Standard Error (SE)
What it measures Variability of individuals Variability of sample means
Affected by n? No (population SD is fixed) Yes (SE = σ/√n)
Interpretation How spread out the data is Precision of estimates
Sample Size Effect: To cut SE in half, you must multiply n by 4 (because of √n)

Standard Error Formulas

For Sample Means:

SE = σ / √n

For Sample Proportions:

SE = √(p(1-p) / n)

5. Sampling Distribution of Proportions

Sample Proportion: p̂ = x/n, where x = number of "successes" and n = sample size

Properties of Sampling Distribution of p̂

Center:

μₚ̂ = p

Spread:

σₚ̂ = √(p(1-p) / n)

Shape: Approximately normal when success-failure condition is met

Success-Failure Condition

For normal approximation to be appropriate:
np ≥ 10 AND n(1-p) ≥ 10

z-score for Sample Proportion:

z = (p̂ - p) / √(p(1-p)/n)

SE for proportions is largest when p = 0.5 (maximum variability)

6. Step-by-Step Procedures

Finding Probability about Sample Mean

  1. Check CLT conditions
  2. Find sampling distribution
  3. Calculate z-score
  4. Find probability

Finding Probability about Sample Proportion

  1. Check success-failure condition
  2. Find sampling distribution
  3. Calculate z-score
  4. Find probability

7. Example Problems

Example 1: CLT Application

Problem: A population has μ = 100, σ = 20. For n = 64, find P(x̄ > 105).

Solution:

Answer: 2.28% chance

Example 2: Sample Proportions

Problem: If p = 0.40 and n = 200, find P(p̂ < 0.35).

Solution:

Answer: About 7.35% chance

Example 3: Required Sample Size

Problem: To achieve SE ≤ 2 when σ = 20, what n is needed?

Solution:

Answer: Need at least n = 100

8. Common Mistakes to Avoid

  1. Confusing σ and SE
  2. Forgetting to check CLT conditions
  3. Using wrong formula for SE
  4. Not squaring when solving for n
  5. Thinking larger samples change μₓ̄

9. Quick Decision Guide

Which Distribution Should I Use?

If you're finding probability about... Use...
One individual value (x) Population distribution with σ
Sample mean (x̄) Sampling distribution with SE = σ/√n
Sample proportion (p̂) Sampling distribution with SE = √(p(1-p)/n)

When Can I Use Normal Distribution?

For... Conditions
Individual values Population must be normal
Sample means n ≥ 30 OR population normal
Sample proportions np ≥ 10 AND n(1-p) ≥ 10

10. Formula Sheet

All Key Formulas

Sample Proportion:

p̂ = x / n

Mean of Sampling Distribution:

μₓ̄ = μ

μₚ̂ = p

Standard Error:

SE(x̄) = σ / √n

SE(p̂) = √(p(1-p) / n)

z-scores:

z = (x̄ - μ) / (σ/√n)

z = (p̂ - p) / √(p(1-p)/n)

Sample Size for Desired SE:

n = (σ / SE)²

(For sample means)

11. Study Tips

Ready to Test Your Knowledge?

Use this study guide to prepare, then take the Module 5 quiz!

Take the Quiz Quick Reference Card

Module 5: Sampling Distributions | Safaa Dabagh | Free Statistics Course