Learn Without Walls
← Back to Module 2

Module 2 Practice Problems

Practice Makes Perfect!

Apply what you've learned with 20 comprehensive practice problems covering all Module 2 topics. Each problem includes a hint and detailed solution.

Part 1: Measures of Center (Problems 1-5)

1 Calculate the Mean Easy

Question: Find the mean of the following dataset:

Ages: 18, 22, 25, 28, 32, 35

Hint: Mean = (Sum of all values) ÷ (Number of values)

Solution:

Step 1: Add all values
18 + 22 + 25 + 28 + 32 + 35 = 160

Step 2: Count how many values
There are 6 ages

Step 3: Divide sum by count
Mean = 160 ÷ 6 = 26.67 years

Answer: 26.67 years

2 Find the Median (Even Dataset) Easy

Question: Find the median of the following test scores:

Scores: 68, 72, 75, 80, 85, 90

Hint: With an even number of values, the median is the average of the two middle values.

Solution:

Step 1: Data is already ordered: 68, 72, 75, 80, 85, 90

Step 2: Count values: 6 (even number)

Step 3: Find the two middle values
With 6 values, the 3rd and 4th values are in the middle: 75 and 80

Step 4: Average them
Median = (75 + 80) ÷ 2 = 77.5

Answer: 77.5

3 Identify the Mode Easy

Question: Find the mode of the following dataset:

Shoe sizes: 7, 8, 8, 9, 9, 9, 10, 10, 11

Hint: The mode is the value that appears most frequently.

Solution:

Count frequencies:

  • 7 appears 1 time
  • 8 appears 2 times
  • 9 appears 3 times ← Most frequent!
  • 10 appears 2 times
  • 11 appears 1 time

Answer: Mode = 9 (appears most frequently)

4 Mean vs. Median with Outlier Medium

Question: Calculate both mean and median for this dataset, then explain which better represents a "typical" value:

Salaries ($1000s): 45, 48, 50, 52, 55, 250

Hint: Look for outliers! Which measure is resistant to outliers?

Solution:

Mean:
Sum = 45 + 48 + 50 + 52 + 55 + 250 = 500
Mean = 500 ÷ 6 = $83.33k

Median:
Ordered: 45, 48, 50, 52, 55, 250
6 values (even), so average the 3rd and 4th values
Median = (50 + 52) ÷ 2 = $51k

Which is better?
Median ($51k) better represents a typical salary. Five out of six employees earn $45-55k. The $250k outlier (possibly CEO or owner) pulls the mean up to $83.33k, which doesn't represent most employees.

Answer: Mean = $83.33k, Median = $51k; Median is more representative

5 Choosing the Appropriate Measure Medium

Question: For each scenario, state which measure of center (mean, median, or mode) would be most appropriate and why:

a) Average height of all students in a class

b) Typical home price in a neighborhood with some mansions

c) Most popular pizza topping from a survey

Hint: Consider data type and presence of outliers!

Solution:

a) Mean for average height
Why: Heights are typically symmetric with no extreme outliers. Mean uses all data and is appropriate.

b) Median for home prices
Why: Home prices are right-skewed with outliers (mansions). Median is resistant to these high outliers and better represents a "typical" home.

c) Mode for pizza toppings
Why: This is categorical data (pepperoni, mushroom, etc.). You can't calculate mean or median of text! Mode shows the most popular choice.

Part 2: Measures of Spread (Problems 6-10)

6 Calculate Range and IQR Easy

Question: Find the range and IQR for this dataset:

Quiz scores: 6, 7, 8, 9, 10, 11, 12, 14, 15

Hint: Range = Max - Min; IQR = Q3 - Q1

Solution:

Range:
Max = 15, Min = 6
Range = 15 - 6 = 9

IQR:
Step 1: Find median (Q2) = 10 (5th value of 9)
Step 2: Lower half: 6, 7, 8, 9 → Q1 = (7+8)/2 = 7.5
Step 3: Upper half: 11, 12, 14, 15 → Q3 = (12+14)/2 = 13
Step 4: IQR = Q3 - Q1 = 13 - 7.5 = 5.5

Answer: Range = 9, IQR = 5.5

7 Calculate Variance Hard

Question: Calculate the variance for this dataset:

Data: 2, 4, 6, 8, 10

Hint: s² = Σ(x - x̄)² / (n - 1). First find the mean!

Solution:

Step 1: Calculate mean
x̄ = (2+4+6+8+10) / 5 = 30/5 = 6

Step 2: Create deviation table

xx - x̄(x - x̄)²
22-6 = -416
44-6 = -24
66-6 = 00
88-6 = 24
1010-6 = 416

Step 3: Sum squared deviations
Σ(x - x̄)² = 16 + 4 + 0 + 4 + 16 = 40

Step 4: Divide by (n-1)
s² = 40 / (5-1) = 40/4 = 10

Answer: Variance = 10

8 Calculate Standard Deviation Medium

Question: If a dataset has variance = 25, what is the standard deviation? Interpret what this means.

Hint: SD = √(variance)

Solution:

Calculation:
Standard Deviation = √(Variance) = √25 = 5

Interpretation:
The standard deviation of 5 means that values in the dataset typically vary by about 5 units from the mean. This is in the original units (not squared like variance), making it more interpretable.

Answer: SD = 5

9 Apply the Empirical Rule Medium

Question: Heights are bell-shaped with mean = 65 inches and SD = 3 inches. Using the Empirical Rule:

a) What percentage of people have heights between 62 and 68 inches?

b) What percentage have heights between 59 and 71 inches?

Hint: 68% within ±1 SD, 95% within ±2 SD, 99.7% within ±3 SD

Solution:

a) Between 62 and 68 inches:
62 = 65 - 3 (mean - 1 SD)
68 = 65 + 3 (mean + 1 SD)
This is within ±1 SD → ~68% of people

b) Between 59 and 71 inches:
59 = 65 - 6 (mean - 2 SD)
71 = 65 + 6 (mean + 2 SD)
This is within ±2 SD → ~95% of people

Answers: a) 68%, b) 95%

10 Comparing Variability Medium

Question: Two classes took the same test:

Class A: Mean = 75, SD = 12

Class B: Mean = 75, SD = 4

Which class has more consistent scores? Explain your reasoning.

Hint: What does standard deviation measure?

Solution:

Class B is more consistent.

Reasoning:
Both classes have the same mean (75), but Class B has a much smaller standard deviation (4 vs. 12). A smaller SD means scores are clustered more tightly around the mean.

What this means:

  • Class B (SD=4): Most scores are within 71-79 (very consistent performance)
  • Class A (SD=12): Scores vary widely, from ~63 to ~87 (inconsistent performance)

Answer: Class B is more consistent (smaller SD = less variability)

Part 3: Distribution Shapes (Problems 11-15)

11 Identify Distribution Shape Easy

Question: A dataset has Mean = 50, Median = 55, Mode = 60. What is the shape of this distribution?

Hint: Compare the order: Mean < Median < Mode suggests which tail is longer?

Solution:

Analysis:
Mean (50) < Median (55) < Mode (60)

Shape: Left-skewed

Explanation:
When Mean < Median < Mode, the distribution has a long tail extending to the left (low values). The mean is pulled down by low outliers, while the mode (most common value) is on the right side.

Answer: Left-skewed distribution

12 Right-Skewed Real-World Example Medium

Question: Household incomes have Mean = $85,000 and Median = $58,000.

a) What is the shape of this distribution?

b) Which measure better represents a "typical" household income?

c) Explain why income distributions have this shape.

Hint: When Mean > Median, which direction is the tail?

Solution:

a) Right-skewed
Mean ($85k) > Median ($58k) indicates a right-skewed distribution.

b) Median ($58,000) is more representative
The median better represents a typical household because it's resistant to high-income outliers (millionaires, billionaires) that pull the mean up.

c) Why income is right-skewed:

  • Most people earn moderate incomes ($40k-$80k)
  • A small percentage earn very high incomes ($500k+)
  • Income has a natural floor ($0) but no ceiling
  • This creates a long right tail of high earners

This is why news reports use "median household income" rather than mean!

13 Symmetric Distribution Properties Easy

Question: A bell-shaped distribution has Mean = 100.

What can you conclude about the median and mode?

Hint: In symmetric distributions, what's the relationship between mean, median, and mode?

Solution:

Conclusion:
In a bell-shaped (symmetric) distribution:
Mean ≈ Median ≈ Mode

Therefore:
Median ≈ 100
Mode ≈ 100

Reasoning:
Bell-shaped distributions are symmetric, meaning they're balanced on both sides of the center. All three measures of center converge at the same point.

Answer: Median and Mode both ≈ 100

14 Outlier Impact Analysis Hard

Question: A dataset of commute times (minutes) is:

Original: 15, 18, 20, 22, 25, 28, 30

One person had an unusual 120-minute commute added to the data.

a) Calculate mean and median for BOTH datasets (with and without outlier)

b) Which measure changed more dramatically?

Hint: Which measure is resistant to outliers?

Solution:

Without outlier (Original):
Mean = (15+18+20+22+25+28+30)/7 = 158/7 = 22.57 minutes
Median = 22 (middle of 7 values)

With outlier (120 added):
Mean = (15+18+20+22+25+28+30+120)/8 = 278/8 = 34.75 minutes
Median = (22+25)/2 = 23.5 minutes

Changes:

  • Mean: 22.57 → 34.75 (+12.18 minutes, 54% increase!)
  • Median: 22 → 23.5 (+1.5 minutes, 7% increase)

Answer: The mean changed much more dramatically (54% vs 7%). This demonstrates that the median is resistant to outliers while the mean is sensitive to outliers.

15 Real-World Distribution Matching Medium

Question: Match each real-world scenario to its most likely distribution shape:

Scenarios:

A) Ages of people in a kindergarten class
B) Prices of homes sold in your city last year
C) Heights of adult men

Shapes: Symmetric, Right-skewed, Left-skewed

Hint: Think about whether there are outliers and which direction they'd be in!

Solution:

A) Kindergarten ages: Left-skewed
Most kids are 5-6 years old, with a few younger kids who started early creating a left tail.

B) Home prices: Right-skewed
Most homes are in an affordable range ($200k-$500k), but a few expensive mansions ($2M+) create a long right tail.

C) Heights of adult men: Symmetric
Heights follow a bell curve with most men near average (5'9"), fewer very short or very tall, creating a symmetric distribution.

Part 4: Five-Number Summary & Boxplots (Problems 16-20)

16 Calculate Five-Number Summary Medium

Question: Find the five-number summary for this dataset:

Data: 12, 15, 18, 20, 22, 25, 28, 30, 35

Hint: Five numbers: Min, Q1, Median, Q3, Max

Solution:

Data (already ordered): 12, 15, 18, 20, 22, 25, 28, 30, 35

Step 1: Min and Max
Min = 12, Max = 35

Step 2: Median (Q2)
9 values, so median is the 5th value = 22

Step 3: Q1 (median of lower half)
Lower half: 12, 15, 18, 20
Q1 = (15 + 18)/2 = 16.5

Step 4: Q3 (median of upper half)
Upper half: 25, 28, 30, 35
Q3 = (28 + 30)/2 = 29

Five-Number Summary:
Min = 12, Q1 = 16.5, Median = 22, Q3 = 29, Max = 35

17 Identify Outliers (1.5×IQR Rule) Hard

Question: Using the five-number summary from Problem 16, identify any outliers using the 1.5×IQR rule.

Hint: Lower fence = Q1 - 1.5×IQR; Upper fence = Q3 + 1.5×IQR

Solution:

From Problem 16: Q1 = 16.5, Q3 = 29

Step 1: Calculate IQR
IQR = Q3 - Q1 = 29 - 16.5 = 12.5

Step 2: Calculate fences
Lower Fence = Q1 - (1.5 × IQR) = 16.5 - (1.5 × 12.5) = 16.5 - 18.75 = -2.25
Upper Fence = Q3 + (1.5 × IQR) = 29 + (1.5 × 12.5) = 29 + 18.75 = 47.75

Step 3: Check for outliers
Data: 12, 15, 18, 20, 22, 25, 28, 30, 35
Is any value < -2.25? No
Is any value > 47.75? No

Answer: No outliers (all values fall within the fences)

18 Interpret a Boxplot Medium

Question: A boxplot shows:

Min = 20, Q1 = 40, Median = 45, Q3 = 80, Max = 120

a) What is the IQR?

b) What percentage of data falls between 40 and 80?

c) Describe the shape of this distribution.

Hint: The box contains the middle 50%. Where is the median within the box?

Solution:

a) IQR:
IQR = Q3 - Q1 = 80 - 40 = 40

b) Percentage between 40 and 80:
Q1 (40) is the 25th percentile, Q3 (80) is the 75th percentile.
75% - 25% = 50% of data falls between Q1 and Q3

c) Shape analysis:

  • Median (45) is closer to Q1 (40) than to Q3 (80)
  • Distance Q1 to Median = 5
  • Distance Median to Q3 = 35 (much larger!)
  • Upper whisker (80 to 120) is longer than lower whisker (20 to 40)

Shape: Right-skewed (median near Q1, longer upper whisker)

Answers: a) IQR = 40, b) 50%, c) Right-skewed

19 Compare Two Groups with Boxplots Hard

Question: Compare test scores for two classes:

Class A: Min=60, Q1=70, Median=80, Q3=88, Max=95

Class B: Min=50, Q1=65, Median=75, Q3=85, Max=100

a) Which class has a higher typical score?

b) Which class has more consistent scores?

c) Which class has more variability overall?

Hint: Compare medians, IQRs, and ranges!

Solution:

a) Higher typical score: Class A
Class A median = 80 > Class B median = 75
The median represents the typical score, so Class A performed better.

b) More consistent scores: Class A
Class A IQR = 88 - 70 = 18
Class B IQR = 85 - 65 = 20
Smaller IQR = more consistent (middle 50% more tightly clustered)

c) More variability overall: Class B
Class A Range = 95 - 60 = 35
Class B Range = 100 - 50 = 50
Class B has a wider range (more spread from lowest to highest)

Summary: Class A performed better on average with more consistent scores, while Class B had more variability (both higher highs and lower lows).

20 Boxplot Creation Challenge Hard

Question: You're given this dataset:

Data: 5, 8, 12, 15, 18, 22, 25, 30, 35, 40, 95

a) Calculate the five-number summary

b) Are there any outliers? (Use 1.5×IQR rule)

c) Describe what the boxplot would look like (where would the median be in the box? Length of whiskers?)

Hint: Work through each step methodically!

Solution:

a) Five-Number Summary:

Min = 5, Max = 95
Median (6th of 11 values) = 22
Q1 (median of 5, 8, 12, 15, 18) = 12
Q3 (median of 25, 30, 35, 40, 95) = 35

Five-number summary: 5, 12, 22, 35, 95

b) Outlier Detection:
IQR = 35 - 12 = 23
Lower Fence = 12 - (1.5 × 23) = 12 - 34.5 = -22.5
Upper Fence = 35 + (1.5 × 23) = 35 + 34.5 = 69.5

Is 95 > 69.5? YES! 95 is an outlier

c) Boxplot Description:

  • Box: From Q1 (12) to Q3 (35)
  • Median line: At 22, slightly right of center in the box
  • Lower whisker: From 12 down to 5 (length = 7)
  • Upper whisker: From 35 to 40 (NOT to 95! Stops at last non-outlier)
  • Outlier point: A dot at 95, beyond the upper whisker
  • Shape: Appears slightly right-skewed due to the outlier

Answers: a) (5, 12, 22, 35, 95), b) Yes, 95 is an outlier, c) Box from 12-35 with median at 22, upper whisker to 40, outlier dot at 95

Practice Complete!

Great work! You've practiced all the key concepts from Module 2. Ready to test your mastery?