Descriptive Statistics Quick Reference
Module 2: Measures of Center, Spread, Shape & Boxplots
Measures of Center
Sum of all values ÷ number of values
If even n: average of two middle values
| Use... | When... |
|---|---|
| Mean | Symmetric, no outliers |
| Median | Skewed or has outliers |
| Mode | Categorical data |
Measures of Spread
Spread of middle 50% of data
Same units as data; typical distance from mean
| Resistant to Outliers? | Yes | No |
|---|---|---|
| Center | Median, Mode | Mean |
| Spread | IQR | Range, SD |
Empirical Rule (68-95-99.7)
For BELL-SHAPED distributions only!
Example: Mean = 100, SD = 15
• 68% between 85–115
• 95% between 70–130
• 99.7% between 55–145
Five-Number Summary
1. Min • 2. Q1 (25th percentile) • 3. Median (Q2, 50th) • 4. Q3 (75th) • 5. Max
- Order data, find median (Q2)
- Q1 = median of lower half
- Q3 = median of upper half
Distribution Shapes
Symmetric
Mean ≈ Median ≈ Mode
Mirror image sides
Ex: Heights, test scores
Right-Skewed (Positive)
Mode < Median < Mean
Long tail to right
Ex: Income, home prices
Left-Skewed (Negative)
Mean < Median < Mode
Long tail to left
Ex: Age at death
Outlier Detection (1.5×IQR Rule)
Any value < Lower Fence OR > Upper Fence = OUTLIER
Example: Q1 = 20, Q3 = 40, IQR = 20
Lower: 20 − 30 = −10
Upper: 40 + 30 = 70
Outliers: < −10 or > 70
Boxplot Components
- Box: Q1 to Q3 (middle 50%)
- Line in box: Median (Q2)
- Whiskers: Extend to min/max within 1.5×IQR
- Dots: Outliers beyond whiskers
| Shape | Boxplot Appearance |
|---|---|
| Symmetric | Median centered, equal whiskers |
| Right-skewed | Median left, longer right whisker |
| Left-skewed | Median right, longer left whisker |
Comparing Groups
Side-by-side boxplots compare:
- Center: Which median is higher?
- Spread: Which has wider box/whiskers?
- Shape: Symmetric vs. skewed?
- Outliers: Which has unusual values?
Decision Tree: Choosing Statistics
Use MEAN & SD when:
- Distribution is symmetric
- No extreme outliers
- Data is bell-shaped
Use MEDIAN & IQR when:
- Distribution is skewed
- Outliers present
- Data like income/prices
Key Reminders
- Variance in squared units; SD in original units
- IQR = middle 50% of data
- Empirical Rule for bell-shaped ONLY
- Mean sensitive; median resistant
- Skew direction = where tail points
- Q1 at 25%, Q2 at 50%, Q3 at 75%
- Boxplot shows 5-number summary
- 1.5×IQR rule detects outliers