Session 6 — Correlation vs. Causation Grade 5 Data Science · Ages 10–11 ← → or Space to navigate · F = fullscreen
1 / 15
Session 6 of 8

Correlation vs.
Causation

Two things can happen together without one causing the other. Today we learn to tell the difference — and why it matters enormously.

🔍 Data Science for Young Minds · Grade 5 — Data Detective
2 / 15
Today's Plan

What We're Doing Today

3 / 15
Opening Hook

Does Ice Cream Cause Drowning?

Every summer, two things increase at exactly the same time:

Does eating ice cream cause people to drown? Vote: Yes / No / Complicated

Both of these statements are true. The connection between them is also real. But does one CAUSE the other? And if not — why do they move together?

4 / 15
Lesson 1

Correlation — Moving Together

Two variables are correlated when they tend to change together — both go up, both go down, or one goes up as the other goes down.

Positive Correlation

  • Both variables increase together
  • Height and shoe size
  • Study hours and test scores
  • Ice cream sales and drowning rates

Negative Correlation

  • One goes up as the other goes down
  • Hours of TV watched and grades
  • Exercise and resting heart rate
  • Smoking and lung capacity

Correlation is a description of a pattern. It does NOT tell you why the pattern exists.

5 / 15
Lesson 2

Causation — Actually Causing It

Causation means one variable directly produces a change in another. Removing or changing the cause changes the effect.

✅ True Causation

  • Smoking → lung cancer (proven by controlled trials)
  • Sunlight → plant growth (proven by experiments)
  • Exercise → stronger muscles
  • Eating food → reduced hunger

❌ Correlation, NOT Causation

  • Ice cream → drowning (confounded by heat)
  • Shoe size → reading level (confounded by age)
  • Nicolas Cage movies → pool drownings
  • TV ownership → longer life expectancy

To prove causation, you need a controlled experiment — change only one variable and keep everything else the same.

6 / 15
Lesson 2

Confounding Variables — The Hidden Cause

A confounding variable is a hidden third factor that causes both correlated variables to change — making them look connected when they aren't.

Ice cream sales ↑   ←   HOT WEATHER   →   Drowning rates ↑

Hot weather causes BOTH — ice cream and drowning are not connected to each other

Detective question: "Is there a third factor that could explain why these two things move together?"

7 / 15
Lesson 3

4 Spurious Correlations

😂 Funny Example 1

Nicolas Cage movies released per year correlates with swimming pool drownings.
Real connection: none — pure coincidence with small samples.

😂 Funny Example 2

Per-capita cheese consumption correlates with deaths by bedsheet tangling.
Real connection: none — both happen to have similar patterns in the same data years.

🔬 Serious Example 3

Countries with more TVs per capita have higher life expectancy.
Real connection: national wealth drives both better healthcare AND more TV ownership.

🔬 Serious Example 4

Children with bigger shoe sizes read at a higher level.
Real connection: age — older children have bigger feet AND have had more reading instruction.

8 / 15
Activity Time!

Investigate the Spurious Correlations

For each of the 4 correlations, answer these questions on your worksheet:

⏱ You have 18 minutes. Then we apply the same thinking to more serious real-world examples.

9 / 15
Debrief

What Did You Find?

"What was the confounding variable in each case? How did identifying it change your understanding of the data?"

10 / 15
Lesson 4

Real Stakes — Why This Matters

Wrong conclusions from correlations lead to real policy decisions that can harm people. This is why the correlation/causation distinction is one of the most important ideas in data science.

11 / 15
Lesson 4

The C-E-R Framework

When you argue from data, use Claim → Evidence → Reasoning.

C

CLAIM — What you think is true

"Ice cream sales do not cause drowning."

E

EVIDENCE — Data that supports it

"Both ice cream sales and drowning rates peak in July, when temperatures are highest."

R

REASONING — Why evidence supports the claim

"Hot weather causes both increased swimming activity and more ice cream eating. Temperature is the confounding variable — not ice cream."

12 / 15
🧠
Brain Break — Correlation or Causation?

Thumbs UP for Causation, thumbs DOWN for Correlation only!

"Smoking and lung cancer" · "Shoe size and reading in kids" · "Exercise and lower blood pressure" · "Countries with chocolate consumption and Nobel Prizes" · "Vaccines and reduced disease rates"

For every correlation — ask: "What could be the confounding variable?"

13 / 15
Write Time

Your C-E-R Argument

"Choose one of the serious correlations from today. Write a full C-E-R argument explaining why it is a correlation and NOT causation. Name the confounding variable."

✍️ 6 minutes. Use your worksheet — Part 4. Your argument must have all three parts: Claim, Evidence, and Reasoning with a named confounding variable.

14 / 15
Vocabulary Check

Session 6 Key Terms

Correlation
Two variables that tend to change together — does not imply causation
Causation
One variable directly produces a change in another
Confounding Variable
A hidden third factor that causes both correlated variables to change
Spurious Correlation
A correlation with no real causal relationship — coincidence or confounding
Claim
A clear statement about what you believe is true, based on evidence
C-E-R
Claim → Evidence → Reasoning: the framework for data-based arguments
15 / 15
Session Close

The Correlation Detective Rule

"Correlation tells you WHAT. Only a controlled experiment tells you WHY.
Always ask: what's the confounding variable?"

Next session: Data Ethics — privacy, consent, and the genuinely difficult questions about who data belongs to and who benefits from collecting it.