Capstone + Portfolio + Career Prep
No new concepts. Just you, real data, and the tools you have spent weeks learning. Build your proof.
~25 minutesWhat you need: Google Sheets, Tableau Public (or Google Colab), and a free GitHub account at github.com. Create your GitHub account now if you have not already — it takes 2 minutes and is completely free.
What you’ll do: This module is different from every other one. There is no concept to read and then apply. You are going to use everything you have learned to build something real — your first portfolio project. Step by step.
There is no wrong way to do this. There is only doing it.
A portfolio is proof.
Anyone can list “SQL” on a resume. Not everyone can show a cleaned dataset, a documented analysis, and a live visualization. Your portfolio is what converts “trust me” into “here is evidence.”
GitHub is where data professionals store and share their work. It is free, widely recognized by employers, and searchable. Your GitHub profile is essentially a second resume — one that shows the work, not just the claims.
Your capstone project has four parts:
- Pick a free public dataset about something you genuinely care about
- Clean it and document what you found and fixed
- Answer one specific question using your analysis
- Publish the cleaned data, your analysis, and one visualization on GitHub
One question. One visualization. One clear answer. That is all a first project needs. Quality over quantity.
Entry-level data analyst roles are competitive. Most applicants only have coursework to show. A portfolio with one real project — even a simple one — sets you apart from the majority of candidates who do not have one.
When an interviewer asks “Tell me about a time you worked with data,” you will say “I cleaned a dataset on [topic], asked a specific question, built a dashboard, and published it on GitHub. Here’s the link.” That changes everything.
Follow each step. Take your time. This module is worth lingering on.
- What the dataset is and where it came from
- What question you asked
- What you found
- A link to your Tableau Public dashboard (or Colab notebook)
- What data quality issues you found and fixed
💼 Your portfolio now contains:
- A GitHub repository with a public URL
- A cleaned CSV dataset
- A documented analysis (README)
- A published visualization (Tableau Public URL or Colab link)
- One answered data question
You just built a portfolio piece. That is real. That is yours. Nobody can take that from you. Stand up. Walk to a different room and back. Let yourself feel the weight of what you actually just did.
The ONE thing to remember from this module:
🏁 Phase 4 Complete
You have a real portfolio now. Phase 5 is SAP FI/CO — the enterprise finance system that most data analysts never learn. It is the rare skill that opens Fortune 500 doors. You are about to have it.