Python for Data — pandas in Google Colab
Write Python in your browser. Zero installation. Real data analysis in minutes.
~22 minutesWhat you need: Open colab.research.google.com in a new tab. Sign in with your Google account. No installation required. Python runs entirely in your browser.
What you’ll do: Create a new notebook, import pandas, build a DataFrame (a table of data), filter rows, group and sum by category, and get instant statistics. These four operations cover 80% of daily analyst work in Python.
Python is the most popular programming language for data analysis. pandas is the Python library that makes working with data easy — think of it as a supercharged spreadsheet inside Python.
Why Python instead of just Sheets?
- Scale: Google Sheets struggles with 100,000 rows. pandas handles 10 million rows without breaking a sweat.
- Speed: Operations that take minutes in Sheets take milliseconds in pandas.
- Reproducibility: Your code is a recipe. Run the same analysis again, instantly, on new data. No clicking. No errors.
- Power: Merge multiple datasets, reshape data, calculate complex formulas — all in a few lines.
SQL → Python comparison (you already know SQL from Phase 1 — this will feel familiar):
SQL
WHERE sales > 400Group:
GROUP BY regionSort:
ORDER BY sales DESC
Python/pandas
df[df['Sales'] > 400]Group:
df.groupby('Region')Sort:
df.sort_values('Sales', ascending=False)
Different syntax. Same logic. Your SQL brain will help you learn pandas faster than you think.
Python + pandas appears on nearly every data analyst and data scientist job posting. Even basic Python knowledge puts you ahead of 60% of applicants who only know spreadsheets.
Google Colab means you can practice Python from any computer with a browser — no setup, no installation, no IT department. You can show your work in a Colab notebook that anyone can view online. That is another portfolio piece.
Switch to your Colab notebook. Type each block of code into a new cell and run it with Shift+Enter. Do not copy-paste — typing it yourself builds the skill.
'Name': ['Alice', 'Bob', 'Carol', 'David'],
'Sales': [500, 300, 750, 200],
'Region': ['North', 'South', 'North', 'East']
}
df = pd.DataFrame(data)
print(df)
WHERE Sales > 400 in SQL.
SELECT Name FROM table. Same concept, Python syntax.
You just wrote Python. Real Python. Hands that have never written a line of Python just wrote six working programs. Take your hands off the keyboard. Shake them out gently. Breathe.
Take at least 2 full minutes. Your brain is processing new syntax patterns. It needs this.
The ONE thing to remember from this module:
What comes next: Module 8 introduces Power BI — Microsoft’s data visualization tool. If you know Tableau (Phase 2), Power BI will feel familiar. Knowing both makes you more versatile.