Learn Without Walls
← Module 07: Data Cleaning 🏠 Course Home Course Complete 🏆
Module 08 of 08  |  Excel & Google Sheets for Data

🏆 Capstone Project: Monthly Sales Report

This is your final project. You'll take a realistic 50-row dataset — messy, like real data always is — and walk through the complete analysis workflow: import, inspect, clean, calculate, analyze, visualize, and write your findings.

⏰ ~60–90 minutes 🎓 All Levels 📋 Requires: All Modules 01–07
📌 Before You Start
What this project covers

This capstone simulates a real task you might get at work: "Here's the monthly sales data. Analyze it and tell me what happened." You'll use every skill from this course:

Create a new Google Sheets file called "Monthly Sales Report — Capstone". You'll work on multiple sheets within it.

📋 The Dataset — Monthly Sales Report (50 rows)
Enter this data into Sheet1 (rename it "Raw Data")

The intentionally messy entries are highlighted. Spot them as you type — you'll fix them in Step 2.

A — DateB — SalespersonC — RegionD — ProductE — UnitsF — PriceG — RevenueH — Returns
2024-01-05Alice ChenNorthLaptop260012000
2024-01-07alice chenNorthWebcam5793950
2024-01-09Bob PatelSouthKeyboard105050050
2024-01-12BOB PATELSouthMouse Pad2091800
2024-01-15Carol RiveraEastMonitor34501350450
2024-01-18Carol RiveraEastUSB Hub8161280
2024-01-20David KimWestLaptop16006000
2024-01-22David Kim WestHeadphones41204800
2024-01-25Alice ChenNorthKeyboard7503500
2024-01-28Bob PatelSouthMonitor2450900
2024-02-02Carol RiveraEastLaptop46002400600
2024-02-05Alice ChenNorthWebcam10797900
2024-02-08Bob PatelSouthKeyboard15507500
2024-02-10David KimWestHeadphones6120720120
2024-02-13Carol RiveraEastMonitor24509000
2024-02-16 Alice ChenNorthUSB Hub12161920
2024-02-18Bob PatelSouthMouse Pad3092700
2024-02-20David KimWestLaptop36001800600
2024-02-23Alice ChenNorthKeyboard8504000
2024-02-25Carol RiveraEastWebcam5793950
2024-03-01Bob PatelSouthHeadphones71208400
2024-03-03David KimWestUSB Hub15162400
2024-03-06Alice ChenNorthMonitor2450900450
2024-03-09Carol RiveraEastKeyboard205010000
2024-03-12carol riveraEastLaptop260012000
2024-03-15Bob PatelSouthWebcam8796320
2024-03-18David KimWestMouse Pad2592250
2024-03-20Alice ChenNorthHeadphones41204800
2024-03-23Bob PatelSouthUSB Hub1816288
2024-03-26Carol RiveraEastMonitor14504500
2024-04-02David KimWestLaptop260012000
2024-04-04Alice ChenNorthKeyboard12506000
2024-04-07Bob PatelSouthHeadphones5120600120
2024-04-10Carol RiveraEastWebcam8796320
2024-04-13David KimWestUSB Hub20163200
2024-04-16ALICE CHENNorthMonitor34501350450
2024-04-18Bob PatelSouthLaptop460024000
2024-04-21Carol RiveraEastMouse Pad3593150
2024-04-24David KimWestKeyboard10505000
2024-04-27Alice ChenNorthWebcam6794740
2024-05-01Bob PatelSouthMonitor24509000
2024-05-04Carol RiveraEastHeadphones61207200
2024-05-07David KimWestLaptop1600600600
2024-05-10Alice ChenNorthUSB Hub15162400
2024-05-13Bob PatelsouthKeyboard12506000
2024-05-16Carol RiveraEastMouse Pad4093600
2024-05-19David KimWestWebcam7795530
2024-05-22Alice ChenNorthMonitor14504500
2024-05-25Bob PatelSouthHeadphones81209600
2024-05-28Carol RiveraEastLaptop36001800600

Issues embedded in the data (highlighted red): inconsistent capitalization in Salesperson and Region columns, leading spaces, and two blank Returns cells.

🔧 Phase 1
Step 1: Inspect the Data Start here

Before touching anything, understand what you have.

  1. Freeze row 1 (View → Freeze → 1 row). Scroll down to row 50 to see if your data is all there.
  2. Check column widths — auto-fit all columns so you can read the data.
  3. Use =COUNTA(B2:B51) to count how many Salesperson entries you have. Should be 50.
  4. Use =COUNTA(H2:H51) to count Returns entries. If it's less than 50, you have blank cells.
  5. Scroll through the Salesperson column. Notice the capitalization inconsistencies (alice chen, BOB PATEL, etc.).
  6. Scroll through Region — notice "south" and " West" (with a leading space).
  7. Write your observations in a comment or a separate notes area on the sheet.
🔧 Phase 2
Step 2: Clean the Data

Duplicate the "Raw Data" sheet. Rename the copy "Cleaned Data". Work on the copy.

  1. Fix Salesperson (column B): Add a temporary column I. In I2, write =PROPER(TRIM(B2)). Copy down to I51. Then copy I2:I51, and Paste Special → Values into B2:B51. Delete column I.
  2. Fix Region (column C): Repeat the same process with =PROPER(TRIM(C2)). This fixes "south" → "South" and " West" → "West".
  3. Fix blank Returns (column H): In a helper column, use =IF(ISBLANK(H2), 0, H2). Paste values back. Delete helper column.
  4. Verify with COUNTIF: Use =COUNTIF(B:B, "Alice Chen") — you should now get a consistent count for each salesperson name.
  5. Check COUNTA(H2:H51) — should now be 50 (no blanks).
🔧 Phase 3
Step 3: Add Calculated Columns

On the "Cleaned Data" sheet:

  1. Add column I: Net Revenue. In I1, type "Net Revenue". In I2, type =G2-H2. Copy down to I51.
  2. Add column J: Month. In J1, type "Month". In J2, type =TEXT(A2, "MMMM"). Copy down. This extracts the month name from the date.
  3. Format column I as currency (Format → Number → Currency). Format column J as plain text.
  4. Your data now has 10 columns: Date, Salesperson, Region, Product, Units, Price, Revenue, Returns, Net Revenue, Month.
🔧 Phase 4
Step 4: Analyze with Formulas

Add a new sheet called "Analysis". Create a summary table with these calculations:

Total Revenue: =SUM('Cleaned Data'!G2:G51) Total Net Revenue: =SUM('Cleaned Data'!I2:I51) Total Returns: =SUM('Cleaned Data'!H2:H51) Avg Revenue/Sale: =AVERAGE('Cleaned Data'!G2:G51) Total Transactions: =COUNTA('Cleaned Data'!A2:A51) -- By Region -- North Revenue: =SUMIF('Cleaned Data'!C:C, "North", 'Cleaned Data'!I:I) South Revenue: =SUMIF('Cleaned Data'!C:C, "South", 'Cleaned Data'!I:I) East Revenue: =SUMIF('Cleaned Data'!C:C, "East", 'Cleaned Data'!I:I) West Revenue: =SUMIF('Cleaned Data'!C:C, "West", 'Cleaned Data'!I:I) -- By Salesperson -- Alice Chen Revenue: =SUMIF('Cleaned Data'!B:B, "Alice Chen", 'Cleaned Data'!I:I) Bob Patel Revenue: =SUMIF('Cleaned Data'!B:B, "Bob Patel", 'Cleaned Data'!I:I) Carol Rivera Revenue:=SUMIF('Cleaned Data'!B:B, "Carol Rivera", 'Cleaned Data'!I:I) David Kim Revenue: =SUMIF('Cleaned Data'!B:B, "David Kim", 'Cleaned Data'!I:I)
🔧 Phase 5
Step 5: Build a Pivot Table
  1. Click inside the "Cleaned Data" sheet.
  2. Insert → Pivot table → New sheet. Rename the sheet "Pivot — By Person & Month".
  3. Add Salesperson to Rows.
  4. Add Month to Columns.
  5. Add Net Revenue to Values (Summarize by: SUM).
  6. You now see each salesperson's net revenue broken down by month. Who had the best January? Who grew the most from January to May?
  7. Add a slicer for Region to filter the pivot by region.
🔧 Phase 6
Step 6: Create Two Charts

Chart 1: Monthly Net Revenue Trend (Line Chart)

  1. On the Analysis sheet, create a small summary table: Month (Jan–May) and total Net Revenue per month (use SUMIF with the Month column).
  2. Select that table → Insert → Chart → Line chart.
  3. Title: "Monthly Net Revenue Trend — Jan to May 2024".
  4. Horizontal axis label: "Month". Vertical axis label: "Net Revenue ($)".

Chart 2: Net Revenue by Region (Bar Chart)

  1. Use your Analysis sheet's "By Region" summary.
  2. Select the Region + Net Revenue cells → Insert → Chart → Bar chart (or Column chart).
  3. Title: "Net Revenue by Region — Jan to May 2024".
  4. Add axis labels. Color the bars green (#43A047).
🖐 Phase 7 — Your Turn
Step 7: Write Your 3 Findings

Add a new sheet called "Findings". In a text area or a series of cells, write 3 specific, data-backed findings from your analysis. Each finding should:

📝 Example findings structure:

Finding 1: [Salesperson X] had the highest total net revenue of $[amount], which is [X]% above the average salesperson. Consider assigning them to the highest-value accounts.

Finding 2: The [Month] had the lowest net revenue at $[amount], which may be worth investigating.

Finding 3: Returns accounted for $[amount] of total revenue ($[gross]), representing [X]% return rate. The [Product] had the most returns.

Hint for return rate: =SUM(Returns)/SUM(Revenue) gives you the return rate as a decimal. Format it as percentage.
Hint for top salesperson: Use your Analysis sheet SUMIF values to compare. Or sort the pivot table by Total column.
💡 Sample Solution Notes (click to expand)

These are approximate expected values based on the dataset above. Your numbers may vary slightly if you entered data differently.

Cleaning

  • After PROPER(TRIM()) on Salesperson: all 4 names should be consistent across 50 rows.
  • Region "south" → "South"; " West" → "West".
  • Two blank Returns cells → filled with 0.
  • COUNTA(B2:B51) = 50, COUNTA(H2:H51) = 50 after cleaning.

Summary Formulas (approximate)

Total Gross Revenue: ~$31,969 Total Returns: ~$3,540 Total Net Revenue: ~$28,429 Average Revenue per Sale: ~$639 Transactions: 50

By Region (Net Revenue, approximate)

North: ~$7,681 South: ~$7,308 East: ~$7,755 West: ~$5,685

By Salesperson (Net Revenue, approximate)

Alice Chen: ~$7,471 Bob Patel: ~$7,538 Carol Rivera: ~$8,315 David Kim: ~$5,105

Sample Findings

Finding 1: Carol Rivera generated the highest net revenue (~$8,315), approximately 16% above the average salesperson revenue. Alice Chen and Bob Patel performed similarly at ~$7,500 each.

Finding 2: David Kim generated the lowest net revenue (~$5,105), which is about 37% below Carol. A review of his sales territory (West region, lowest revenue at ~$5,685) may be warranted.

Finding 3: Returns totaled ~$3,540 on gross revenue of ~$31,969 — an 11% return rate. Laptops and Monitors accounted for the majority of returns due to their high unit price. A product quality or fit-to-need review for those categories is recommended.

🧠 Brain Break
You Just Did What Analysts Do

What you completed in this capstone — inspect, clean, calculate, pivot, chart, and report — is exactly what a business data analyst does daily. The tools may become more powerful (SQL, Python, Tableau, Power BI), but the workflow stays the same.

Spreadsheet skills remain foundational because most data starts in spreadsheets, gets shared as spreadsheets, and gets communicated through spreadsheets. You now have those skills. From here, the natural next step is SQL for larger datasets and Python for automation — both courses are available on this site.

✅ Wrap Up
Course Complete — What You've Learned

🎓 Get Your Certificate

You completed the Excel & Google Sheets for Data course. Generate a free printable certificate to recognize your achievement.

Get Your Certificate →

← Back to Course Home