Welcome to your first week in the exciting world of Data Analytics! Whether you're a fresh graduate, a working professional looking to switch careers, or someone simply curious about how companies make sense of their data, this guide is designed just for you.
Over the next seven days, you'll build a rock-solid foundation in data analytics concepts and gain practical skills in Excel that employers are actively looking for. By the end of this week, you'll be able to analyze real business data and create your first analytics dashboard.
Let's begin this transformative journey!
What You'll Learn This Week
- Day 1: Introduction to Data Analytics
- Day 2: Understanding Data Types & Structures
- Day 3: Business Questions & Problem Framing
- Day 4: KPIs & Metrics Design
- Day 5: Excel Fundamentals for Analytics
- Day 6: Advanced Excel & Data Analysis
- Day 7: Week 1 Project & Assessment
- Self-Assessment Quiz
- About the Instructor
Introduction to Data Analytics
Understanding the What, Why, and How of Data Analytics
Watch Video Lesson
What is Data Analytics?
Think of data analytics as being a detective, but instead of solving crimes, you're solving business problems. Just like a detective collects evidence, examines clues, and connects the dots to solve a case, a data analyst collects data, examines patterns, and connects insights to solve business challenges.
Real-World Example: How You Already Use Data Analytics
Believe it or not, you already practice data analytics in your daily life:
- Choosing a restaurant: You check ratings, read reviews, compare prices, and look at photos before deciding
- Online shopping: You compare products, check prices across websites, read customer reviews
- Planning a trip: You analyze weather data, compare flight prices over different dates, read hotel reviews
Professional data analytics follows the same logic, just at a much larger scale with more sophisticated tools.
The Four Types of Analytics
Data analytics can be categorized into four types, each building upon the previous one and offering increasingly valuable insights:
| Type | Question It Answers | Example | Complexity |
|---|---|---|---|
| Descriptive | "What happened?" | Sales dropped 15% last month | Basic |
| Diagnostic | "Why did it happen?" | Sales dropped because competitor launched a discount campaign | Intermediate |
| Predictive | "What will happen?" | Based on trends, sales will likely increase 10% next quarter | Advanced |
| Prescriptive | "What should we do?" | Launch a loyalty program to retain customers and increase sales by 12% | Expert |
Example: Netflix and Analytics
Descriptive: 80% of users watched at least one show this week
Diagnostic: Viewership increased because a popular new series was released
Predictive: Based on viewing patterns, this user will likely enjoy sci-fi movies
Prescriptive: Show "Stranger Things" in the top row for users who watched similar sci-fi content
The Data Analytics Lifecycle
Every analytics project follows a structured process. Understanding this lifecycle is crucial for success:
- Define the Problem: Clearly understand what business question you're trying to answer. This is the most critical step - a well-defined problem is half-solved.
- Collect Data: Gather relevant data from various sources - databases, spreadsheets, APIs, surveys, etc.
- Clean & Prepare Data: Remove errors, handle missing values, format consistently. This typically takes 60-80% of an analyst's time!
- Analyze Data: Apply statistical methods, create visualizations, identify patterns and trends.
- Interpret Results: Draw meaningful conclusions from your analysis. What story does the data tell?
- Communicate Findings: Present insights to stakeholders through reports, dashboards, or presentations.
- Take Action: Implement recommendations and monitor outcomes.
Role of a Data Analyst in Organizations
As a data analyst, you'll wear many hats. Here's what a typical data analyst does:
- Data Collection & Cleaning: Gathering data from multiple sources and ensuring it's accurate and usable
- Data Analysis: Using statistical tools and software to analyze trends and patterns
- Reporting & Visualization: Creating dashboards and reports that communicate insights clearly
- Business Partnership: Working with different teams to understand their data needs
- Problem Solving: Translating business problems into analytical questions and vice versa
Industry Applications
Data analytics is used across virtually every industry. Here are some examples:
BFSI (Banking, Financial Services, Insurance)
- Fraud detection and prevention
- Credit risk assessment
- Customer segmentation for targeted marketing
- Stock market trend analysis
Healthcare
- Patient outcome prediction
- Disease outbreak tracking
- Hospital resource optimization
- Drug effectiveness analysis
Retail & E-commerce
- Customer behavior analysis
- Inventory management
- Price optimization
- Recommendation systems
IT & Technology
- User experience optimization
- System performance monitoring
- A/B testing for product features
- Cybersecurity threat detection
Career Paths in Data Analytics
The data analytics field offers multiple career progression paths:
- Data Analyst (Entry Level): Analyze data, create reports, build dashboards
- Senior Data Analyst: Lead complex projects, mentor juniors, work with stakeholders
- Business Intelligence Analyst: Focus on strategic reporting and data visualization
- Data Scientist: Build predictive models, work with machine learning
- Analytics Manager: Lead analytics teams, define strategy
- Chief Data Officer: Executive-level data strategy leadership
Day 1 Key Takeaways
- Data analytics is about turning raw data into actionable insights
- There are four types: Descriptive, Diagnostic, Predictive, and Prescriptive
- Every project follows the analytics lifecycle from problem definition to action
- Data analysts are in high demand across all industries
- Multiple career paths exist from entry-level analyst to C-suite executive
Understanding Data Types & Structures
The Building Blocks of Data Analysis
Watch Video Lesson
Before you can analyze data effectively, you need to understand what types of data exist and how they're structured. This knowledge is fundamental because the type of data you're working with determines which analysis methods you can use.
Data Structure: Organized vs. Unorganized
Structured Data
Structured data is highly organized and fits neatly into rows and columns, like an Excel spreadsheet or a database table.
Example of Structured Data
| Customer ID | Name | Age | Purchase Amount | Date |
|---|---|---|---|---|
| 1001 | Rahul Kumar | 28 | 5,499 | 2024-01-15 |
| 1002 | Priya Sharma | 34 | 12,999 | 2024-01-16 |
| 1003 | Amit Patel | 45 | 3,250 | 2024-01-16 |
Easy to search, sort, and analyze. This is what you'll work with most often!
Unstructured Data
Unstructured data doesn't have a predefined format. It includes:
- Text documents and emails
- Social media posts
- Images and videos
- Audio recordings
- Web pages
Unstructured data requires special techniques to analyze and often needs to be converted to structured format first.
Semi-structured Data
Semi-structured data falls between the two. It has some organizational properties but doesn't fit into traditional tables. Examples include:
- JSON files
- XML documents
- Email (has fields like To, From, Subject, but body is unstructured)
Data Types: The Different Flavors of Data
Understanding data types is crucial because it determines how you can analyze and visualize information.
Numerical Data (Quantitative)
Numerical data represents quantities and can be measured. It comes in two forms:
Discrete Data
Countable values that cannot be broken down into smaller parts. Think of things you can count:
- Number of customers (you can't have 2.5 customers)
- Number of products sold
- Number of website visits
- Number of employees
Continuous Data
Measurable values that can take any value within a range, including decimals:
- Height (175.5 cm)
- Weight (68.3 kg)
- Temperature (36.6 degrees)
- Time (2.5 hours)
- Revenue (Rs. 1,45,678.50)
Categorical Data (Qualitative)
Categorical data represents characteristics or qualities that can be grouped into categories.
Nominal Data
Categories with no inherent order or ranking:
- Gender: Male, Female, Other
- Blood Type: A, B, AB, O
- City: Mumbai, Delhi, Bangalore
- Product Category: Electronics, Clothing, Books
You can count them but can't rank them - is "Mumbai" greater than "Delhi"? No!
Ordinal Data
Categories with a meaningful order, but intervals between them aren't equal:
- Education: High School < Bachelor's < Master's < PhD
- Customer Satisfaction: Poor < Average < Good < Excellent
- T-shirt Size: S < M < L < XL
- Movie Rating: 1 star < 2 stars < 3 stars < 4 stars < 5 stars
You can rank them, but the "distance" between categories isn't necessarily equal.
| Data Type | Can Count? | Can Order? | Can Measure? | Example |
|---|---|---|---|---|
| Nominal | Yes | No | No | Colors, Cities |
| Ordinal | Yes | Yes | No | Ratings, Rankings |
| Discrete | Yes | Yes | Yes (whole numbers) | Count of items |
| Continuous | Yes | Yes | Yes (any value) | Height, Weight |
Date/Time Data
Date and time data is special because it can be treated as both categorical and numerical depending on your analysis needs.
- As Categorical: Group sales by month (January, February, etc.) or day of week (Monday, Tuesday)
- As Numerical: Calculate time differences, trends over time
Common date/time formats you'll encounter:
- DD/MM/YYYY (15/01/2024) - Common in India and Europe
- MM/DD/YYYY (01/15/2024) - Common in USA
- YYYY-MM-DD (2024-01-15) - ISO standard, best for sorting
- DD-Mon-YYYY (15-Jan-2024)
Pro Tip: Always Use Consistent Date Formats
Inconsistent date formats are one of the most common data quality issues. When working with dates, always ensure consistency across your entire dataset.
Data Quality Dimensions
"Garbage in, garbage out" - the quality of your analysis is only as good as the quality of your data. Here are the key dimensions of data quality:
1. Accuracy
Is the data correct? Does it reflect reality?
Example: A customer's age is recorded as 150 years - clearly inaccurate!
2. Completeness
Is all required data present? Are there missing values?
Example: 20% of customer records are missing email addresses - incomplete data!
3. Consistency
Is the same information recorded the same way everywhere?
Example: One record says "Bangalore" while another says "Bengaluru" for the same city - inconsistent!
4. Timeliness
Is the data up-to-date? Is it available when needed?
5. Validity
Does the data conform to defined rules and formats?
Example: Phone number field contains "abc123" - invalid format!
Common Data Formats
As a data analyst, you'll work with various file formats. Here are the most common ones:
| Format | Full Name | Best For | Pros | Cons |
|---|---|---|---|---|
| CSV | Comma-Separated Values | Simple tabular data | Universal, lightweight | No formatting, limited data types |
| Excel (.xlsx) | Excel Workbook | Business data, reports | Formatting, formulas, multiple sheets | Larger file size, proprietary |
| JSON | JavaScript Object Notation | Web data, APIs | Flexible structure, widely used | Can be complex to read |
| XML | Extensible Markup Language | Configuration, data exchange | Self-describing, widely supported | Verbose, larger files |
Day 2 Key Takeaways
- Data can be structured (tables), unstructured (text, images), or semi-structured (JSON, XML)
- Numerical data can be discrete (countable) or continuous (measurable)
- Categorical data can be nominal (no order) or ordinal (ordered)
- Data quality has five dimensions: Accuracy, Completeness, Consistency, Timeliness, Validity
- CSV and Excel are the most common formats you'll encounter
Business Questions & Problem Framing
The Art of Asking the Right Questions
Watch Video Lesson
The most critical skill a data analyst can develop isn't technical - it's the ability to ask the right questions. A perfectly executed analysis that answers the wrong question is worthless. Today, we'll learn how to frame business problems into questions that data can answer.
Why Asking the Right Questions Matters
Consider this scenario:
The Wrong Approach
Manager says: "Our sales are down. Can you look at the data and tell me what's happening?"
Analyst thinks: "Let me create every possible chart and report about sales and show them all."
Result: Weeks of work, hundreds of charts, no clear insight, frustrated manager.
The Right Approach
Manager says: "Our sales are down. Can you look at the data?"
Analyst asks: "When you say 'down,' compared to what? Last month? Last year? A target? Which products or regions are you most concerned about?"
Result: Focused analysis, clear insights, actionable recommendations.
The SMART Framework for Business Questions
Good analytical questions follow the SMART framework:
- S - Specific: Clearly defined, not vague
- M - Measurable: Can be answered with data
- A - Actionable: The answer leads to a decision or action
- R - Relevant: Aligned with business goals
- T - Time-bound: Has a clear time frame
| Vague Question | SMART Question |
|---|---|
| "Are customers happy?" | "What percentage of customers rated their experience 4 or 5 stars in Q4 2024?" |
| "Is marketing working?" | "Which marketing channel generated the highest ROI in the last 6 months?" |
| "Why are sales bad?" | "Which product categories showed a decline of more than 10% compared to the same quarter last year?" |
| "How can we grow?" | "Which customer segments have the highest lifetime value and lowest acquisition cost?" |
Translating Business Problems to Analytical Questions
Here's a step-by-step process to translate business problems into analytical questions:
- Listen carefully: Understand what the stakeholder is really asking
- Clarify ambiguity: Ask follow-up questions to remove vagueness
- Identify the metric: What exactly needs to be measured?
- Define the comparison: Compared to what?
- Set the time frame: What period are we analyzing?
- Consider segmentation: Should we break this down by category, region, customer type?
Example: E-commerce Sales Decline
Business Problem: "Our online sales have dropped"
Clarifying Questions:
- By how much? (20% decline)
- Compared to when? (Same month last year)
- All products or specific categories? (Electronics mainly)
- All regions? (North India specifically)
Refined Analytical Question: "Why did electronics sales in North India decline by 20% in November 2024 compared to November 2023, and which specific factors contributed to this decline?"
Stakeholder Requirements Gathering
Before starting any analysis, you need to understand what your stakeholders actually need. Key questions to ask:
- What decision will this analysis inform? This helps you focus on actionable insights.
- Who will use this analysis? Executive summary for leadership? Detailed report for the team?
- When do you need it? Urgent decisions need quick analysis; strategic planning allows deeper exploration.
- What format do you prefer? Dashboard? Report? Presentation?
- What data is available? No point planning an analysis if the required data doesn't exist.
Hypothesis Formation
A hypothesis is an educated guess about what you expect to find in the data. Forming hypotheses before diving into analysis helps you:
- Stay focused on what matters
- Avoid getting lost in the data
- Recognize unexpected findings
Hypothesis Structure
"I believe [X] because [reason], and I can verify this by [measurement]."
Example: "I believe sales dropped in North India because a major competitor opened stores there last quarter. I can verify this by comparing sales trends before and after the competitor's entry and analyzing lost customers' zip codes."
Day 3 Key Takeaways
- Asking the right question is more important than technical analysis skills
- Use the SMART framework: Specific, Measurable, Actionable, Relevant, Time-bound
- Always clarify vague requests before starting analysis
- Understand stakeholder needs: What decision? Who? When? What format?
- Form hypotheses to guide your analysis and stay focused
KPIs & Metrics Design
Measuring What Matters
Watch Video Lesson
Today we'll learn about Key Performance Indicators (KPIs) and metrics - the numbers that tell you whether a business is succeeding or failing. Understanding KPIs is essential because they form the foundation of any business dashboard or report you'll create.
What's the Difference: Metrics vs. KPIs?
Metric: Any quantifiable measure that tracks some aspect of business performance.
KPI (Key Performance Indicator): A metric that is critical to the success of the business. All KPIs are metrics, but not all metrics are KPIs.
Example: E-commerce Store
Metrics (many things you can measure):
- Number of page views
- Time spent on site
- Number of products viewed
- Cart additions
- Bounce rate
KPIs (the critical few):
- Revenue
- Conversion Rate
- Customer Acquisition Cost
Leading vs. Lagging Indicators
Understanding this distinction is crucial for effective business management:
| Aspect | Leading Indicators | Lagging Indicators |
|---|---|---|
| Definition | Predict future performance | Measure past performance |
| When to use | To make proactive decisions | To evaluate results |
| Example (Sales) | Number of sales calls made | Monthly revenue |
| Example (Health) | Hours of exercise per week | Weight/BMI |
| Advantage | Can take corrective action early | Accurate, factual |
Industry-Specific KPIs
Different industries and functions track different KPIs. Here are the most important ones:
Sales KPIs
- Revenue: Total money earned from sales
- Conversion Rate: (Customers / Visitors) × 100
- Average Order Value (AOV): Revenue / Number of Orders
- Customer Acquisition Cost (CAC): Marketing Spend / New Customers
- Customer Lifetime Value (LTV): Average purchase value × Purchase frequency × Customer lifespan
A healthy ratio is 3:1 or higher (you earn 3x what you spend to acquire a customer)
Marketing KPIs
- Click-Through Rate (CTR): (Clicks / Impressions) × 100
- Cost Per Click (CPC): Ad Spend / Number of Clicks
- Return on Ad Spend (ROAS): Revenue from Ads / Ad Spend
- Engagement Rate: (Likes + Comments + Shares) / Followers × 100
Operations KPIs
- Efficiency: Output / Input × 100
- Utilization Rate: Actual Hours Worked / Available Hours × 100
- Defect Rate: Defective Units / Total Units × 100
- On-Time Delivery Rate: On-Time Deliveries / Total Deliveries × 100
Finance KPIs
- Return on Investment (ROI): (Gain - Cost) / Cost × 100
- Profit Margin: Net Profit / Revenue × 100
- Cash Flow: Cash In - Cash Out
- Accounts Receivable Turnover: Net Sales / Average Accounts Receivable
Calculating ROI - A Simple Example
You invest Rs. 10,000 in a marketing campaign and generate Rs. 25,000 in sales.
For every rupee spent, you earned Rs. 1.50 in return. That's a great ROI!
Setting KPI Targets and Benchmarks
A KPI without a target is just a number. Here's how to set meaningful targets:
- Historical Benchmarking: Compare to your own past performance
- Industry Benchmarking: Compare to industry standards
- Competitor Benchmarking: Compare to competitors (if data is available)
- Goal-Based: Work backward from business objectives
KPI Dashboard Best Practices
When building KPI dashboards, follow these principles:
- Less is more: Focus on 5-7 KPIs maximum. More is confusing.
- Show trends: Include historical data to show direction
- Use visual cues: Red/yellow/green to indicate status
- Enable drill-down: Allow users to explore underlying data
- Update regularly: Stale data is worse than no data
Day 4 Key Takeaways
- KPIs are the vital few metrics that indicate business success
- Leading indicators predict future; lagging indicators measure past
- Different functions track different KPIs (Sales, Marketing, Ops, Finance)
- Always set targets using benchmarks (historical, industry, competitor, goal-based)
- Effective dashboards focus on 5-7 KPIs with trends and visual cues
Excel Fundamentals for Analytics
Your First Analytics Tool
Watch Video Lesson
Excel is where most data analysts start their careers. Despite advanced tools like Python and R, Excel remains essential because it's universally available, easy to learn, and perfect for quick analysis. Today we'll master the fundamentals.
Practice Along With Sample Data
Download these beginner-friendly datasets to practice the Excel functions below:
- Supermarket Sales - Great for practicing SUM, AVERAGE, COUNT, and IF functions
- Customer Dataset - Perfect for text functions and data cleaning practice
- Mall Customers - Simple dataset for statistical functions
Essential Excel Functions for Data Analysis
Basic Aggregation Functions
Adds all numbers in a range
Example: =SUM(B2:B100) adds all values from B2 to B100
Calculates the arithmetic mean
Example: =AVERAGE(C2:C50) finds the average of C2 to C50
Counts cells containing numbers
Example: =COUNT(A2:A100) counts how many numeric values
Counts non-empty cells (numbers AND text)
Example: =COUNTA(A2:A100) counts all non-blank cells
Counts empty cells - useful for finding missing data!
Example: =COUNTBLANK(D2:D100)
Statistical Functions
Find the smallest and largest values
Example: =MIN(B2:B100) finds the minimum value
Finds the middle value (better than average for skewed data)
Example: =MEDIAN(C2:C100)
Finds the most frequently occurring value
Example: =MODE.SNGL(D2:D100)
Logical Functions
Makes decisions based on conditions
Example: =IF(B2>1000, "High Value", "Low Value")
Returns TRUE only if ALL conditions are true
Example: =AND(B2>100, B2<500) - TRUE if B2 is between 100 and 500
Returns TRUE if ANY condition is true
Example: =OR(A2="Red", A2="Blue") - TRUE if A2 is Red or Blue
Combining IF with AND/OR
Classify customers based on purchase amount and frequency:
This returns "Premium" if purchase amount (B2) is over 10,000 AND frequency (C2) is over 5.
Text Functions
Extracts characters from the left
Example: =LEFT("ABCDEF", 3) returns "ABC"
Extracts characters from the right
Example: =RIGHT("ABCDEF", 2) returns "EF"
Extracts characters from the middle
Example: =MID("ABCDEF", 2, 3) returns "BCD"
Joins text strings together
Example: =A2 & " " & B2 combines first and last name with a space
Cell References: The Foundation of Excel
Understanding cell references is crucial for building formulas that work correctly when copied.
| Type | Syntax | Behavior When Copied | Use Case |
|---|---|---|---|
| Relative | A1 | Changes based on new position | Most calculations |
| Absolute | $A$1 | Never changes | Referencing fixed values (tax rate, exchange rate) |
| Mixed (Row) | A$1 | Column changes, row stays fixed | Looking up from a fixed row |
| Mixed (Column) | $A1 | Row changes, column stays fixed | Looking up from a fixed column |
Pro Tip: F4 Shortcut
Press F4 while editing a cell reference to cycle through reference types: A1 → $A$1 → A$1 → $A1 → A1
Named Ranges and Tables
Instead of referring to ranges like B2:B100, you can give them meaningful names:
- Select the range
- Click in the Name Box (left of formula bar)
- Type a name (e.g., "SalesData")
- Now use =SUM(SalesData) instead of =SUM(B2:B100)
This makes formulas more readable and easier to maintain.
Data Validation
Data validation prevents bad data from entering your spreadsheet:
- Dropdown lists: Limit choices to predefined options
- Number ranges: Only allow numbers between X and Y
- Date ranges: Only allow dates in a specific range
- Text length: Limit the number of characters
Access via: Data Tab → Data Validation
Try It Yourself - Day 5 Exercises
Download the Week 1 Sales Dataset and complete these exercises:
- Exercise 1: Calculate the total revenue (Total_Amount column) using SUM
- Exercise 2: Find the average order value using AVERAGE
- Exercise 3: Count how many orders are in the dataset using COUNTA
- Exercise 4: Use IF to create a new column "Order_Size" - "Large" if Total_Amount > 5000, otherwise "Small"
- Exercise 5: Extract the month from the Date column using MID or TEXT function
- Exercise 6: Find the MIN and MAX values in the Total_Amount column
Expected Answers: Total Revenue: ~Rs. 11,00,000+ | Average Order: ~Rs. 3,100 | Total Orders: 352
Day 5 Key Takeaways
- Master SUM, AVERAGE, COUNT, COUNTA, COUNTBLANK for basic analysis
- Use IF, AND, OR for creating conditional logic
- Text functions (LEFT, RIGHT, MID) help clean and extract data
- Understand relative ($A$1), absolute (A1), and mixed references ($A1, A$1)
- Named ranges make formulas readable; data validation prevents errors
Advanced Excel & Data Analysis
Lookup Functions, Conditional Aggregations, and Pivot Tables
Watch Video Lesson
Today we'll learn the functions that separate beginner Excel users from intermediate analysts. These skills are mentioned in almost every data analyst job description!
Practice Datasets for Advanced Excel
These datasets are ideal for practicing VLOOKUP, INDEX-MATCH, and Pivot Tables:
- Superstore Dataset - Multi-sheet data perfect for VLOOKUP practice (Orders, Returns, People tables)
- Sample Sales Data - 2,800+ rows ideal for Pivot Tables and SUMIFS
- Brazilian E-commerce (Olist) - Multiple related tables for advanced INDEX-MATCH practice
Lookup Functions: Finding Data Across Tables
VLOOKUP (Vertical Lookup)
VLOOKUP searches for a value in the first column of a range and returns a value from another column.
lookup_value: The value you're searching for
table_array: The table to search in
col_index_num: Which column to return (1, 2, 3...)
range_lookup: FALSE for exact match (use this 99% of the time)
VLOOKUP Example
You have a Product ID and want to find the product name from a product list.
This looks up the value in A2 in the Products sheet, and returns the value from column 2 (product name).
INDEX-MATCH: The Better Alternative
INDEX-MATCH is more flexible than VLOOKUP because:
- It can look left (VLOOKUP can only look right)
- It doesn't break when you insert columns
- It's faster on large datasets
MATCH finds the position of the lookup value
INDEX returns the value at that position from the return range
INDEX-MATCH Example
Same lookup as above, but using INDEX-MATCH:
MATCH finds A2 in column A. INDEX returns the corresponding value from column B.
XLOOKUP (Excel 365)
If you have Excel 365, XLOOKUP combines the best of both worlds:
Example: =XLOOKUP(A2, Products!A:A, Products!B:B, "Not Found")
Conditional Aggregation Functions
These functions aggregate data based on conditions - essential for any analyst!
SUMIF and SUMIFS
Sum values based on one condition
Example: =SUMIF(A:A, "Electronics", B:B)
Sums all values in column B where column A is "Electronics"
Sum values based on multiple conditions
Example: =SUMIFS(C:C, A:A, "Electronics", B:B, "North")
Sums column C where A is "Electronics" AND B is "North"
COUNTIF and COUNTIFS
Count cells based on one condition
Example: =COUNTIF(A:A, ">1000")
Counts how many values in column A are greater than 1000
Count cells based on multiple conditions
Example: =COUNTIFS(A:A, "Mumbai", B:B, ">=10000")
Counts rows where city is Mumbai AND amount is >= 10000
AVERAGEIF and AVERAGEIFS
Same logic as SUMIF/SUMIFS, but calculates average instead of sum.
Average of column B where column A is "Premium"
Pivot Tables: The Most Powerful Excel Feature
Pivot Tables are arguably the most important feature for data analysts. They allow you to summarize, analyze, explore, and present large datasets in seconds.
Creating a Pivot Table
- Select your data (including headers)
- Insert → PivotTable
- Choose where to place it (new worksheet recommended)
- Drag fields to the appropriate areas:
- Rows: Categories to group by (e.g., Product, Region)
- Columns: Secondary grouping (e.g., Month, Year)
- Values: Numbers to summarize (e.g., Sum of Sales)
- Filters: Fields to filter the entire table
Pivot Table Example
You have sales data with columns: Date, Region, Product, Salesperson, Amount
To see total sales by Region and Product:
- Rows: Region
- Columns: Product
- Values: Sum of Amount
In seconds, you get a cross-tabulation showing sales for every Region-Product combination!
Pivot Table Features
- Grouping: Group dates by month/quarter/year, or group numbers into ranges
- Calculated Fields: Create new calculations within the pivot table
- Slicers: Visual filters that make your pivot table interactive
- Pivot Charts: Create charts directly from pivot tables
Sorting, Filtering & Conditional Formatting
Sorting
Select your data → Data tab → Sort. You can sort by multiple columns (e.g., first by Region, then by Sales within each region).
Filtering
Select your data → Data tab → Filter. Dropdown arrows appear in headers. Click to filter by specific values, text, or conditions.
Conditional Formatting
Automatically format cells based on their values. Common uses:
- Data Bars: Show relative values with bars inside cells
- Color Scales: Gradient from low to high (red → green)
- Icon Sets: Arrows, traffic lights, stars based on values
- Highlight Rules: Color cells greater than X, top 10, duplicates, etc.
Try It Yourself - Day 6 Exercises
Continue using the Week 1 Sales Dataset for these advanced exercises:
- Exercise 1 (SUMIF): Calculate total sales for "Electronics" category only
- Exercise 2 (SUMIFS): Calculate total sales for "North" region AND "Electronics" category
- Exercise 3 (COUNTIF): Count how many orders are from "Returning" customers
- Exercise 4 (AVERAGEIF): Find the average order value for "New" customers only
- Exercise 5 (Pivot Table): Create a pivot table showing Total Sales by Region (rows) and Product Category (columns)
- Exercise 6 (Pivot Table): Create a pivot table showing Monthly Sales Trend (group Date by Month)
- Exercise 7: Apply conditional formatting to highlight orders above Rs. 5,000
Hints: Electronics total: ~Rs. 2,80,000 | North Electronics: ~Rs. 70,000+ | Returning customers: ~175 orders
Day 6 Key Takeaways
- VLOOKUP is useful but INDEX-MATCH is more flexible
- SUMIF/COUNTIF/AVERAGEIF aggregate data based on conditions
- Add "S" (SUMIFS, COUNTIFS) for multiple conditions
- Pivot Tables are the most powerful Excel feature for data analysis
- Conditional formatting makes data patterns visible at a glance
Week 1 Project & Assessment
Putting It All Together
Watch Video Lesson
Congratulations on reaching the final day of Week 1! Today, you'll apply everything you've learned to create your first complete analytics project: a Sales Performance Dashboard.
Project: Sales Performance Analysis Dashboard
Project Brief
You've been hired as a data analyst for a retail company. The Sales Director has asked you to analyze the last 6 months of sales data and create a dashboard that answers key business questions.
Dataset Description
You'll work with a sales dataset containing:
- Date: Transaction date
- Order ID: Unique identifier for each order
- Region: North, South, East, West
- Product Category: Electronics, Clothing, Home & Kitchen, Sports
- Product Name: Specific product
- Quantity: Units sold
- Unit Price: Price per unit
- Total Amount: Quantity × Unit Price
- Customer Type: New, Returning
EDUSHARK Project Dataset - Ready to Use!
We've prepared a custom dataset specifically designed for this Week 1 project. It contains 352 sales transactions over 6 months with all the required columns.
Download Week 1 Sales Dataset (CSV)
352 rows | 9 columns | Jan-Jun 2024 | Perfect for Excel analysis
Additional Practice Datasets
Want more practice? Try these free datasets from Kaggle and other sources:
Recommended Datasets for Week 1 Project:
- Superstore Sales Dataset - Perfect for sales dashboard projects with Region, Category, Sales, and Profit data
- Sample Sales Data - Comprehensive retail sales data with order dates, quantities, and customer details
- E-commerce Data (UK Retailer) - Real transaction data from a UK online store
Beginner Practice Datasets:
- Groceries Dataset - Simple dataset for practicing basic Excel functions
- Retail Orders Dataset - Clean data perfect for VLOOKUP and Pivot Table practice
- India Consumer Price Index - Government data for practicing with Indian context
Additional Resources:
- Kaggle Datasets - Thousands of free datasets (search for "sales", "retail", or "business")
- Data.gov.in - Indian Government Open Data Portal
- Maven Analytics Data Playground - Curated datasets for practice projects
- GitHub Datasets - Community-maintained datasets collection
Tip: Download multiple datasets and practice the same analysis techniques on different data to strengthen your skills!
Project Requirements
Part 1: Data Cleaning & Preparation
- Check for and handle missing values
- Verify data types are correct
- Check for duplicate records
- Add calculated columns if needed (e.g., Month, Week)
Part 2: Analysis Questions to Answer
- Overall Performance:
- What was the total revenue for the 6-month period?
- How many orders were placed?
- What was the average order value?
- Regional Analysis:
- Which region generated the most revenue?
- How does order volume vary by region?
- Product Analysis:
- Which product category is the bestseller?
- What are the top 5 products by revenue?
- Time Trends:
- How did monthly sales change over time?
- Are there any seasonal patterns?
- Customer Analysis:
- What percentage of revenue comes from returning customers?
Part 3: KPIs to Calculate
Create a KPI summary section with:
- Total Revenue
- Total Orders
- Average Order Value (AOV)
- Revenue Growth (Month over Month)
- Returning Customer Rate
Part 4: Dashboard Creation
Build a summary dashboard that includes:
- KPI boxes at the top
- Pivot Table showing sales by Region and Category
- Pivot Chart showing monthly sales trend
- Top 5 products list
Part 5: Recommendations
Based on your analysis, write 3 actionable recommendations for the Sales Director. For example:
- "Focus marketing efforts on [Region] as it shows the highest growth potential..."
- "Consider expanding the [Category] product line as it has the highest average order value..."
Assessment Criteria
| Component | Weight | What's Evaluated |
|---|---|---|
| Project Submission | 50% | Correctness, completeness, presentation quality |
| Quiz: Week 1 Concepts | 30% | Understanding of analytics concepts, data types, KPIs, Excel functions |
| Peer Review | 20% | Quality of feedback given to peers |
Tips for Success
- Start with questions: Always refer back to the business questions you're trying to answer
- Clean first: Don't start analysis until your data is clean
- Document your work: Add comments explaining your formulas and logic
- Format for readability: Use consistent formatting, clear labels, and appropriate number formats
- Tell a story: Arrange your dashboard so it tells a logical story from overview to details
Week 1 Complete!
You've completed an intensive week covering:
- What data analytics is and how it's used in organizations
- Different types of data and quality dimensions
- How to frame business problems as analytical questions
- Understanding KPIs and metrics across industries
- Essential Excel functions for data analysis
- Advanced Excel including VLOOKUP, INDEX-MATCH, and Pivot Tables
You've built your first analytics dashboard!
Next week, you'll dive into SQL and databases - the foundation of working with large-scale data!
Week 1 Self-Assessment Quiz
Test Your Understanding
Answer these questions to check your understanding of Week 1 concepts. Try to answer without looking back!
Question 1: Types of Analytics
A company wants to predict how many units of a product will sell next quarter. Which type of analytics is this?
- Descriptive Analytics
- Diagnostic Analytics
- Predictive Analytics
- Prescriptive Analytics
Show Answer
C. Predictive Analytics - Predictive analytics answers "What will happen?" by forecasting future outcomes based on historical data.
Question 2: Data Types
Customer satisfaction ratings (Poor, Average, Good, Excellent) are an example of which data type?
- Nominal Data
- Ordinal Data
- Discrete Data
- Continuous Data
Show Answer
B. Ordinal Data - Ordinal data has a meaningful order (Poor < Average < Good < Excellent), but the intervals between categories aren't necessarily equal.
Question 3: SMART Framework
Which of these is a SMART business question?
- "How can we increase sales?"
- "Are customers happy with our service?"
- "Which product category showed more than 15% growth in Q4 2024 compared to Q4 2023?"
- "Why is marketing not working?"
Show Answer
C. - This question is Specific (product category, growth rate), Measurable (15%), Actionable, Relevant, and Time-bound (Q4 2024 vs Q4 2023).
Question 4: KPIs
A "leading indicator" is best described as:
- A metric that measures past performance
- A metric that predicts future performance
- The most important KPI in a dashboard
- A metric used only by senior leadership
Show Answer
B. A metric that predicts future performance - Leading indicators (like sales calls made) help predict future outcomes, while lagging indicators (like revenue) measure past results.
Question 5: Excel Functions
Which formula correctly calculates the total sales only for the "North" region?
- =SUM(B:B, "North")
- =SUMIF(A:A, "North", B:B)
- =COUNTIF(A:A, "North", B:B)
- =TOTAL(A:A="North", B:B)
Show Answer
B. =SUMIF(A:A, "North", B:B) - SUMIF sums values in B:B where the corresponding cell in A:A equals "North".
Question 6: VLOOKUP vs INDEX-MATCH
What is a key advantage of INDEX-MATCH over VLOOKUP?
- INDEX-MATCH is easier to write
- INDEX-MATCH can look up values to the left of the lookup column
- INDEX-MATCH works faster on small datasets
- INDEX-MATCH doesn't require exact matches
Show Answer
B. INDEX-MATCH can look up values to the left of the lookup column - VLOOKUP can only return values from columns to the right of the lookup column, while INDEX-MATCH has no such limitation.
Question 7: Cell References
In Excel, what does $A$1 represent?
- A relative reference that changes when copied
- An absolute reference that stays fixed when copied
- A named range
- A formula error
Show Answer
B. An absolute reference that stays fixed when copied - The $ signs lock both the column (A) and row (1), so the reference won't change when the formula is copied.
Question 8: Data Quality
One record shows "Bangalore" and another shows "Bengaluru" for the same city. This is a problem with which data quality dimension?
- Accuracy
- Completeness
- Consistency
- Timeliness
Show Answer
C. Consistency - Consistency means the same information should be recorded the same way everywhere. Both spellings refer to the same city but are inconsistent.
Quiz Complete!
How did you do? If you got 6 or more correct, you have a solid understanding of Week 1 concepts!
- 8/8: Excellent! You're ready for Week 2
- 6-7/8: Good understanding - review the topics you missed
- 4-5/8: Fair - revisit Days 1-4 concepts
- Below 4: Consider re-reading the material before moving on
About the Instructor
Pawan Rama Mali
Data Analytics professional and educator with extensive experience in business intelligence, data visualization, and teaching complex analytical concepts to beginners. Passionate about making data analytics accessible to everyone and helping students build practical, job-ready skills.
Specializations:
- Data Analytics & Business Intelligence
- Excel, SQL, Python for Data Analysis
- R Programming & R Shiny Dashboards
- Statistical Analysis & Data Visualization