Week 1: Data Foundations & Excel Analytics

A Complete Beginner's Guide to Starting Your Data Analytics Journey

Duration: 7 Days | Level: Beginner | No Prerequisites Required

Welcome to your first week in the exciting world of Data Analytics! Whether you're a fresh graduate, a working professional looking to switch careers, or someone simply curious about how companies make sense of their data, this guide is designed just for you.

Over the next seven days, you'll build a rock-solid foundation in data analytics concepts and gain practical skills in Excel that employers are actively looking for. By the end of this week, you'll be able to analyze real business data and create your first analytics dashboard.

Let's begin this transformative journey!

Day 1

Introduction to Data Analytics

Understanding the What, Why, and How of Data Analytics

Watch Video Lesson

What is Data Analytics?

Data Analytics is the science of examining raw data to discover patterns, draw conclusions, and make informed decisions. It involves collecting, cleaning, transforming, and modeling data to extract useful information that drives business strategy.

Think of data analytics as being a detective, but instead of solving crimes, you're solving business problems. Just like a detective collects evidence, examines clues, and connects the dots to solve a case, a data analyst collects data, examines patterns, and connects insights to solve business challenges.

Real-World Example: How You Already Use Data Analytics

Believe it or not, you already practice data analytics in your daily life:

  • Choosing a restaurant: You check ratings, read reviews, compare prices, and look at photos before deciding
  • Online shopping: You compare products, check prices across websites, read customer reviews
  • Planning a trip: You analyze weather data, compare flight prices over different dates, read hotel reviews

Professional data analytics follows the same logic, just at a much larger scale with more sophisticated tools.

The Four Types of Analytics

Data analytics can be categorized into four types, each building upon the previous one and offering increasingly valuable insights:

Type Question It Answers Example Complexity
Descriptive "What happened?" Sales dropped 15% last month Basic
Diagnostic "Why did it happen?" Sales dropped because competitor launched a discount campaign Intermediate
Predictive "What will happen?" Based on trends, sales will likely increase 10% next quarter Advanced
Prescriptive "What should we do?" Launch a loyalty program to retain customers and increase sales by 12% Expert

Example: Netflix and Analytics

Descriptive: 80% of users watched at least one show this week

Diagnostic: Viewership increased because a popular new series was released

Predictive: Based on viewing patterns, this user will likely enjoy sci-fi movies

Prescriptive: Show "Stranger Things" in the top row for users who watched similar sci-fi content

The Data Analytics Lifecycle

Every analytics project follows a structured process. Understanding this lifecycle is crucial for success:

  1. Define the Problem: Clearly understand what business question you're trying to answer. This is the most critical step - a well-defined problem is half-solved.
  2. Collect Data: Gather relevant data from various sources - databases, spreadsheets, APIs, surveys, etc.
  3. Clean & Prepare Data: Remove errors, handle missing values, format consistently. This typically takes 60-80% of an analyst's time!
  4. Analyze Data: Apply statistical methods, create visualizations, identify patterns and trends.
  5. Interpret Results: Draw meaningful conclusions from your analysis. What story does the data tell?
  6. Communicate Findings: Present insights to stakeholders through reports, dashboards, or presentations.
  7. Take Action: Implement recommendations and monitor outcomes.

Role of a Data Analyst in Organizations

As a data analyst, you'll wear many hats. Here's what a typical data analyst does:

  • Data Collection & Cleaning: Gathering data from multiple sources and ensuring it's accurate and usable
  • Data Analysis: Using statistical tools and software to analyze trends and patterns
  • Reporting & Visualization: Creating dashboards and reports that communicate insights clearly
  • Business Partnership: Working with different teams to understand their data needs
  • Problem Solving: Translating business problems into analytical questions and vice versa

Industry Applications

Data analytics is used across virtually every industry. Here are some examples:

BFSI (Banking, Financial Services, Insurance)

  • Fraud detection and prevention
  • Credit risk assessment
  • Customer segmentation for targeted marketing
  • Stock market trend analysis

Healthcare

  • Patient outcome prediction
  • Disease outbreak tracking
  • Hospital resource optimization
  • Drug effectiveness analysis

Retail & E-commerce

  • Customer behavior analysis
  • Inventory management
  • Price optimization
  • Recommendation systems

IT & Technology

  • User experience optimization
  • System performance monitoring
  • A/B testing for product features
  • Cybersecurity threat detection

Career Paths in Data Analytics

The data analytics field offers multiple career progression paths:

  1. Data Analyst (Entry Level): Analyze data, create reports, build dashboards
  2. Senior Data Analyst: Lead complex projects, mentor juniors, work with stakeholders
  3. Business Intelligence Analyst: Focus on strategic reporting and data visualization
  4. Data Scientist: Build predictive models, work with machine learning
  5. Analytics Manager: Lead analytics teams, define strategy
  6. Chief Data Officer: Executive-level data strategy leadership

Day 1 Key Takeaways

  • Data analytics is about turning raw data into actionable insights
  • There are four types: Descriptive, Diagnostic, Predictive, and Prescriptive
  • Every project follows the analytics lifecycle from problem definition to action
  • Data analysts are in high demand across all industries
  • Multiple career paths exist from entry-level analyst to C-suite executive
Day 2

Understanding Data Types & Structures

The Building Blocks of Data Analysis

Watch Video Lesson

Before you can analyze data effectively, you need to understand what types of data exist and how they're structured. This knowledge is fundamental because the type of data you're working with determines which analysis methods you can use.

Data Structure: Organized vs. Unorganized

Structured Data

Structured data is highly organized and fits neatly into rows and columns, like an Excel spreadsheet or a database table.

Example of Structured Data

Customer ID Name Age Purchase Amount Date
1001 Rahul Kumar 28 5,499 2024-01-15
1002 Priya Sharma 34 12,999 2024-01-16
1003 Amit Patel 45 3,250 2024-01-16

Easy to search, sort, and analyze. This is what you'll work with most often!

Unstructured Data

Unstructured data doesn't have a predefined format. It includes:

  • Text documents and emails
  • Social media posts
  • Images and videos
  • Audio recordings
  • Web pages

Unstructured data requires special techniques to analyze and often needs to be converted to structured format first.

Semi-structured Data

Semi-structured data falls between the two. It has some organizational properties but doesn't fit into traditional tables. Examples include:

  • JSON files
  • XML documents
  • Email (has fields like To, From, Subject, but body is unstructured)

Data Types: The Different Flavors of Data

Understanding data types is crucial because it determines how you can analyze and visualize information.

Numerical Data (Quantitative)

Numerical data represents quantities and can be measured. It comes in two forms:

Discrete Data

Countable values that cannot be broken down into smaller parts. Think of things you can count:

  • Number of customers (you can't have 2.5 customers)
  • Number of products sold
  • Number of website visits
  • Number of employees

Continuous Data

Measurable values that can take any value within a range, including decimals:

  • Height (175.5 cm)
  • Weight (68.3 kg)
  • Temperature (36.6 degrees)
  • Time (2.5 hours)
  • Revenue (Rs. 1,45,678.50)

Categorical Data (Qualitative)

Categorical data represents characteristics or qualities that can be grouped into categories.

Nominal Data

Categories with no inherent order or ranking:

  • Gender: Male, Female, Other
  • Blood Type: A, B, AB, O
  • City: Mumbai, Delhi, Bangalore
  • Product Category: Electronics, Clothing, Books

You can count them but can't rank them - is "Mumbai" greater than "Delhi"? No!

Ordinal Data

Categories with a meaningful order, but intervals between them aren't equal:

  • Education: High School < Bachelor's < Master's < PhD
  • Customer Satisfaction: Poor < Average < Good < Excellent
  • T-shirt Size: S < M < L < XL
  • Movie Rating: 1 star < 2 stars < 3 stars < 4 stars < 5 stars

You can rank them, but the "distance" between categories isn't necessarily equal.

Data Type Can Count? Can Order? Can Measure? Example
Nominal Yes No No Colors, Cities
Ordinal Yes Yes No Ratings, Rankings
Discrete Yes Yes Yes (whole numbers) Count of items
Continuous Yes Yes Yes (any value) Height, Weight

Date/Time Data

Date and time data is special because it can be treated as both categorical and numerical depending on your analysis needs.

  • As Categorical: Group sales by month (January, February, etc.) or day of week (Monday, Tuesday)
  • As Numerical: Calculate time differences, trends over time

Common date/time formats you'll encounter:

  • DD/MM/YYYY (15/01/2024) - Common in India and Europe
  • MM/DD/YYYY (01/15/2024) - Common in USA
  • YYYY-MM-DD (2024-01-15) - ISO standard, best for sorting
  • DD-Mon-YYYY (15-Jan-2024)

Pro Tip: Always Use Consistent Date Formats

Inconsistent date formats are one of the most common data quality issues. When working with dates, always ensure consistency across your entire dataset.

Data Quality Dimensions

"Garbage in, garbage out" - the quality of your analysis is only as good as the quality of your data. Here are the key dimensions of data quality:

1. Accuracy

Is the data correct? Does it reflect reality?

Example: A customer's age is recorded as 150 years - clearly inaccurate!

2. Completeness

Is all required data present? Are there missing values?

Example: 20% of customer records are missing email addresses - incomplete data!

3. Consistency

Is the same information recorded the same way everywhere?

Example: One record says "Bangalore" while another says "Bengaluru" for the same city - inconsistent!

4. Timeliness

Is the data up-to-date? Is it available when needed?

5. Validity

Does the data conform to defined rules and formats?

Example: Phone number field contains "abc123" - invalid format!

Common Data Formats

As a data analyst, you'll work with various file formats. Here are the most common ones:

Format Full Name Best For Pros Cons
CSV Comma-Separated Values Simple tabular data Universal, lightweight No formatting, limited data types
Excel (.xlsx) Excel Workbook Business data, reports Formatting, formulas, multiple sheets Larger file size, proprietary
JSON JavaScript Object Notation Web data, APIs Flexible structure, widely used Can be complex to read
XML Extensible Markup Language Configuration, data exchange Self-describing, widely supported Verbose, larger files

Day 2 Key Takeaways

  • Data can be structured (tables), unstructured (text, images), or semi-structured (JSON, XML)
  • Numerical data can be discrete (countable) or continuous (measurable)
  • Categorical data can be nominal (no order) or ordinal (ordered)
  • Data quality has five dimensions: Accuracy, Completeness, Consistency, Timeliness, Validity
  • CSV and Excel are the most common formats you'll encounter
Day 3

Business Questions & Problem Framing

The Art of Asking the Right Questions

Watch Video Lesson

The most critical skill a data analyst can develop isn't technical - it's the ability to ask the right questions. A perfectly executed analysis that answers the wrong question is worthless. Today, we'll learn how to frame business problems into questions that data can answer.

Problem Framing is the process of converting vague business challenges into specific, measurable, and answerable questions that guide your analysis.

Why Asking the Right Questions Matters

Consider this scenario:

The Wrong Approach

Manager says: "Our sales are down. Can you look at the data and tell me what's happening?"

Analyst thinks: "Let me create every possible chart and report about sales and show them all."

Result: Weeks of work, hundreds of charts, no clear insight, frustrated manager.

The Right Approach

Manager says: "Our sales are down. Can you look at the data?"

Analyst asks: "When you say 'down,' compared to what? Last month? Last year? A target? Which products or regions are you most concerned about?"

Result: Focused analysis, clear insights, actionable recommendations.

The SMART Framework for Business Questions

Good analytical questions follow the SMART framework:

  • S - Specific: Clearly defined, not vague
  • M - Measurable: Can be answered with data
  • A - Actionable: The answer leads to a decision or action
  • R - Relevant: Aligned with business goals
  • T - Time-bound: Has a clear time frame
Vague Question SMART Question
"Are customers happy?" "What percentage of customers rated their experience 4 or 5 stars in Q4 2024?"
"Is marketing working?" "Which marketing channel generated the highest ROI in the last 6 months?"
"Why are sales bad?" "Which product categories showed a decline of more than 10% compared to the same quarter last year?"
"How can we grow?" "Which customer segments have the highest lifetime value and lowest acquisition cost?"

Translating Business Problems to Analytical Questions

Here's a step-by-step process to translate business problems into analytical questions:

  1. Listen carefully: Understand what the stakeholder is really asking
  2. Clarify ambiguity: Ask follow-up questions to remove vagueness
  3. Identify the metric: What exactly needs to be measured?
  4. Define the comparison: Compared to what?
  5. Set the time frame: What period are we analyzing?
  6. Consider segmentation: Should we break this down by category, region, customer type?

Example: E-commerce Sales Decline

Business Problem: "Our online sales have dropped"

Clarifying Questions:

  • By how much? (20% decline)
  • Compared to when? (Same month last year)
  • All products or specific categories? (Electronics mainly)
  • All regions? (North India specifically)

Refined Analytical Question: "Why did electronics sales in North India decline by 20% in November 2024 compared to November 2023, and which specific factors contributed to this decline?"

Stakeholder Requirements Gathering

Before starting any analysis, you need to understand what your stakeholders actually need. Key questions to ask:

  • What decision will this analysis inform? This helps you focus on actionable insights.
  • Who will use this analysis? Executive summary for leadership? Detailed report for the team?
  • When do you need it? Urgent decisions need quick analysis; strategic planning allows deeper exploration.
  • What format do you prefer? Dashboard? Report? Presentation?
  • What data is available? No point planning an analysis if the required data doesn't exist.

Hypothesis Formation

A hypothesis is an educated guess about what you expect to find in the data. Forming hypotheses before diving into analysis helps you:

  • Stay focused on what matters
  • Avoid getting lost in the data
  • Recognize unexpected findings

Hypothesis Structure

"I believe [X] because [reason], and I can verify this by [measurement]."

Example: "I believe sales dropped in North India because a major competitor opened stores there last quarter. I can verify this by comparing sales trends before and after the competitor's entry and analyzing lost customers' zip codes."

Day 3 Key Takeaways

  • Asking the right question is more important than technical analysis skills
  • Use the SMART framework: Specific, Measurable, Actionable, Relevant, Time-bound
  • Always clarify vague requests before starting analysis
  • Understand stakeholder needs: What decision? Who? When? What format?
  • Form hypotheses to guide your analysis and stay focused
Day 4

KPIs & Metrics Design

Measuring What Matters

Watch Video Lesson

Today we'll learn about Key Performance Indicators (KPIs) and metrics - the numbers that tell you whether a business is succeeding or failing. Understanding KPIs is essential because they form the foundation of any business dashboard or report you'll create.

What's the Difference: Metrics vs. KPIs?

Metric: Any quantifiable measure that tracks some aspect of business performance.

KPI (Key Performance Indicator): A metric that is critical to the success of the business. All KPIs are metrics, but not all metrics are KPIs.

Example: E-commerce Store

Metrics (many things you can measure):

  • Number of page views
  • Time spent on site
  • Number of products viewed
  • Cart additions
  • Bounce rate

KPIs (the critical few):

  • Revenue
  • Conversion Rate
  • Customer Acquisition Cost

Leading vs. Lagging Indicators

Understanding this distinction is crucial for effective business management:

Aspect Leading Indicators Lagging Indicators
Definition Predict future performance Measure past performance
When to use To make proactive decisions To evaluate results
Example (Sales) Number of sales calls made Monthly revenue
Example (Health) Hours of exercise per week Weight/BMI
Advantage Can take corrective action early Accurate, factual

Industry-Specific KPIs

Different industries and functions track different KPIs. Here are the most important ones:

Sales KPIs

  • Revenue: Total money earned from sales
  • Conversion Rate: (Customers / Visitors) × 100
  • Average Order Value (AOV): Revenue / Number of Orders
  • Customer Acquisition Cost (CAC): Marketing Spend / New Customers
  • Customer Lifetime Value (LTV): Average purchase value × Purchase frequency × Customer lifespan
LTV:CAC Ratio = Customer Lifetime Value / Customer Acquisition Cost
A healthy ratio is 3:1 or higher (you earn 3x what you spend to acquire a customer)

Marketing KPIs

  • Click-Through Rate (CTR): (Clicks / Impressions) × 100
  • Cost Per Click (CPC): Ad Spend / Number of Clicks
  • Return on Ad Spend (ROAS): Revenue from Ads / Ad Spend
  • Engagement Rate: (Likes + Comments + Shares) / Followers × 100

Operations KPIs

  • Efficiency: Output / Input × 100
  • Utilization Rate: Actual Hours Worked / Available Hours × 100
  • Defect Rate: Defective Units / Total Units × 100
  • On-Time Delivery Rate: On-Time Deliveries / Total Deliveries × 100

Finance KPIs

  • Return on Investment (ROI): (Gain - Cost) / Cost × 100
  • Profit Margin: Net Profit / Revenue × 100
  • Cash Flow: Cash In - Cash Out
  • Accounts Receivable Turnover: Net Sales / Average Accounts Receivable

Calculating ROI - A Simple Example

You invest Rs. 10,000 in a marketing campaign and generate Rs. 25,000 in sales.

ROI = (25,000 - 10,000) / 10,000 × 100 = 150%

For every rupee spent, you earned Rs. 1.50 in return. That's a great ROI!

Setting KPI Targets and Benchmarks

A KPI without a target is just a number. Here's how to set meaningful targets:

  1. Historical Benchmarking: Compare to your own past performance
  2. Industry Benchmarking: Compare to industry standards
  3. Competitor Benchmarking: Compare to competitors (if data is available)
  4. Goal-Based: Work backward from business objectives

KPI Dashboard Best Practices

When building KPI dashboards, follow these principles:

  • Less is more: Focus on 5-7 KPIs maximum. More is confusing.
  • Show trends: Include historical data to show direction
  • Use visual cues: Red/yellow/green to indicate status
  • Enable drill-down: Allow users to explore underlying data
  • Update regularly: Stale data is worse than no data

Day 4 Key Takeaways

  • KPIs are the vital few metrics that indicate business success
  • Leading indicators predict future; lagging indicators measure past
  • Different functions track different KPIs (Sales, Marketing, Ops, Finance)
  • Always set targets using benchmarks (historical, industry, competitor, goal-based)
  • Effective dashboards focus on 5-7 KPIs with trends and visual cues
Day 5

Excel Fundamentals for Analytics

Your First Analytics Tool

Watch Video Lesson

Excel is where most data analysts start their careers. Despite advanced tools like Python and R, Excel remains essential because it's universally available, easy to learn, and perfect for quick analysis. Today we'll master the fundamentals.

Practice Along With Sample Data

Download these beginner-friendly datasets to practice the Excel functions below:

Essential Excel Functions for Data Analysis

Basic Aggregation Functions

=SUM(range)
Adds all numbers in a range
Example: =SUM(B2:B100) adds all values from B2 to B100
=AVERAGE(range)
Calculates the arithmetic mean
Example: =AVERAGE(C2:C50) finds the average of C2 to C50
=COUNT(range)
Counts cells containing numbers
Example: =COUNT(A2:A100) counts how many numeric values
=COUNTA(range)
Counts non-empty cells (numbers AND text)
Example: =COUNTA(A2:A100) counts all non-blank cells
=COUNTBLANK(range)
Counts empty cells - useful for finding missing data!
Example: =COUNTBLANK(D2:D100)

Statistical Functions

=MIN(range) and =MAX(range)
Find the smallest and largest values
Example: =MIN(B2:B100) finds the minimum value
=MEDIAN(range)
Finds the middle value (better than average for skewed data)
Example: =MEDIAN(C2:C100)
=MODE.SNGL(range)
Finds the most frequently occurring value
Example: =MODE.SNGL(D2:D100)

Logical Functions

=IF(condition, value_if_true, value_if_false)
Makes decisions based on conditions
Example: =IF(B2>1000, "High Value", "Low Value")
=AND(condition1, condition2, ...)
Returns TRUE only if ALL conditions are true
Example: =AND(B2>100, B2<500) - TRUE if B2 is between 100 and 500
=OR(condition1, condition2, ...)
Returns TRUE if ANY condition is true
Example: =OR(A2="Red", A2="Blue") - TRUE if A2 is Red or Blue

Combining IF with AND/OR

Classify customers based on purchase amount and frequency:

=IF(AND(B2>10000, C2>5), "Premium", "Regular")

This returns "Premium" if purchase amount (B2) is over 10,000 AND frequency (C2) is over 5.

Text Functions

=LEFT(text, num_chars)
Extracts characters from the left
Example: =LEFT("ABCDEF", 3) returns "ABC"
=RIGHT(text, num_chars)
Extracts characters from the right
Example: =RIGHT("ABCDEF", 2) returns "EF"
=MID(text, start_position, num_chars)
Extracts characters from the middle
Example: =MID("ABCDEF", 2, 3) returns "BCD"
=CONCATENATE(text1, text2, ...) or =text1 & text2
Joins text strings together
Example: =A2 & " " & B2 combines first and last name with a space

Cell References: The Foundation of Excel

Understanding cell references is crucial for building formulas that work correctly when copied.

Type Syntax Behavior When Copied Use Case
Relative A1 Changes based on new position Most calculations
Absolute $A$1 Never changes Referencing fixed values (tax rate, exchange rate)
Mixed (Row) A$1 Column changes, row stays fixed Looking up from a fixed row
Mixed (Column) $A1 Row changes, column stays fixed Looking up from a fixed column

Pro Tip: F4 Shortcut

Press F4 while editing a cell reference to cycle through reference types: A1 → $A$1 → A$1 → $A1 → A1

Named Ranges and Tables

Instead of referring to ranges like B2:B100, you can give them meaningful names:

  • Select the range
  • Click in the Name Box (left of formula bar)
  • Type a name (e.g., "SalesData")
  • Now use =SUM(SalesData) instead of =SUM(B2:B100)

This makes formulas more readable and easier to maintain.

Data Validation

Data validation prevents bad data from entering your spreadsheet:

  • Dropdown lists: Limit choices to predefined options
  • Number ranges: Only allow numbers between X and Y
  • Date ranges: Only allow dates in a specific range
  • Text length: Limit the number of characters

Access via: Data Tab → Data Validation

Try It Yourself - Day 5 Exercises

Download the Week 1 Sales Dataset and complete these exercises:

  1. Exercise 1: Calculate the total revenue (Total_Amount column) using SUM
  2. Exercise 2: Find the average order value using AVERAGE
  3. Exercise 3: Count how many orders are in the dataset using COUNTA
  4. Exercise 4: Use IF to create a new column "Order_Size" - "Large" if Total_Amount > 5000, otherwise "Small"
  5. Exercise 5: Extract the month from the Date column using MID or TEXT function
  6. Exercise 6: Find the MIN and MAX values in the Total_Amount column

Expected Answers: Total Revenue: ~Rs. 11,00,000+ | Average Order: ~Rs. 3,100 | Total Orders: 352

Day 5 Key Takeaways

  • Master SUM, AVERAGE, COUNT, COUNTA, COUNTBLANK for basic analysis
  • Use IF, AND, OR for creating conditional logic
  • Text functions (LEFT, RIGHT, MID) help clean and extract data
  • Understand relative ($A$1), absolute (A1), and mixed references ($A1, A$1)
  • Named ranges make formulas readable; data validation prevents errors
Day 6

Advanced Excel & Data Analysis

Lookup Functions, Conditional Aggregations, and Pivot Tables

Watch Video Lesson

Today we'll learn the functions that separate beginner Excel users from intermediate analysts. These skills are mentioned in almost every data analyst job description!

Practice Datasets for Advanced Excel

These datasets are ideal for practicing VLOOKUP, INDEX-MATCH, and Pivot Tables:

Lookup Functions: Finding Data Across Tables

VLOOKUP (Vertical Lookup)

VLOOKUP searches for a value in the first column of a range and returns a value from another column.

=VLOOKUP(lookup_value, table_array, col_index_num, [range_lookup])

lookup_value: The value you're searching for
table_array: The table to search in
col_index_num: Which column to return (1, 2, 3...)
range_lookup: FALSE for exact match (use this 99% of the time)

VLOOKUP Example

You have a Product ID and want to find the product name from a product list.

=VLOOKUP(A2, Products!A:C, 2, FALSE)

This looks up the value in A2 in the Products sheet, and returns the value from column 2 (product name).

INDEX-MATCH: The Better Alternative

INDEX-MATCH is more flexible than VLOOKUP because:

  • It can look left (VLOOKUP can only look right)
  • It doesn't break when you insert columns
  • It's faster on large datasets
=INDEX(return_range, MATCH(lookup_value, lookup_range, 0))

MATCH finds the position of the lookup value
INDEX returns the value at that position from the return range

INDEX-MATCH Example

Same lookup as above, but using INDEX-MATCH:

=INDEX(Products!B:B, MATCH(A2, Products!A:A, 0))

MATCH finds A2 in column A. INDEX returns the corresponding value from column B.

XLOOKUP (Excel 365)

If you have Excel 365, XLOOKUP combines the best of both worlds:

=XLOOKUP(lookup_value, lookup_array, return_array, [if_not_found])

Example: =XLOOKUP(A2, Products!A:A, Products!B:B, "Not Found")

Conditional Aggregation Functions

These functions aggregate data based on conditions - essential for any analyst!

SUMIF and SUMIFS

=SUMIF(range, criteria, [sum_range])
Sum values based on one condition
Example: =SUMIF(A:A, "Electronics", B:B)
Sums all values in column B where column A is "Electronics"
=SUMIFS(sum_range, criteria_range1, criteria1, criteria_range2, criteria2, ...)
Sum values based on multiple conditions
Example: =SUMIFS(C:C, A:A, "Electronics", B:B, "North")
Sums column C where A is "Electronics" AND B is "North"

COUNTIF and COUNTIFS

=COUNTIF(range, criteria)
Count cells based on one condition
Example: =COUNTIF(A:A, ">1000")
Counts how many values in column A are greater than 1000
=COUNTIFS(range1, criteria1, range2, criteria2, ...)
Count cells based on multiple conditions
Example: =COUNTIFS(A:A, "Mumbai", B:B, ">=10000")
Counts rows where city is Mumbai AND amount is >= 10000

AVERAGEIF and AVERAGEIFS

Same logic as SUMIF/SUMIFS, but calculates average instead of sum.

=AVERAGEIF(A:A, "Premium", B:B)
Average of column B where column A is "Premium"

Pivot Tables: The Most Powerful Excel Feature

Pivot Tables are arguably the most important feature for data analysts. They allow you to summarize, analyze, explore, and present large datasets in seconds.

Creating a Pivot Table

  1. Select your data (including headers)
  2. Insert → PivotTable
  3. Choose where to place it (new worksheet recommended)
  4. Drag fields to the appropriate areas:
    • Rows: Categories to group by (e.g., Product, Region)
    • Columns: Secondary grouping (e.g., Month, Year)
    • Values: Numbers to summarize (e.g., Sum of Sales)
    • Filters: Fields to filter the entire table

Pivot Table Example

You have sales data with columns: Date, Region, Product, Salesperson, Amount

To see total sales by Region and Product:

  • Rows: Region
  • Columns: Product
  • Values: Sum of Amount

In seconds, you get a cross-tabulation showing sales for every Region-Product combination!

Pivot Table Features

  • Grouping: Group dates by month/quarter/year, or group numbers into ranges
  • Calculated Fields: Create new calculations within the pivot table
  • Slicers: Visual filters that make your pivot table interactive
  • Pivot Charts: Create charts directly from pivot tables

Sorting, Filtering & Conditional Formatting

Sorting

Select your data → Data tab → Sort. You can sort by multiple columns (e.g., first by Region, then by Sales within each region).

Filtering

Select your data → Data tab → Filter. Dropdown arrows appear in headers. Click to filter by specific values, text, or conditions.

Conditional Formatting

Automatically format cells based on their values. Common uses:

  • Data Bars: Show relative values with bars inside cells
  • Color Scales: Gradient from low to high (red → green)
  • Icon Sets: Arrows, traffic lights, stars based on values
  • Highlight Rules: Color cells greater than X, top 10, duplicates, etc.

Try It Yourself - Day 6 Exercises

Continue using the Week 1 Sales Dataset for these advanced exercises:

  1. Exercise 1 (SUMIF): Calculate total sales for "Electronics" category only
  2. Exercise 2 (SUMIFS): Calculate total sales for "North" region AND "Electronics" category
  3. Exercise 3 (COUNTIF): Count how many orders are from "Returning" customers
  4. Exercise 4 (AVERAGEIF): Find the average order value for "New" customers only
  5. Exercise 5 (Pivot Table): Create a pivot table showing Total Sales by Region (rows) and Product Category (columns)
  6. Exercise 6 (Pivot Table): Create a pivot table showing Monthly Sales Trend (group Date by Month)
  7. Exercise 7: Apply conditional formatting to highlight orders above Rs. 5,000

Hints: Electronics total: ~Rs. 2,80,000 | North Electronics: ~Rs. 70,000+ | Returning customers: ~175 orders

Day 6 Key Takeaways

  • VLOOKUP is useful but INDEX-MATCH is more flexible
  • SUMIF/COUNTIF/AVERAGEIF aggregate data based on conditions
  • Add "S" (SUMIFS, COUNTIFS) for multiple conditions
  • Pivot Tables are the most powerful Excel feature for data analysis
  • Conditional formatting makes data patterns visible at a glance
Day 7

Week 1 Project & Assessment

Putting It All Together

Watch Video Lesson

Congratulations on reaching the final day of Week 1! Today, you'll apply everything you've learned to create your first complete analytics project: a Sales Performance Dashboard.

Project: Sales Performance Analysis Dashboard

Project Brief

You've been hired as a data analyst for a retail company. The Sales Director has asked you to analyze the last 6 months of sales data and create a dashboard that answers key business questions.

Dataset Description

You'll work with a sales dataset containing:

  • Date: Transaction date
  • Order ID: Unique identifier for each order
  • Region: North, South, East, West
  • Product Category: Electronics, Clothing, Home & Kitchen, Sports
  • Product Name: Specific product
  • Quantity: Units sold
  • Unit Price: Price per unit
  • Total Amount: Quantity × Unit Price
  • Customer Type: New, Returning

EDUSHARK Project Dataset - Ready to Use!

We've prepared a custom dataset specifically designed for this Week 1 project. It contains 352 sales transactions over 6 months with all the required columns.

Download Week 1 Sales Dataset (CSV)

352 rows | 9 columns | Jan-Jun 2024 | Perfect for Excel analysis

Additional Practice Datasets

Want more practice? Try these free datasets from Kaggle and other sources:

Recommended Datasets for Week 1 Project:

Beginner Practice Datasets:

Additional Resources:

Tip: Download multiple datasets and practice the same analysis techniques on different data to strengthen your skills!

Project Requirements

Part 1: Data Cleaning & Preparation

  1. Check for and handle missing values
  2. Verify data types are correct
  3. Check for duplicate records
  4. Add calculated columns if needed (e.g., Month, Week)

Part 2: Analysis Questions to Answer

  1. Overall Performance:
    • What was the total revenue for the 6-month period?
    • How many orders were placed?
    • What was the average order value?
  2. Regional Analysis:
    • Which region generated the most revenue?
    • How does order volume vary by region?
  3. Product Analysis:
    • Which product category is the bestseller?
    • What are the top 5 products by revenue?
  4. Time Trends:
    • How did monthly sales change over time?
    • Are there any seasonal patterns?
  5. Customer Analysis:
    • What percentage of revenue comes from returning customers?

Part 3: KPIs to Calculate

Create a KPI summary section with:

  1. Total Revenue
  2. Total Orders
  3. Average Order Value (AOV)
  4. Revenue Growth (Month over Month)
  5. Returning Customer Rate

Part 4: Dashboard Creation

Build a summary dashboard that includes:

  • KPI boxes at the top
  • Pivot Table showing sales by Region and Category
  • Pivot Chart showing monthly sales trend
  • Top 5 products list

Part 5: Recommendations

Based on your analysis, write 3 actionable recommendations for the Sales Director. For example:

  • "Focus marketing efforts on [Region] as it shows the highest growth potential..."
  • "Consider expanding the [Category] product line as it has the highest average order value..."

Assessment Criteria

Component Weight What's Evaluated
Project Submission 50% Correctness, completeness, presentation quality
Quiz: Week 1 Concepts 30% Understanding of analytics concepts, data types, KPIs, Excel functions
Peer Review 20% Quality of feedback given to peers

Tips for Success

  • Start with questions: Always refer back to the business questions you're trying to answer
  • Clean first: Don't start analysis until your data is clean
  • Document your work: Add comments explaining your formulas and logic
  • Format for readability: Use consistent formatting, clear labels, and appropriate number formats
  • Tell a story: Arrange your dashboard so it tells a logical story from overview to details

Week 1 Complete!

You've completed an intensive week covering:

  • What data analytics is and how it's used in organizations
  • Different types of data and quality dimensions
  • How to frame business problems as analytical questions
  • Understanding KPIs and metrics across industries
  • Essential Excel functions for data analysis
  • Advanced Excel including VLOOKUP, INDEX-MATCH, and Pivot Tables

You've built your first analytics dashboard!

Next week, you'll dive into SQL and databases - the foundation of working with large-scale data!

Quiz

Week 1 Self-Assessment Quiz

Test Your Understanding

Answer these questions to check your understanding of Week 1 concepts. Try to answer without looking back!

Question 1: Types of Analytics

A company wants to predict how many units of a product will sell next quarter. Which type of analytics is this?

  1. Descriptive Analytics
  2. Diagnostic Analytics
  3. Predictive Analytics
  4. Prescriptive Analytics
Show Answer

C. Predictive Analytics - Predictive analytics answers "What will happen?" by forecasting future outcomes based on historical data.

Question 2: Data Types

Customer satisfaction ratings (Poor, Average, Good, Excellent) are an example of which data type?

  1. Nominal Data
  2. Ordinal Data
  3. Discrete Data
  4. Continuous Data
Show Answer

B. Ordinal Data - Ordinal data has a meaningful order (Poor < Average < Good < Excellent), but the intervals between categories aren't necessarily equal.

Question 3: SMART Framework

Which of these is a SMART business question?

  1. "How can we increase sales?"
  2. "Are customers happy with our service?"
  3. "Which product category showed more than 15% growth in Q4 2024 compared to Q4 2023?"
  4. "Why is marketing not working?"
Show Answer

C. - This question is Specific (product category, growth rate), Measurable (15%), Actionable, Relevant, and Time-bound (Q4 2024 vs Q4 2023).

Question 4: KPIs

A "leading indicator" is best described as:

  1. A metric that measures past performance
  2. A metric that predicts future performance
  3. The most important KPI in a dashboard
  4. A metric used only by senior leadership
Show Answer

B. A metric that predicts future performance - Leading indicators (like sales calls made) help predict future outcomes, while lagging indicators (like revenue) measure past results.

Question 5: Excel Functions

Which formula correctly calculates the total sales only for the "North" region?

  1. =SUM(B:B, "North")
  2. =SUMIF(A:A, "North", B:B)
  3. =COUNTIF(A:A, "North", B:B)
  4. =TOTAL(A:A="North", B:B)
Show Answer

B. =SUMIF(A:A, "North", B:B) - SUMIF sums values in B:B where the corresponding cell in A:A equals "North".

Question 6: VLOOKUP vs INDEX-MATCH

What is a key advantage of INDEX-MATCH over VLOOKUP?

  1. INDEX-MATCH is easier to write
  2. INDEX-MATCH can look up values to the left of the lookup column
  3. INDEX-MATCH works faster on small datasets
  4. INDEX-MATCH doesn't require exact matches
Show Answer

B. INDEX-MATCH can look up values to the left of the lookup column - VLOOKUP can only return values from columns to the right of the lookup column, while INDEX-MATCH has no such limitation.

Question 7: Cell References

In Excel, what does $A$1 represent?

  1. A relative reference that changes when copied
  2. An absolute reference that stays fixed when copied
  3. A named range
  4. A formula error
Show Answer

B. An absolute reference that stays fixed when copied - The $ signs lock both the column (A) and row (1), so the reference won't change when the formula is copied.

Question 8: Data Quality

One record shows "Bangalore" and another shows "Bengaluru" for the same city. This is a problem with which data quality dimension?

  1. Accuracy
  2. Completeness
  3. Consistency
  4. Timeliness
Show Answer

C. Consistency - Consistency means the same information should be recorded the same way everywhere. Both spellings refer to the same city but are inconsistent.

Quiz Complete!

How did you do? If you got 6 or more correct, you have a solid understanding of Week 1 concepts!

  • 8/8: Excellent! You're ready for Week 2
  • 6-7/8: Good understanding - review the topics you missed
  • 4-5/8: Fair - revisit Days 1-4 concepts
  • Below 4: Consider re-reading the material before moving on

About the Instructor

Pawan Rama Mali

Data Analytics professional and educator with extensive experience in business intelligence, data visualization, and teaching complex analytical concepts to beginners. Passionate about making data analytics accessible to everyone and helping students build practical, job-ready skills.

Specializations:

  • Data Analytics & Business Intelligence
  • Excel, SQL, Python for Data Analysis
  • R Programming & R Shiny Dashboards
  • Statistical Analysis & Data Visualization

Visit Instructor's Portfolio