This project presents a complete end-to-end sales analysis of a retail superstore dataset using Microsoft Excel.
It demonstrates how raw transactional data can be transformed into actionable business insights through data cleaning, analysis, and visualization.
The project focuses on sales performance, profitability, customer behavior, and business optimization strategies.
- Analyze overall sales and profit trends
- Identify top-performing and underperforming products
- Evaluate regional and segment-wise performance
- Understand the impact of discounts on profit
- Perform customer segmentation using RFM analysis
- Estimate Customer Lifetime Value (LTV)
- Build a dashboard for decision-making
- Data Cleaning & Data Validation
- Exploratory Data Analysis (EDA)
- Feature Engineering
- Business Intelligence (BI)
- Statistical Analysis
- Customer Analytics (RFM, LTV, ABC Analysis)
- Dashboard Design & Data Storytelling
The dataset contains retail transaction data with the following key fields:
- Order Details: Order ID, Order Date, Ship Date
- Customer Info: Customer Name, Segment
- Product Info: Category, Sub-Category, Manufacturer
- Sales Metrics: Sales, Profit, Discount, Quantity
- Geographic Data: Region/State
- Cleaned raw dataset
- Handled missing values
- Created a Data Dictionary
- Performed data quality checks
- Created derived columns (e.g., profitability metrics)
- Structured dataset for analysis
- Performed statistical analysis
- Identified trends, patterns, and anomalies
- Category & Sub-category performance
- Regional sales distribution
- Customer segment analysis
- Product performance evaluation
- Recency → Last purchase timing
- Frequency → Purchase frequency
- Monetary → Spending amount
Used to identify:
- High-value customers
- Loyal customers
- At-risk customers
- Estimated long-term customer value
- Helps in retention and targeting strategies
- A → High-value customers/products
- B → Medium-value
- C → Low-value
- Identified loss-making transactions
- Highlighted profit leakage areas
- Analyzed discount vs profit relationship
- Detected over-discounting impact
- Monthly sales trends
- Seasonality patterns
- Region/state-wise performance
- Identified growth opportunities
An interactive dashboard was created to visualize:
- Sales & Profit KPIs
- Category performance
- Regional insights
- Customer analytics
- High discounts negatively impact profitability
- Some products generate high sales but low profit
- A small group of customers drives major revenue
- Regional performance varies significantly
- Microsoft Excel
- Pivot Tables
- Advanced Formulas
- Data Cleaning
- Dashboard Creation
Superstore-Sales-Analysis/ │ ├── Superstore Sales.xlsx # Raw dataset ├── Data Dictionary # Column definitions ├── Data Quality Report # Data validation checks ├── Analysis # Processed dataset ├── Statistical Overview # Summary statistics ├── Pivot Analysis Sheets # Business insights ├── Advanced Analysis # RFM, LTV, ABC ├── Seasonality Analysis # Time trends ├── Geographic Analysis # Region insights └── Dashboard # Final visualization
This project helps:
- Improve profitability
- Optimize pricing & discount strategies
- Identify high-value customers
- Support data-driven decision-making
This project demonstrates the ability to:
- Perform end-to-end data analysis
- Extract meaningful business insights
- Build decision-support dashboards
- Power BI / Tableau Dashboard
- Python-based Automation (Pandas, NumPy)
- Sales Forecasting (Machine Learning)
- SQL Integration