This project analyzes Zepto’s inventory dataset using SQL to extract insights related to pricing strategy, discount patterns, stock availability, and product distribution. The goal was to clean raw data and generate actionable business insights from retail inventory data.
The dataset contains product-level inventory information including:
- Category
- Product Name
- MRP
- Discount Percentage
- Available Quantity
- Discounted Selling Price
- Product Weight
- Stock Availability Status
- Checked and handled missing values
- Renamed incorrect column headers
- Removed duplicate product records
- Converted pricing fields from paise to rupees
- Standardized boolean stock availability column
- Added primary key column for row identification
- Calculated price per gram for weight-based comparison
- Created product weight segments (Low, Medium, Bulk)
- Converted text-based TRUE/FALSE values into boolean format
- Structured dataset for analytical queries
The following analytical problems were explored:
- Top 10 best-value products based on discount percentage
- Products with high MRP but currently out of stock
- Estimated revenue contribution by category
- Products where MRP > ₹500 and discount < 10%
- Top 5 categories offering highest average discount percentage
- Price-per-gram comparison for products above 100g
- Product segmentation into Low, Medium, and Bulk categories
- Total inventory weight distribution per category
Example: Top 5 categories offering highest average discount percentage
SELECT category, ROUND(AVG(discountPercent),2) AS avg_discount FROM zepto_v2 GROUP BY category ORDER BY avg_discount DESC LIMIT 5;
- Identified categories offering the highest average discounts to support promotion strategy decisions
- Detected premium products currently unavailable in inventory indicating restocking opportunities
- Evaluated pricing efficiency using price-per-gram comparison across product segments
- Estimated category-level revenue contribution for inventory prioritization
- Highlighted stock availability gaps across product categories
- Data Cleaning
- Aggregations (SUM, AVG, COUNT)
- Filtering with WHERE conditions
- GROUP BY and HAVING clauses
- CASE statements
- Sorting and ranking insights
- Business-oriented query design
- MySQL
- SQL
- GitHub (Project Documentation)
├── zepto_v2.csv # Raw dataset ├── sql_zepto_project.sql # Cleaning, transformation & analysis queries └── README.md # Project documentation