Zepto Inventory Data Analysis (SQL Project)

Project Overview

This project analyzes Zepto’s inventory dataset using SQL to extract insights related to pricing strategy, discount patterns, stock availability, and product distribution. The goal was to clean raw data and generate actionable business insights from retail inventory data.

Dataset Information

The dataset contains product-level inventory information including:

Category
Product Name
MRP
Discount Percentage
Available Quantity
Discounted Selling Price
Product Weight
Stock Availability Status

Data Cleaning Performed

Checked and handled missing values
Renamed incorrect column headers
Removed duplicate product records
Converted pricing fields from paise to rupees
Standardized boolean stock availability column
Added primary key column for row identification

Data Transformation Steps

Calculated price per gram for weight-based comparison
Created product weight segments (Low, Medium, Bulk)
Converted text-based TRUE/FALSE values into boolean format
Structured dataset for analytical queries

Business Questions Solved

The following analytical problems were explored:

Top 10 best-value products based on discount percentage
Products with high MRP but currently out of stock
Estimated revenue contribution by category
Products where MRP > ₹500 and discount < 10%
Top 5 categories offering highest average discount percentage
Price-per-gram comparison for products above 100g
Product segmentation into Low, Medium, and Bulk categories
Total inventory weight distribution per category

Sample SQL Query

Example: Top 5 categories offering highest average discount percentage

SELECT category, ROUND(AVG(discountPercent),2) AS avg_discount FROM zepto_v2 GROUP BY category ORDER BY avg_discount DESC LIMIT 5;

Key Insights Generated

Identified categories offering the highest average discounts to support promotion strategy decisions
Detected premium products currently unavailable in inventory indicating restocking opportunities
Evaluated pricing efficiency using price-per-gram comparison across product segments
Estimated category-level revenue contribution for inventory prioritization
Highlighted stock availability gaps across product categories

SQL Skills Demonstrated

Data Cleaning
Aggregations (SUM, AVG, COUNT)
Filtering with WHERE conditions
GROUP BY and HAVING clauses
CASE statements
Sorting and ranking insights
Business-oriented query design

Tools Used

MySQL
SQL
GitHub (Project Documentation)

Repository Structure

├── zepto_v2.csv # Raw dataset ├── sql_zepto_project.sql # Cleaning, transformation & analysis queries └── README.md # Project documentation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Zepto Inventory Data Analysis (SQL Project)

Project Overview

Dataset Information

Data Cleaning Performed

Data Transformation Steps

Business Questions Solved

Sample SQL Query

SELECT category, ROUND(AVG(discountPercent),2) AS avg_discount FROM zepto_v2 GROUP BY category ORDER BY avg_discount DESC LIMIT 5;

Key Insights Generated

SQL Skills Demonstrated

Tools Used

Repository Structure

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
sql zepto_project.sql		sql zepto_project.sql
zepto_v2.csv		zepto_v2.csv

Folders and files

Latest commit

History

Repository files navigation

Zepto Inventory Data Analysis (SQL Project)

Project Overview

Dataset Information

Data Cleaning Performed

Data Transformation Steps

Business Questions Solved

Sample SQL Query

SELECT category, ROUND(AVG(discountPercent),2) AS avg_discount FROM zepto_v2 GROUP BY category ORDER BY avg_discount DESC LIMIT 5;

Key Insights Generated

SQL Skills Demonstrated

Tools Used

Repository Structure

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages