This project delivers an end-to-end data analysis pipeline to evaluate vendor performance in a retail environment. It focuses on identifying profitability drivers, optimizing procurement decisions, and improving inventory efficiency using data-driven insights.
Retail companies often struggle with:
- Lack of visibility into vendor-wise profitability
- Inefficient procurement strategies leading to increased costs
- Overstocking or understocking due to poor inventory planning
- Over-reliance on a small subset of vendors
This project addresses these issues by analyzing vendor transactions, inventory flow, and pricing data.
- Analyze vendor-wise sales, purchases, and profit contribution
- Identify top-performing and underperforming vendors
- Evaluate inventory turnover and stock efficiency
- Detect pricing inefficiencies and cost leakages
- Provide actionable recommendations for business optimization
- Imported multiple datasets (sales, purchases, inventory)
- Handled missing values and removed duplicates
- Standardized column formats for consistency
- Aggregated vendor-level sales and purchase data
- Merged datasets to create a unified analytical view
- Engineered new features such as:
- Profit = Sales − Purchase Cost
- Profit Margin (%)
- Inventory Turnover Ratio
- Vendor-wise performance evaluation
- Profitability and margin analysis
- Inventory efficiency assessment
- Identification of high-risk and high-opportunity vendors
- Bar charts for top vendors by profit
- Distribution plots for profit margins
- Interactive dashboard using Power BI
- Python (Pandas, NumPy) → Data processing & analysis
- Matplotlib → Visualization
- SQL (optional) → Querying structured data
- Power BI → Interactive dashboard
- Jupyter Notebook → Exploratory Data Analysis
- Git & GitHub → Version control
Retail-Vendor-Performance-Analysis/
│
├── data/ # Raw datasets
├── notebooks/ # EDA notebooks
├── scripts/ # Modular Python scripts
│ ├── data_processing.py
│ ├── analysis.py
│ └── visualization.py
├── outputs/ # Processed outputs
├── main.py # Pipeline entry point
├── requirements.txt
└── README.md
- Total Sales per Vendor
- Total Purchase Cost
- Profit (Revenue − Cost)
- Profit Margin (%)
- Inventory Turnover Ratio
- A small group of vendors contributes disproportionately to total revenue → vendor dependency risk
- Several vendors show high margins but low sales → growth opportunity
- Bulk purchasing significantly reduces per-unit cost → procurement optimization
- Low inventory turnover indicates capital inefficiency and overstocking
- Enables data-driven vendor selection
- Reduces procurement costs through optimized purchasing
- Improves inventory planning and reduces wastage
- Supports strategic decision-making with measurable insights
git clone https://github.com/nirjanadas/Retail-Vendor-Performance-Analysis.git
cd Retail-Vendor-Performance-Analysispip install -r requirements.txtpython main.py- Processed results stored in /outputs
- Visualizations displayed during execution
- Open Power BI dashboard (.pbix) for interactive insights
- Top vendors ranked by profit
- Profit margin distribution
- Inventory turnover insights
- Build predictive model for vendor risk scoring
- Automate ETL pipeline using Airflow
- Deploy dashboard to Power BI Service / Web
- Integrate real-time data streams
- Data Cleaning & Preprocessing
- Feature Engineering
- Exploratory Data Analysis (EDA)
- Data Visualization
- Business Analytics & Insight Generation Modular Code Design
Nirjana Das
GitHub:nirjanadas