This project analyzes over a decade of customer sales transaction data to uncover sales trends, customer purchasing behavior, and product performance.
The dataset initially contained inconsistencies, missing values, and data quality issues which were cleaned and transformed before analysis.
The goal of this project is to demonstrate practical data cleaning, transformation, and exploratory analysis skills using Python.
The dataset contains sales transaction records including:
- Transaction dates
- Customer information
- Product purchased
- Sales locations
- Quantity and price
- Revenue generated
- Product return information
Both dirty and cleaned versions of the dataset are included for demonstration purposes.
- Python
- Pandas
- NumPy
- Matplotlib
- Jupyter Notebook
- Seaborn
The project followed these steps:
- Data loading and inspection
- Handling missing and inconsistent values
- Outlier detection and correction
- Feature engineering
- Exploratory data analysis
- Visualization of key trends
- Sales performance trends
- Product performance comparison
- Customer purchasing patterns
- Relationship between returns and ratings
- Revenue distribution insights
customer-sales-analysis/
│
├── data/
│ ├── cleaned_customer_sales_data.csv
│ └── customer_sales_data_dirty.csv
│
├── notebooks/
│ ├── sales_analysis.ipynb
│ └── requirements.txt
│
├── scripts/
├── visuals/
└── README.md
- Clone the repository
- Install dependencies
- Open the notebook in Jupyter Notebook
- Run all cells to reproduce the analysis
Briggs Jobi Data Analyst