This project demonstrates a modern data engineering workflow using the Medallion Architecture (Bronze → Silver → Gold) to deliver operational logistics insights.
It's designed for enterprise-scale logistics or transport companies to monitor shipment performance, delays, and fleet/vendor KPIs using:
- 🚛 Azure Data Factory
- 🚀 Azure Databricks + PySpark
- 📁 Delta Lake + Parquet
- 📊 Power BI for Visualization
logistics-analytics-platform/ │ ├── data/ # Dummy CSVs (drivers, vendors, routes, shipments) ├── notebooks/ # Databricks Notebooks for each layer ├── adf_pipelines/ # ADF JSON definitions ├── powerbi/ # Power BI screenshots or .pbix files ├── architecture/ # Architecture diagram image ├── README.md # This file └── .gitignore
- ✅ KPI Cards: Total Shipments, Avg Delay, On-Time %
- 📊 Vendor performance bar charts
- 📈 Monthly delivery trend lines
- 🗺️ Route-level delay matrix (simulated map)
- 🎯 Filters: Vendor, Route Type, Origin, Destination
- Raw files:
drivers.csv,vendors.csv,routes.csv,shipments.csv - Ingested using ADF Copy Activity with ForEach loop
- Stored in Azure Data Lake under the
bronze/container
- Field renaming and schema standardization
- Converted timestamps and metrics (e.g., delay in minutes)
- Stored as partitioned Parquet files in
silver/container
- Enriched metrics like:
- On-Time %
- Delay by route
- Vendor KPIs
- Monthly trends
- Written to
gold/container, ready for Power BI
- ✅ Metadata-driven ADF pipelines (parameterized + looped)
- 🧠 Databricks data transformation using PySpark
- 🗃️ Medallion architecture (Bronze → Silver → Gold)
- 💾 Delta Lake & partitioned Parquet files
- 📅 Incremental + batch pipeline logic
- 📈 Clean and professional Power BI dashboards
- 🔐 Secure handling of storage & access configuration
- Clone this repo
- Upload
/data/CSVs to Azure Data Lake Gen2 (rawcontainer) - Import and run ADF pipelines from
/adf_pipelines/ - Execute transformation notebooks in
/notebooks/inside Databricks - Open Power BI file from
/powerbi/and connect togoldcontainer
📧 Email: rao.mohsin.54@gmail.com
🌐 LinkedIn
✍️ Medium Profile
If you found this project useful, give it a ⭐ on GitHub — and feel free to fork or adapt it!
