Skip to content

Raghava-07/api-data-warehouse

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

API Data Warehouse

A beginner-friendly Data Engineering project that demonstrates an ETL pipeline using Python and SQLite.

Architecture

API ↓ extract.py ↓ raw/products.json ↓ transform.py ↓ processed/products_clean.csv ↓ load.py ↓ SQLite Warehouse ↓ Analytics Queries

Tech Stack

  • Python
  • SQLite
  • SQL
  • Pandas
  • Requests

Project Structure

src/
data/
database/
sql/

Setup

pip install -r requirements.txt

Run pipeline:

python src/run_pipeline.py

Features

  • API ingestion
  • Data validation
  • Data cleaning
  • SQLite loading
  • SQL analytics

About

Python ETL pipeline that ingests product data from a REST API, performs data quality checks, and loads curated data into SQLite for analytics.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages