Biswajit107927/README.md

Hi, I'm Biswajit Praharaj 👋

Senior Data Engineer | 14 Years of Experience | 10+ Years at Amazon


🚀 About Me

I build production data infrastructure that powers business decisions at enterprise scale.

  • 🏗️ Architected a data lake serving 2,200+ users and $1.8B+ in marketing budgets
  • 99.9% pipeline availability across 8+ enterprise business units
  • 🎯 33+ technical interviews conducted as an Amazon interview panelist
  • 🤖 AI-assisted development using Claude (Anthropic)
  • 📍 Based in Seattle, WA

🔧 Tech Stack

Python · Apache Spark · Apache Airflow · AWS · Amazon Redshift · SQL

Data Platforms: Apache Iceberg · AWS Glue · Amazon S3 · Data Lakehouse Architecture

Streaming & Orchestration: Amazon Kinesis · Apache Airflow (MWAA) · Dynamic DAG Generation

Warehousing: Amazon Redshift · Amazon Athena · DynamoDB · Redshift Spectrum

Governance: RLS · CLS · PII Masking · Data Contracts · DLQ · Metadata Management


📂 Featured Projects

| Project | Description | Tech |
|---|---|---|
| transaction-pipeline | Production financial transaction processing: deduplication, currency conversion, top spender analysis | Python |
| data-platform-quicksight | End-to-end data platform: Kinesis → Glue → Iceberg → Redshift → QuickSight with automated self-serve analytics | Python · AWS · Airflow |
| airflow-etl-framework (coming soon) | Parameterised multi-tenant DAG factory for scalable ETL pipelines | Python · Airflow |
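
For a flavour of what the transaction-pipeline steps involve, here is a minimal sketch of deduplication, currency conversion, and top spender analysis. It is not taken from the repository: the record fields (`id`, `user`, `amount`, `currency`), the function names, and the exchange rates are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from decimal import Decimal

# Hypothetical exchange rates to USD; illustrative values, not market data.
RATES_TO_USD = {
    "USD": Decimal("1"),
    "EUR": Decimal("1.10"),
    "GBP": Decimal("1.25"),
}

# Hypothetical sample records; "t1" appears twice to exercise deduplication.
SAMPLE_TRANSACTIONS = [
    {"id": "t1", "user": "alice", "amount": "100.00", "currency": "USD"},
    {"id": "t2", "user": "bob", "amount": "100.00", "currency": "EUR"},
    {"id": "t1", "user": "alice", "amount": "100.00", "currency": "USD"},
    {"id": "t3", "user": "alice", "amount": "20.00", "currency": "GBP"},
]


def dedupe(transactions):
    """Keep the first occurrence of each transaction id, preserving order."""
    seen = set()
    unique = []
    for tx in transactions:
        if tx["id"] not in seen:
            seen.add(tx["id"])
            unique.append(tx)
    return unique


def to_usd(tx):
    """Convert one transaction amount to USD using the assumed rate table."""
    return Decimal(tx["amount"]) * RATES_TO_USD[tx["currency"]]


def top_spenders(transactions, n=3):
    """Return the n users with the highest total spend in USD."""
    totals = defaultdict(Decimal)  # Decimal() defaults to 0
    for tx in dedupe(transactions):
        totals[tx["user"]] += to_usd(tx)
    # Sort by total descending, then by user name for a stable tie-break.
    return sorted(totals.items(), key=lambda kv: (-kv[1], kv[0]))[:n]


if __name__ == "__main__":
    for user, total in top_spenders(SAMPLE_TRANSACTIONS, n=2):
        print(user, total)
```

`Decimal` is used instead of `float` so that monetary sums do not pick up binary rounding error; a production pipeline would also need to handle unknown currencies and malformed records.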

📊 Impact at a Glance

| Metric | Value |
|---|---|
| Users served | 2,200+ |
| Marketing budgets managed | $1.8B+ |
| Daily events processed | 10M+ |
| Pipeline availability | 99.9% |
| Technical interviews conducted | 33+ |
| Teams using my frameworks | 12+ |

📫 Connect With Me

LinkedIn · GitHub

Popular repositories

  1. data-platform-quicksight (Public)

    End-to-end AWS data platform — Kinesis → Glue → Iceberg → Redshift → QuickSight with automated self-serve analytics and zero manual onboarding

    Python 4

  2. Biswajit107927 (Public)

  3. transaction-pipeline (Public)

    Production-quality financial transaction processing pipeline in Python

    Python

  4. depytools (Public)

    Python

  5. StockGPT (Public)

    Python