I build production-scale data infrastructure that powers business decisions at enterprise scale.
- 🏗️ Architected a data lake serving 2,200+ users and $1.8B+ in marketing budgets
- ⚡ 99.9% pipeline availability across 8+ enterprise business units
- 🎯 33+ technical interviews conducted as Amazon interview panelist
- 🤖 AI-assisted development using Claude (Anthropic)
- 📍 Based in Seattle, WA
Data Platforms: Apache Iceberg · AWS Glue · Amazon S3 · Data Lakehouse Architecture
Streaming & Orchestration: Amazon Kinesis · Apache Airflow (MWAA) · Dynamic DAG Generation
Warehousing: Amazon Redshift · Amazon Athena · DynamoDB · Redshift Spectrum
Governance: RLS · CLS · PII Masking · Data Contracts · DLQ · Metadata Management
| Project | Description | Tech |
|---|---|---|
| transaction-pipeline | Production financial transaction processing — deduplication, currency conversion, top spender analysis | Python |
| data-platform-quicksight | End-to-end data platform — Kinesis → Glue → Iceberg → Redshift → QuickSight with automated self-serve analytics | Python · AWS · Airflow |
| airflow-etl-framework (coming soon) | Parameterised multi-tenant DAG factory for scalable ETL pipelines | Python · Airflow |
| Metric | Value |
|---|---|
| Users served | 2,200+ |
| Marketing budgets managed | $1.8B+ |
| Daily events processed | 10M+ |
| Pipeline availability | 99.9% |
| Technical interviews conducted | 33+ |
| Teams using my frameworks | 12+ |