An automated data quality monitoring solution for daily sales pipelines, built on top of the open-source framework Soda Core to execute SQL Server data validation and anomaly detection.
-
Updated
Mar 24, 2026 - Python
An automated data quality monitoring solution for daily sales pipelines, built on top of the open-source framework Soda Core to execute SQL Server data validation and anomaly detection.
A production-grade Modern Data Stack (MDS) implementation featuring automated ELT, SCD Type 2 history tracking, and CI/CD quality guardrails using Dagster, dbt Core, DuckDB, and Soda.
Personal data ops platform: multi-exchange crypto OHLCV ingestion with Airflow, DuckDB, Soda data quality checks, and runbook-driven incident response.
Production-style YouTube Analytics ELT pipeline with Airflow orchestration, dbt transformations, Soda Core data quality checks, PostgreSQL, and full Docker + GitHub Actions CI/CD.
End-to-end F2P monetization analytics on real GA4 data — BigQuery, dbt, Dagster, Soda, Looker Studio.
Production-grade F1 telemetry pipeline using Medallion Architecture on Databricks
Production-grade RAG data pipeline: Kafka → Spark Structured Streaming → Delta Lake (Bronze/Silver/Gold) → OpenAI Embeddings → Pinecone. Quality-gated with Soda Core, orchestrated by Airflow, observed with OpenLineage, served via FastAPI.
Real-time data platform with enforced data contracts— Kafka, Flink, Delta Lake, dbt, Soda Core
The standard benchmark for data quality tools — detection, transformation, entity resolution, and pipeline orchestration. 4 categories, 12 tiers, 161 tests.
Add a description, image, and links to the soda-core topic page so that developers can more easily learn about it.
To associate your repository with the soda-core topic, visit your repo's landing page and select "manage topics."