Data movement and transformation workflows
Discover open-source data engineering projects in ETL/ELT Pipeline from the community.
22 projects found
Production-ready data visualization for NASA dataset
Real-Time Vehicle Data Processing Pipeline
Production-grade analytics pipeline
Predict, simulate, and debug Airflow schedules before they fail.
Content monitoring analytics service using latest AWS S3 Tables along with MSK, EMR (SLA=20 mins)
UFC Data Warehouse and dashboards based on up-to-date data
AI-native e-commerce data platform you can run locally (Airflow + dbt + MCP)
Enterprise-grade ETL pipeline transforming medical XML data into actionable business intelligence
A real-time data streaming pipeline that captures live posts from Bluesky regarding the NBA, perform
A batch ETL pipeline that processes Yelp business raw data to generate analytics and insights
An Open-source accelerator for a ready-to-run, end-to-end analytics platform
Building medallion architecture for crowd-sourced reviews using Snowflake native features
LLM Based Smart Clothing Suggestion
Reddit Data Engineering ETL Pipeline: Spark, Airflow, MinIO in Docker Medallion Architecture
Real-Time E-Commerce Sales Analytics Pipeline
Fully AWS-native data pipelines for processing basketball (NBA) data.
Never miss a new top starred repository
From FRED to Forecasts: A Modern Data Stack for Economic Intelligence
What if your dashboards were as realtime as Max vestappen!
Batch Data Pipeline with Airflow, DuckDB, Delta Lake, Trino and Metabase. Observability and quality.
An end-to-end automated pipeline for collecting, processing, and analyzing news articles with machin
SCALABLE_YAHOO_API_ETL_PIPELINE_USING_AIRFLOW