Data Project Hunt (DPH)
© 2025 Data Project Hunt

Discover, Share & Showcase the Best Data Engineering Projects

Explore curated Data Engineering projects from the community. Be recognized for your projects, vote for your favorites and share your own creations.

Last Week

Nov 23 - 29

44 votes • 11 projects

1. Smart Wardrobe Suggestion

LLM-Based Smart Clothing Suggestion

Batch Processing, ETL/ELT Pipeline, ML/AI Pipeline
by @rahrajlat_x9qnhf

2. Reddit ETL Pipeline in Docker

Reddit Data Engineering ETL Pipeline: Spark, Airflow, and MinIO in Docker, with a Medallion Architecture

ETL/ELT Pipeline, Data Platform, Batch Processing, +1 more
by @mabdullahdurrani7_geyeck

3. Flink Sales Pipeline

Real-Time E-Commerce Sales Analytics Pipeline

Analytics & BI, Real-time Streaming, ETL/ELT Pipeline, +1 more
by @nutikrish4_gcme8s

4. Baskpipe

Fully AWS-native data pipelines for processing basketball (NBA) data.

Analytics & BI, ETL/ELT Pipeline, Batch Processing, +1 more
by @dominikzsajovic_g63xxu

5. Data Warehousing for Realtime Pipelines

Building a real-time data warehouse with state-of-the-art tools such as Apache Kafka.

Real-time Streaming, Data Platform
by @wahomewilberforce_frgowo

6. Github Stars Monitor

Never miss a new top-starred repository

Analytics & BI, ETL/ELT Pipeline, Batch Processing
by @maximelemaitre_fmqqg3

7. Macro Agents Economic Data Platform

From FRED to Forecasts: A Modern Data Stack for Economic Intelligence

ML/AI Pipeline, ETL/ELT Pipeline, Data Quality & Testing
by @anoonan_ep227v

8. E2E Real-Time Data Pipeline

Real-time data pipeline with Kafka, Flink, Iceberg, Trino, and Superset.

Analytics & BI, Data Platform, Real-time Streaming, +1 more
by @abelst9_engdt3

9. F1 Insights Real Time Replay

What if your dashboards were as real-time as Max Verstappen?

Analytics & BI, ETL/ELT Pipeline, Real-time Streaming, +1 more
by @hiteshkhk0105_enfa9t

10. Batch data pipeline

Batch Data Pipeline with Airflow, DuckDB, Delta Lake, Trino and Metabase. Observability and quality.

Data Platform, Analytics & BI, ETL/ELT Pipeline, +1 more
by @abelst9_engdt3

11. Daggie The Airflow DAG Quality Auditor

A friendly (and sometimes strict!) animated DAG auditor for Apache Airflow 3.1+

Batch Processing, Experiments
by @rahrajlat_x9qnhf

Week of Nov 16

Nov 16 - 22

6 votes • 2 projects

12. Automated News Intelligence Pipeline

An end-to-end automated pipeline for collecting, processing, and analyzing news articles with machin

ETL/ELT Pipeline, Batch Processing, ML/AI Pipeline
by @charbeldaher34_4hks8s

13. Dbt power tools AI based Documentation

A powerful CLI tool that generates LLM-powered documentation for dbt models and columns

Analytics & BI, Experiments, Batch Processing, +1 more
by @rahrajlat_x9qnhf

Week of Nov 9

Nov 9 - 15

10 votes • 3 projects

14. AIRFLOW YAHOO ETL

Scalable Yahoo API ETL pipeline using Airflow

Analytics & BI, Batch Processing, ETL/ELT Pipeline, +1 more
by @ravitejach888_z9833z

15. Airflow Bulk Pause Unpause Plugin

Bulk manage Airflow DAG states effortlessly: pause or unpause in one action.

Data Platform, Batch Processing, Experiments
by @rahrajlat_x9qnhf

16. AirfloGotchi

AirfloGotchi is a virtual pet game integrated with Airflow to keep your DAGs healthy

Experiments
by @marclambertiml_khmma9

Top Makers

🥇 hiteshkhk0105 • 12.0 avg
🥈 a.noonan • 11.0 avg