Data

Data Engineering

ETL pipelines, data warehousing, streaming, orchestration and lakehouse architecture

55
lessons
Skills in this topic
View full skill map →
ETL Basics
beginner
Write a Python ETL pipeline with pandas
Workflow Orchestration
intermediate
Build a DAG in Airflow with sensors and operators
Streaming Data
intermediate
Produce and consume Kafka topics with Python
Data Warehousing
intermediate
Model a star schema with dbt
Lakehouse Architecture
advanced
Manage ACID transactions on a data lake with Delta Lake
All Reads (23) Articles (11)Blog Posts (7)Tutorials (5)
How I Broke Down My ETL Pipeline Project Into Smaller Engineering Exercises
Dev.to · Tanmay 🔄 Data Engineering ⚡ AI Lesson 4d ago
How I Broke Down My ETL Pipeline Project Into Smaller Engineering Exercises
Recently, I started building an ETL pipeline project to better understand how modern data systems...
LinkedIn Data Engineering Interview Questions: Full Prep Guide
Dev.to · Gowtham Potureddi 🔄 Data Engineering ⚡ AI Lesson 3w ago
LinkedIn Data Engineering Interview Questions: Full Prep Guide
LinkedIn data engineering interview questions lean toward trust-heavy modeling: member-centric...
ETL vs. ELT: Which Approach Should You Use and Why?
Dev.to · Gathuru_M 🔄 Data Engineering ⚡ AI Lesson 3w ago
ETL vs. ELT: Which Approach Should You Use and Why?
1. Introduction Understanding a company's data architecture can feel overwhelming, but...
Containerizing Apache Airflow: Building Portable Data Pipelines with Docker
Dev.to · peter muriya 🔄 Data Engineering ⚡ AI Lesson 1mo ago
Containerizing Apache Airflow: Building Portable Data Pipelines with Docker
Apache Airflow is one of the most widely used orchestration tools in data engineering. It enables...
Automating ETL Workflows with Apache Airflow: From Python Script to Scheduled Pipeline
Dev.to · peter muriya 🔄 Data Engineering ⚡ AI Lesson 1mo ago
Automating ETL Workflows with Apache Airflow: From Python Script to Scheduled Pipeline
Modern data engineering revolves around automation, reliability, and scalability. Writing an ETL...
Columnar Databases (ClickHouse/Snowflake)
Dev.to · Aviral Srivastava 🔄 Data Engineering ⚡ AI Lesson 1mo ago
Columnar Databases (ClickHouse/Snowflake)
The Data Titans: Diving Deep into the World of Columnar Databases (ClickHouse &...
What 166 Modules Taught Us About Building an ETL Pipeline for Website Content
Dev.to · Smuves 🔄 Data Engineering ⚡ AI Lesson 1mo ago
What 166 Modules Taught Us About Building an ETL Pipeline for Website Content
ETL is a solved problem in most of the software world. Data teams have been extracting, transforming,...