Data Modeling, Transformation, and Serving

Coursera · Intermediate · Data Engineering · 3mo ago

Skills: Data Literacy (70%), ML Pipelines (60%)

Key Takeaways

Model, transform, and serve data for analytics and machine learning use cases.

Original Description

In this course, you’ll model, transform, and serve data for both analytics and machine learning use cases. You’ll explore various data modeling techniques for batch analytics, including normalization, star schema, data vault, and one big table, and you’ll use dbt to transform a dataset based on a star schema and one big table. You’ll also compare the Inmon vs Kimball data modeling approaches for data warehouses. You’ll model and transform a tabular dataset for machine learning purposes. You’ll also model and transform unstructured image and textual data. You’ll explore distributed processing frameworks such as Hadoop MapReduce and Spark, and perform stream processing. You’ll identify different ways of serving data for analytics and machine learning, including using views and materialized views, and you’ll describe how a semantic layer built on top of your data model can support the business. In the last week of this course, you’ll complete a capstone project where you’ll build an end-to-end data pipeline that encompasses all of the stages of the data engineering lifecycle to serve data that provides business value.

More on: Data Literacy

Analyzing Billing Data with BigQuery

PySpark in Action: Hands-On Data Processing

Analyze and Visualize Data Using Splunk Statistics

Apply SCD2 to Build Dynamic Data Models

Automate Financial Insights with AI Tools & Dashboards

Automate Excel Data with Power Query and Lookups

Related AI Lessons

How I built the OSS alternatives directory: GitHub ETL, Turso, and the UPSERT trap I hit

Learn how to build a data pipeline for an open-source alternatives directory using GitHub ETL, Turso, and Claude Haiku summaries

Dev.to · MORINAGA

Apache Iceberg in Production: Compaction, Catalogs, and the Pitfalls Nobody Warns You About

Learn how to use Apache Iceberg in production, including compaction, catalogs, and common pitfalls to avoid, to improve data engineering workflows

Dev.to · Gabriel Henrique

Your First Task as a Data Engineer in a New Company? Make the ETL Pipeline Testable

As a new data engineer, make the ETL pipeline testable to ensure data quality and reliability

Towards Data Science

From DataStage and Informatica to Databricks Medallion Architecture: Why Migration Is More Than Code Conversion

Learn how to migrate legacy ETL systems like DataStage to modern architectures like Databricks Medallion, and why it's more than just code conversion

Dev.to · Amit Kumar Singh
