Apache Spark: Design & Execute ETL Pipelines Hands-On
This hands-on course equips learners with the skills to design, build, and manage end-to-end ETL (Extract, Transform, Load) workflows using Apache Spark in a real-world data engineering context. Structured into two comprehensive modules, the course begins with foundational setup, guiding learners through the installation of essential components such as PySpark, Hadoop, and MySQL. Participants will learn how to configure their environment, organize project structures, and explore source datasets effectively.
As the course progresses, learners will develop Spark applications to perform full and…
Watch on Coursera ↗
(saves to browser)
DeepCamp AI