PySpark in Action: Hands-On Data Processing
PySpark in Action: Hands-On Data Processing is a practical course that equips you to work confidently with large-scale data using PySpark and distributed data processing frameworks. You’ll learn the fundamentals of Big Data, Apache Hadoop, and Apache Spark, then build on them through real-world exercises in which you process and analyze massive datasets.
During the course, you’ll gain hands-on experience with:
- Foundational concepts of Big Data and components of the Hadoop ecosystem such as HDFS, so you understand how modern data storage and processing work.
- Spark architecture…
Watch on Coursera ↗