Cloud Computing Applications, Part 2: Big Data and Applications in the Cloud

Free to audit

Coursera · Beginner · 📊 Data Analytics & Business Intelligence · 1mo ago
Welcome to the Cloud Computing Applications course, the second part of a two-course series designed to give you a comprehensive view of the world of Cloud Computing and Big Data! In this second course we continue Cloud Computing Applications by exploring how the Cloud opens up data analytics of huge volumes of data that are static or streamed at high velocity and represent an enormous variety of information. Cloud applications and data analytics represent a disruptive change in the ways that society is informed by, and uses, information.

We start the first week by introducing some major systems for data analysis, including Spark, and the major frameworks and distributions of analytics applications, including Hortonworks, Cloudera, and MapR. By the middle of week one we introduce HDFS, the robust distributed file system used in many applications such as Hadoop. We finish week one by exploring the powerful MapReduce programming model and how distributed operating systems like YARN and Mesos support a flexible and scalable environment for Big Data analytics.

In week two, the course introduces large-scale data storage and the difficulties of reaching consensus in enormous stores that use large numbers of processors, memories, and disks. We discuss eventual consistency, ACID, and BASE, and the consensus algorithms and services used in data centers, including Paxos and ZooKeeper. The course presents distributed key-value stores and in-memory databases like Redis, which are used in data centers for performance. Next we present NoSQL databases. We visit HBase, the scalable, low-latency database that supports database operations in applications that use Hadoop. We then show how Spark SQL can run SQL queries on huge data sets. We finish week two with a presentation on distributed publish/subscribe systems using Kafka, a distributed log messaging system that is finding wide use in connecting Big Data and streaming applications together to form complex systems.

Week three moves to fast data