Big Data - Capstone Project

Free to audit on Coursera

Coursera · Beginner · 📊 Data Analytics & Business Intelligence · 3mo ago

Skills: ML Pipelines (80%)

Key Takeaways

Teaches big data analysis and ecosystem building with a capstone project

Original Description

Welcome to the Capstone Project for Big Data! In this culminating project, you will build a big data ecosystem using tools and methods from the earlier courses in this specialization. You will analyze a data set simulating big data generated by a large number of users playing our imaginary game "Catch the Pink Flamingo". During the five-week Capstone Project, you will walk through the typical big data science steps: acquiring, exploring, preparing, analyzing, and reporting. In the first two weeks, we will introduce you to the data set and guide you through some exploratory analysis using tools such as Splunk and OpenOffice. Then we will move on to more challenging big data problems that require the more advanced tools you have learned, including KNIME, Spark's MLlib, and Gephi. Finally, during the fifth and final week, we will show you how to bring it all together to create engaging and compelling reports and slide presentations. As a result of our collaboration with Splunk, a software company focused on analyzing machine-generated big data, learners with the top projects will be eligible to present to Splunk and meet Splunk recruiters and engineering leadership.
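
As a rough illustration of the five steps named in the description (acquiring, exploring, preparing, analyzing, reporting), here is a minimal sketch in plain Python. It does not use the course's actual data set or tools: the event records, the `user_id` and hit-flag fields, and every function name below are hypothetical, chosen only to show how the stages hand data to one another.

```python
from collections import Counter
from statistics import mean

# Hypothetical click events as (user_id, hit_flag) strings.
# This is NOT the course's real "Catch the Pink Flamingo" data.
RAW_EVENTS = [
    ("u1", "1"), ("u1", "0"), ("u2", "1"),
    ("u2", "1"), ("u3", ""), ("u3", "0"),
]


def acquire():
    """Acquire: return raw records (a real project would read files or logs)."""
    return list(RAW_EVENTS)


def explore(events):
    """Explore: count events per user to get a feel for the data."""
    return Counter(user for user, _ in events)


def prepare(events):
    """Prepare: drop records with a missing hit flag and convert flags to int."""
    return [(user, int(flag)) for user, flag in events if flag != ""]


def analyze(clean):
    """Analyze: compute each user's hit ratio (hits divided by clicks)."""
    by_user = {}
    for user, hit in clean:
        by_user.setdefault(user, []).append(hit)
    return {user: mean(hits) for user, hits in by_user.items()}


def report(ratios):
    """Report: one formatted line per user, highest hit ratio first."""
    ranked = sorted(ratios.items(), key=lambda kv: (-kv[1], kv[0]))
    return [f"{user}: {ratio:.2f}" for user, ratio in ranked]


events = acquire()
counts = explore(events)
ratios = analyze(prepare(events))
lines = report(ratios)
print("\n".join(lines))
```

In the course itself these stages are carried out with the tools the description lists rather than with hand-written functions; the sketch only mirrors the order of the steps.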

More on: ML Pipelines

Building a Dog Breed Identifier App from scratch - DogNet

Aladdin Persson

Complete Dockers For Data Science Tutorial In One Shot

Part 6 | Deploy ML Model on Kubernetes | Auto-Scaling with HPA and Monitoring with Prometheus

Abonia Sojasingarayar

Vertex Pipelines: Qwik Start

Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation

Automate R scripts with GitHub Actions: Deploy a model

Related AI Lessons

Python for Data Science — Probability Basics for Data Science

Learn probability basics for data science with Python to enhance your statistical analysis skills

Medium · Data Science

Python for Data Science — Probability Basics for Data Science

Learn probability basics for data science in Python to improve statistical analysis and modeling skills

Medium · Python

The Attention Economy: Your Attention Is Worth More Than Gold

Learn how the attention economy works and why your focus is a valuable resource in the digital age

Medium · Data Science

What I Learned Building a Tableau Dashboard for Deloitte’s Data Analytics Simulation

Learn how to build a Tableau dashboard for data analytics by exploring a real-world project for Deloitte's simulation, focusing on machine downtime and pay equity

Medium · Data Science

Spreadsheet Guy Meets the CFO: "Define How Much"

Digital Transformation with Eric Kimberling