Data Prep for Machine Learning in Python

Coursera Courses ↗ · Coursera

Open Course on Coursera

Free to audit · Opens on Coursera

Data Prep for Machine Learning in Python

Coursera · Beginner ·📐 ML Fundamentals ·1mo ago
Machine learning models rely on good data to produce meaningful insights. For that reason, data prep is one of the most critical skills for machine learning. In this course, you’ll learn how to import and clean data before populating missing values using imputation. You’ll learn how to visualize histograms, scatter charts, and box plots to identify trends of interest before using the analysis to select the most important features. Feature engineering techniques such as one hot encoding, binning and scaling will help us transform the structure of our data to produce higher quality machine learning insights. This data prep course in Python includes more interactive exercises and challenges than previous BIDA courses have. You will also have the opportunity to test your skills on a comprehensive guided Python case study before completing the final exam. Upon completing this course, you will be able to: • Import and clean your data in Python • Apply imputation to estimate missing values in the dataset • Conduct exploratory data analysis (EDA) to find initial patterns to guide our analysis • Select features to focus on the most important variables • Apply feature engineering to make datasets machine learning-friendly • Select appropriate feature engineering techniques based on the model type Whether you are a business leader or an aspiring analyst exploring data science, this Data Prep for Machine Learning in Python course will serve as your comprehensive introduction to this fascinating subject. You’ll learn all the key terminology to allow you to talk data science with your teams, begin implementing analysis, and understand how data science can help your business.
Watch on Coursera ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

What a GPU Actually Is (and Why ML Stole It)
Learn what a GPU actually is and why it's crucial for machine learning, with a deep dive into its architecture and capabilities
Dev.to AI
Python Sets: One of the Most Powerful Data Structures Beginners Often Ignore
Learn to harness the power of Python sets for efficient data manipulation and improved code performance
Medium · Python
Bigger AI models aren't always better. Here's how to actually choose.
Larger AI models don't always outperform smaller ones, and choosing the right model requires careful consideration of several factors
Dev.to · Rohini Gaonkar
Nobody Knows What The Beach Is Saying. And That’s The Point.
Learn how signal and semantic models form the foundation of powerful AI systems and why understanding their gap is crucial
Medium · Deep Learning
Up next
Deep Learning in Electronic Health Records
Coursera
Watch →