Resampling and Regularization | Data Science with Marco

Data Science with Marco · Beginner ·📐 ML Fundamentals ·6y ago

Key Takeaways

The video covers resampling and regularization techniques in Python, including validation set, leave-one-out cross-validation, k-fold cross-validation, ridge regression, and lasso regularization.

Original Description

Get the notebook and the dataset: https://github.com/marcopeix/datasciencewithmarco 📚 Theory: 0:00 - 5:17 🐍 Code: 5:18 - 14:40 In this video, we cover resampling and regularization in Python. We cover 3 different approaches to resampling: validation set, leave-one-out cross-validation (LOOCV) and k-fold cross-validation. Then, we introduce two resampling methods: ridge regression and lasso. Finally, we apply all those methods in Python and see how they can improve our models. Follow me on Medium for data science articles: https://medium.com/@marcopeixeiro
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Data Science with Marco · Data Science with Marco · 3 of 38

1 Linear Regression in Python | Data Science with Marco
Linear Regression in Python | Data Science with Marco
Data Science with Marco
2 Classification in Python | logistic regression, LDA, QDA | Data Science With Marco
Classification in Python | logistic regression, LDA, QDA | Data Science With Marco
Data Science with Marco
Resampling and Regularization | Data Science with Marco
Resampling and Regularization | Data Science with Marco
Data Science with Marco
4 Decision Trees | Data Science with Marco
Decision Trees | Data Science with Marco
Data Science with Marco
5 Suppor Vector Machine (SVM) in Python | Data Science with Marco
Suppor Vector Machine (SVM) in Python | Data Science with Marco
Data Science with Marco
6 Unsupervised Learning | PCA and Clustering | Data Science with Marco
Unsupervised Learning | PCA and Clustering | Data Science with Marco
Data Science with Marco
7 Data Science Portfolio Project: Regression #1 | Data Science with Marco
Data Science Portfolio Project: Regression #1 | Data Science with Marco
Data Science with Marco
8 Data Science Portfolio Project: Regression #2 | Data Science with Marco
Data Science Portfolio Project: Regression #2 | Data Science with Marco
Data Science with Marco
9 What Are Time Series - Applied Time Series Analysis in Python and TensorFlow
What Are Time Series - Applied Time Series Analysis in Python and TensorFlow
Data Science with Marco
10 Basic Statistics - Applied Time Series Analysis in Python and TensorFlow
Basic Statistics - Applied Time Series Analysis in Python and TensorFlow
Data Science with Marco
11 Autocorrelation and White Noise - Applied Time Series Analysis in Python and TensorFlow
Autocorrelation and White Noise - Applied Time Series Analysis in Python and TensorFlow
Data Science with Marco
12 Stationarity and Differencing - Applied Time Series Analysis in Python and TensorFlow
Stationarity and Differencing - Applied Time Series Analysis in Python and TensorFlow
Data Science with Marco
13 Random Walk Model - Applied Time Series Analysis in Python and TensorFlow
Random Walk Model - Applied Time Series Analysis in Python and TensorFlow
Data Science with Marco
14 Moving Average Process - Applied Time Series Analysis in Python and TensorFlow
Moving Average Process - Applied Time Series Analysis in Python and TensorFlow
Data Science with Marco
15 Autoregressive Process - Applied Time Series Analysis in Python and TensorFlow
Autoregressive Process - Applied Time Series Analysis in Python and TensorFlow
Data Science with Marco
16 ARMA Model - Time Series Analysis in Python and TensorFlow
ARMA Model - Time Series Analysis in Python and TensorFlow
Data Science with Marco
17 What is data science?
What is data science?
Data Science with Marco
18 Answering DATA SCIENCE questions #1 - Why learn SQL when Python and R exist?
Answering DATA SCIENCE questions #1 - Why learn SQL when Python and R exist?
Data Science with Marco
19 R vs Python in the Industry - Data Science Q&A #datascience #datasciencecareer #careeradvice
R vs Python in the Industry - Data Science Q&A #datascience #datasciencecareer #careeradvice
Data Science with Marco
20 Data science or data engineering - which is best for you? #datascience #datasciencecareer
Data science or data engineering - which is best for you? #datascience #datasciencecareer
Data Science with Marco
21 Where to find data for data science projetcs? #datascience #datasciencecareer
Where to find data for data science projetcs? #datascience #datasciencecareer
Data Science with Marco
22 Data science certificates on resume? #datascience #datasciencecareer #careeradvice
Data science certificates on resume? #datascience #datasciencecareer #careeradvice
Data Science with Marco
23 Should you aim for data science or data engineering? | Data Science Q&A #1
Should you aim for data science or data engineering? | Data Science Q&A #1
Data Science with Marco
24 Don't waste time on this | #datascience #datasciencecareer
Don't waste time on this | #datascience #datasciencecareer
Data Science with Marco
25 Low-code AI tools - are they good? | #datascience #datasciencecareer #careeradvice
Low-code AI tools - are they good? | #datascience #datasciencecareer #careeradvice
Data Science With Marco
26 How to grow as a data scientist after 2+ years of experience? #datascience #datasciencecareer
How to grow as a data scientist after 2+ years of experience? #datascience #datasciencecareer
Data Science with Marco
27 Transition into DATA SCIENCE without a masters or bootcamp #careertransition
Transition into DATA SCIENCE without a masters or bootcamp #careertransition
Data Science With Marco
28 How to improve your data science profile?
How to improve your data science profile?
Data Science With Marco
29 How to learn Python for data science?
How to learn Python for data science?
Data Science With Marco
30 Does Scrum/Agile work for data science?
Does Scrum/Agile work for data science?
Data Science With Marco
31 What are the major roles in analytics and how to choose?
What are the major roles in analytics and how to choose?
Data Science with Marco
32 Thoughts and advice for a live SQL coding round
Thoughts and advice for a live SQL coding round
Data Science With Marco
33 Data science interview question: difference between type 1 and type 2 error
Data science interview question: difference between type 1 and type 2 error
Data Science With Marco
34 Feature selection in machine learning | Full course
Feature selection in machine learning | Full course
Data Science With Marco
35 Anomaly detection in time series with Python | Data Science with Marco
Anomaly detection in time series with Python | Data Science with Marco
Data Science With Marco
36 Podcast - TimeGPT, predicting the future, and more
Podcast - TimeGPT, predicting the future, and more
Data Science With Marco
37 Big announcement - Revealing my new book
Big announcement - Revealing my new book
Data Science With Marco
38 Get Started in Time Series Forecasting in Python | Full Course
Get Started in Time Series Forecasting in Python | Full Course
Data Science With Marco

This video teaches resampling and regularization techniques in Python to improve model performance. It covers three resampling approaches and introduces two regularization methods, applying them in Python.

Key Takeaways
  1. Import necessary libraries
  2. Load dataset
  3. Split data into training and testing sets
  4. Apply validation set resampling
  5. Apply LOOCV resampling
  6. Apply k-fold cross-validation resampling
  7. Implement ridge regression
  8. Implement lasso regularization
  9. Compare model performance
💡 Resampling and regularization techniques can significantly improve model performance by reducing overfitting and bias.

Related Reads

📰
What Is MLIR and Why Does It Exist?
Learn about MLIR, a intermediate representation for machine learning models, and its purpose in optimizing ML workflows
Dev.to · Fedor Nikolaev
📰
Why Choosing the Right Machine Learning Development Company Matters More Than the AI Model
Choosing the right machine learning development company is crucial for turning AI investments into measurable results, as it can make or break the success of AI projects
Medium · Machine Learning
📰
Data privacy in AI training: federated learning, differential privacy, and synthetic data
Learn how federated learning, differential privacy, and synthetic data preserve data privacy in AI training, and why they matter for secure machine learning
Dev.to AI
📰
Data Preprocessing: Encoding and Feature Scaling in Machine Learning
Learn to preprocess data by encoding and scaling features for better machine learning model performance
Medium · Machine Learning
Up next
Is Python Dead in 2026?| Truth About Python in AI Era | 90 Days Roadmap @FameWorldEducationalHub
FAME WORLD EDUCATIONAL HUB
Watch →