Speed up Cleaning Data with Pandas and your GPU! (CuDF Episode 5)
In this episode, we dive deep into data manipulation and cleaning using cuDF, a CUDA-accelerated version of Pandas from NVIDIA. You'll learn how to handle missing values, drop columns, and change data types—particularly useful for time series data. The video also explores common challenges with unclean real-world data and demonstrates how to best leverage GPU acceleration for large datasets. The tutorial is carried out on a Dell Workstation Precision 3680 equipped with an Nvidia RTX 5000 Ada GPU, showcasing the efficiency of GPU-accelerated Python packages. Follow along in a Jupyter Notebook a…
Watch on YouTube ↗
(saves to browser)
Chapters (9)
Introduction and Series Recap
0:16
Data Manipulation Basics
0:30
Working with Real-World Data
1:02
Dell Workstation Overview
2:21
Setting Up the Jupyter Notebook
4:00
Cleaning Data with Pandas
4:38
Handling Null Values
8:33
Working with Dates in Pandas
11:16
Conclusion and Next Steps
DeepCamp AI