How Salesforce Research Cut AI Training Costs by 42% with Google Cloud Managed Lustre
Skills:
ML Pipelines60%
Accelerate your AI training with Managed Lustre. Get started today with Google Cloud Managed Lustre: https://goo.gle/3OudNBm
Stop starving your GPUs. Salesforce Research reduced their AI training costs by 42% and achieved a 5.3x performance speedup by solving critical I/O bottlenecks with Google Cloud. Learn how they leveraged Managed Lustre and Vertex AI to stop starving their powerful GPUs and unlock near-100% utilization. By switching to Google Cloud Managed Lustre, they transformed their training pipeline for Llama 3.1 models, increasing GPU utilization by 70%.
Join Lavanya Karanam and Avinash Gudagi from the Salesforce AI Research team as they detail their journey from battling storage latency to accelerating the development of some of the world's most complex AI models. Discover how Salesforce achieved game-changing results, including: A 5.3x increase in training speed. A 42% reduction in overall compute costs. GPU Utilization that jumped from ~60% to near 100% saturation.
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
More on: ML Pipelines
View skill →Related AI Lessons
⚡
⚡
⚡
⚡
I Built a $0 Search Engine on Real Web Data (No Algolia or Elasticsearch)
Medium · Python
TPU Mythbusting: vendor lock-in
Dev.to · Maciej Strzelczyk
Confusion Matrix Explained Using Random Forest
Medium · Python
When Preprocessing Helps — and When It Hurts: Why Your Image Classification Model’s Accuracy Varies
Medium · Machine Learning
🎓
Tutor Explanation
DeepCamp AI