How Salesforce Research Cut AI Training Costs by 42% with Google Cloud Managed Lustre

Google Cloud · Advanced · ML Fundamentals
Stop starving your GPUs. Salesforce Research solved critical I/O bottlenecks in its AI training pipeline for Llama 3.1 models by switching to Google Cloud Managed Lustre, used together with Vertex AI.

Join Lavanya Karanam and Avinash Gudagi from the Salesforce AI Research team as they detail their journey from battling storage latency to accelerating the development of some of the world's most complex AI models.

Results Salesforce reports:
- A 5.3x increase in training speed.
- A 42% reduction in overall compute costs.
- GPU utilization that jumped from ~60% to near-100% saturation, a relative increase of roughly 70%.

Get started with Google Cloud Managed Lustre: https://goo.gle/3OudNBm