Throughput Is Not All You Need and more
In this meetup, Neha led our discussion of the paper, Throughput Is Not All You Need, and other related works.
Our Meetup: https://www.meetup.com/East-Bay-Tri-Valley-Machine-Learning-Meetup/
*Content*
00:00 Intro
02:25 Basic Concepts
11:47 Throughput vs Goodput
18:24 Prefill
32:41 Colocate
35:45 KV cache
42:40 Related papers
50:30 LMCache
============================
😊About Us
West Coast Machine Learning is a channel dedicated to exploring the exciting world of machine learning! Our group of techies is passionate about deep learning, neural networks, computer vision, tiny ML, and other cool geeky machine learning topics. We love to dive deep into the technical details and stay up to date with the latest research developments.
Our Meetup group and YouTube channel is the perfect place to connect with other like-minded individuals who share your love of machine learning. We offer a mix of research paper discussions, coding reviews, and other data science topics. So, if you're looking to stay up to date with the latest developments in machine learning, connect with other techies, and learn something new, be sure to subscribe to our channel and join our Meetup community today!
Meetup: https://www.meetup.com/east-bay-tri-valley-machine-learning-meetup/
=============================
#Throughput-vs-Goodput #Model-training-performance #model-training #Goodput #LMCache #Prefill-decode #Disaggregated-inference
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
More on: ML Maths Basics
View skill →Related AI Lessons
⚡
⚡
⚡
⚡
Role of Model Architecture In Inference — Inference Series
Medium · Machine Learning
Role of Model Architecture In Inference — Inference Series
Medium · Deep Learning
What isn’t said clearly
cannot be relied on as truth.
Medium · Deep Learning
The Idempotency Nightmare in AI Pipelines: Data Loss and Recovery
Dev.to AI
Chapters (8)
Intro
2:25
Basic Concepts
11:47
Throughput vs Goodput
18:24
Prefill
32:41
Colocate
35:45
KV cache
42:40
Related papers
50:30
LMCache
🎓
Tutor Explanation
DeepCamp AI