Personalization in the Era of LLMs - Shivam Verma, Spotify
Spotify represents Ariana Grande and Bruno Mars as sequences of six tokens. The first two are shared because both are pop artists. The remaining tokens diverge to capture what makes each distinct. That is a Semantic ID, and it is how Spotify teaches open-weight LLMs to reason over a catalog of 100 million tracks the same way they reason over words.
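To make the shared-prefix idea concrete, here is a minimal sketch that assumes Semantic IDs are built by residual quantization: each level picks the nearest codeword and passes the leftover residual to the next level. The summary above does not say which quantizer Spotify uses, so the method, the 2-D vectors, the three-level codebooks (rather than six), and the names `CODEBOOKS`, `artist_a`, and `artist_b` are all illustrative assumptions.

```python
from typing import List, Sequence, Tuple

Vector = Tuple[float, ...]

# Hypothetical toy codebooks: one per token position, coarse to fine.
CODEBOOKS: List[List[Vector]] = [
    [(10.0, 0.0), (0.0, 10.0)],  # level 0: coarse grouping (e.g. a genre)
    [(2.0, 0.0), (0.0, 2.0)],    # level 1: finer grouping
    [(0.5, 0.0), (0.0, 0.5)],    # level 2: what separates close neighbours
]


def nearest(codebook: Sequence[Vector], v: Vector) -> int:
    """Index of the codeword closest to v (squared Euclidean distance)."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((c - x) ** 2 for c, x in zip(codebook[i], v)),
    )


def semantic_id(v: Vector, codebooks: Sequence[Sequence[Vector]]) -> List[int]:
    """Quantize v level by level, emitting one token per codebook."""
    tokens: List[int] = []
    residual = v
    for codebook in codebooks:
        idx = nearest(codebook, residual)
        tokens.append(idx)
        # Subtract the chosen codeword; the next level encodes what is left.
        residual = tuple(r - c for r, c in zip(residual, codebook[idx]))
    return tokens


def shared_prefix_len(a: Sequence[int], b: Sequence[int]) -> int:
    """Number of leading tokens two IDs have in common."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


# Two made-up catalog vectors that sit close together in embedding space.
artist_a = semantic_id((12.5, 0.0), CODEBOOKS)
artist_b = semantic_id((12.0, 0.5), CODEBOOKS)
print(artist_a, artist_b, shared_prefix_len(artist_a, artist_b))
```

In this toy setup the two vectors agree on the coarse levels and differ only at the last one, which mirrors the description of two pop artists sharing their first tokens and diverging afterwards.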
Shivam Verma from Spotify's AI foundation team walks through the three components they assembled to personalize LLMs at scale without full fine-tuning. User embeddings trained on streaming history across 750 million users form the base. Semantic IDs compress catalog vectors into tokens the model can autoregressively generate, predicting the next song or episode as the next token in a sequence. A soft tokenization layer projects a user's embedding directly into the LLM's token space, giving the frozen model a user-specific token to attend over. Podcast next-episode recommendations are already running on this stack in production.
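The soft tokenization step can be pictured with a small sketch. It assumes the projection is a single learned linear layer that maps the user embedding to the LLM's hidden size, and that the resulting vector is prepended to the prompt's token embeddings. The summary only says the embedding is projected into token space, so the layer shape, the dimensions, and the names `project` and `prepend_soft_token` are hypothetical.

```python
from typing import List, Sequence

Vec = List[float]


def project(user_embedding: Sequence[float],
            weight: Sequence[Sequence[float]],
            bias: Sequence[float]) -> Vec:
    """Linear map from user-embedding space to the LLM's embedding space.

    weight has shape (d_model, d_user); bias has shape (d_model,).
    In this sketch these would be the only trainable parameters;
    the LLM itself stays frozen.
    """
    return [
        sum(w * x for w, x in zip(row, user_embedding)) + b
        for row, b in zip(weight, bias)
    ]


def prepend_soft_token(user_embedding: Sequence[float],
                       token_embeddings: Sequence[Vec],
                       weight: Sequence[Sequence[float]],
                       bias: Sequence[float]) -> List[Vec]:
    """Return the input sequence with one user-specific vector in front."""
    soft_token = project(user_embedding, weight, bias)
    return [soft_token] + [list(t) for t in token_embeddings]


# Toy numbers: d_user = 3, d_model = 2, prompt of two tokens.
user_embedding = [1.0, 2.0, 3.0]
weight = [[1.0, 0.0, 0.0],
          [0.0, 1.0, 1.0]]
bias = [0.5, -1.0]
prompt_embeddings = [[0.1, 0.2], [0.3, 0.4]]

sequence = prepend_soft_token(user_embedding, prompt_embeddings, weight, bias)
print(sequence)
```

Because the soft token lives in the same space as ordinary token embeddings, the frozen model can attend to it like any other position, which is what lets personalization happen without updating the LLM's own weights.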
Speaker info:
- https://x.com/kaffeinated
- https://www.linkedin.com/in/shivam13verma