Multi-Vector Search with Amélie Chatelain and Antoine Chaffin - Weaviate Podcast #134!
Amélie Chatelain and Antoine Chaffin from LightOn are leading the way in the next generation of search powered by Multi-Vector representations and Late Interaction. The podcast begins with what motivates them to work on Multi-Vector Search, continuing to discuss particular details such as the combination between lexical and semantic search, as well as bi-encoder speed with cross encoder accuracy. This discussion continues to present insights about training multi-vector models and how they differ from their single-vector predecessors. The conversation continues into particular successes of Late Interaction such as code, reasoning-intensive, and multimodal retrieval. Agents are great at searching with grep, but they are even better with ColGrep! Reasoning-Intensive Retrieval is a step change in how we think about search systems, beautifully enabled by both Late Interaction models and Agentic Search. Further, Multimodal Search, such as matching text with videos, is seeing massive benefits from Multi-Vector representations. The podcast continues to dive into the cost of MaxSim and how efficient methods such as MUVERA and PLAID can help. The podcast concludes with a presentation of their recent work on ColBERT-Zero, pre-training with Late Interaction instead of Single-Vector Dense Embedding models. LightOn are also the developers of PyLate, the world's leading open-source library for training these kinds of models.
Chapters
0:00 Welcome to the Weaviate Podcast!
2:17 An Introduction to Multi-Vector Search
8:17 Multi- vs. Single-Vector
11:12 Comparison with Cross Encoders
18:12 ColGrep for Coding Agents
32:51 Reasoning-Intensive Retrieval
44:19 Multimodal Multi-Vector
50:51 The Cost of Multi-Vector
55:43 MUVERA and PLAID
1:08:35 ColBERT-Zero and PyLate
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
More on: RAG Basics
View skill →Related AI Lessons
⚡
⚡
⚡
⚡
Why StarRocks Is Better Than Elasticsearch for RAG and AI-Powered Vector Search Analytics
Medium · LLM
Production RAG: Shipping a RAG System Into an Enterprise Product
Medium · RAG
HyDE: Search With the Answer You Wish You Had
Medium · RAG
Hierarchical Indices: Find the Section First, Then Find the Sentence
Medium · RAG
Chapters (10)
Welcome to the Weaviate Podcast!
2:17
An Introduction to Multi-Vector Search
8:17
Multi- vs. Single-Vector
11:12
Comparison with Cross Encoders
18:12
ColGrep for Coding Agents
32:51
Reasoning-Intensive Retrieval
44:19
Multimodal Multi-Vector
50:51
The Cost of Multi-Vector
55:43
MUVERA and PLAID
1:08:35
ColBERT-Zero and PyLate
🎓
Tutor Explanation
DeepCamp AI