Search Agents with Nandan Thakur - Weaviate Podcast #137!

Weaviate vector database · Advanced ·🤖 AI Agents & Automation ·1w ago
Dr. Nandan Thakur returns to the Weaviate Podcast fresh off defending his dissertation to discuss the evolution from neural retrieval to agentic search and his new work on Orbit, a synthetic training data pipeline for search agents. The conversation opens with reflections on his PhD journey, tracing the field's shift from ColBERT-style models and sparse retrievers through RAG and into today's agentic search paradigm where LLMs iteratively search, reason, and refine. The discussion dives deep into how Orbit generates multi-hop, riddle-style training queries using DeepSeek's API on a personal laptop over four to six months, making high-quality search agent training data accessible without massive compute budgets. Thakur draws a sharp distinction between deep research (broad, multi-tool report generation) and search agents (focused on search and browse tools to answer specific questions), then connects Orbit's multi-hop queries to BrowseComp's filter-style riddles where each clue narrows the answer space like a funnel. The conversation explores the design of deep research harnesses, chunking strategies, Anthropic's contextual retrieval for entity disambiguation, context compaction to manage bloated agent contexts, and memory services like Weaviate's Engram for compressing search results between reasoning rounds. From there, the episode tackles sequential versus parallel search trajectories, the pass@K approach to rollouts in GRPO training, and whether isolated trajectories should share progress through message passing. Thakur makes a compelling case for training search agents to produce keyword-focused queries optimized for BM25 versus semantic queries for dense retrieval: the idea that one query does not fit all search engines. The conversation closes on future directions: efficiency-focused Pareto frontiers for search agents, long-form report generation evaluation through TREC RAG, and the coming wave of multilingual and multimodal search benchmarks. Chapters: 0:0
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

Understanding Real-Time Customer Intent: The New Frontier for Retail AI Chatbots
Learn how retail AI chatbots can leverage real-time customer intent to drive sales and loyalty, and why it matters for modern retail
Medium · AI
Artificial Intelligence Is Not Replacing Humans - It’s Replacing Certain Behaviors
AI is replacing certain human behaviors, not humans themselves, and understanding this distinction is crucial for effective AI integration
Medium · AI
How I cut my LangChain agent's token costs by 93% with one import
Cut LangChain agent's token costs by 93% with a simple import and optimization technique
Dev.to · Mahika jadhav
5 Passive Income Streams Your AI Agent Can Run While You Sleep
Automate passive income streams with AI agents to earn money while you sleep, leveraging affiliate marketing, print-on-demand stores, and more
Dev.to AI
Up next
Introducing Interwhen: Steering reasoning agents with real-time verification
Microsoft Research
Watch →