How We Built Zeta2: Training an Edit Prediction Model in Production — Ben Kunkle, Zed

AI Engineer · Advanced ·🧠 Large Language Models ·6h ago
To validate settled data, Zed ran 10 frontier model predictions per example and measured Levenshtein distance to the final state. For 100,000 training examples that is a million frontier model requests, which is prohibitively expensive. The fix: Zeta 2's student model now approaches teacher quality, so they run it 50 times instead at negligible cost. Ben Conungle, edit predictions lead at Zed, walks through how this pipeline came together. The pipeline pulls opt in production edit traces, distills them through a frontier teacher, and routes bad predictions through a repair step before formatting for the student. The ideal training examples sit in the middle of the Levenshtein distance distribution: too close to the settled state is obvious, too far is noise. A metric called reversal ratio, how often the model undoes exactly what the user just typed, was the key diagnostic for catching bad model behavior before shipping.
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

Cost Analysis of my $6.4k Local LLM Server
Learn how to calculate the total cost of ownership of a local LLM server versus API equivalent and make informed decisions about your AI infrastructure
Reddit r/LocalLLaMA
# NotebookLM’s Latest Features and What They Mean for Research Workflows
NotebookLM's new features enhance research workflows by improving document handling, and understanding these updates is crucial for efficient research and development
Medium · AI
Everytime You Type a Prompt into ChatGPT, a Data Center Consumes Water to Stop Servers From…
Learn how data centers consume water to cool servers when using ChatGPT, and why it matters for AI sustainability
Medium · ChatGPT
Docling + VectorLess + Gemma 3.5 Flash To Get Higher Accuracy
Improve AI accuracy with Docling, VectorLess, and Gemma 3.5 for financial statement analysis
Medium · AI
Up next
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)
Watch →