How We Built Zeta2: Training an Edit Prediction Model in Production — Ben Kunkle, Zed
To validate settled data, Zed ran 10 frontier model predictions per example and measured Levenshtein distance to the final state. For 100,000 training examples that is a million frontier model requests, which is prohibitively expensive. The fix: Zeta 2's student model now approaches teacher quality, so they run it 50 times instead at negligible cost. Ben Conungle, edit predictions lead at Zed, walks through how this pipeline came together.
The pipeline pulls opt in production edit traces, distills them through a frontier teacher, and routes bad predictions through a repair step before formatting for the student. The ideal training examples sit in the middle of the Levenshtein distance distribution: too close to the settled state is obvious, too far is noise. A metric called reversal ratio, how often the model undoes exactly what the user just typed, was the key diagnostic for catching bad model behavior before shipping.
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
More on: LLM Engineering
View skill →Related AI Lessons
⚡
⚡
⚡
⚡
Cost Analysis of my $6.4k Local LLM Server
Reddit r/LocalLLaMA
# NotebookLM’s Latest Features and What They Mean for Research Workflows
Medium · AI
Everytime You Type a Prompt into ChatGPT, a Data Center Consumes Water to Stop Servers From…
Medium · ChatGPT
Docling + VectorLess + Gemma 3.5 Flash To Get Higher Accuracy
Medium · AI
🎓
Tutor Explanation
DeepCamp AI