How We Built Zeta2: Training an Edit Prediction Model in Production — Ben Kunkle, Zed

Name: How We Built Zeta2: Training an Edit Prediction Model in Production — Ben Kunkle, Zed
Uploaded: 2026-05-30T16:00:06Z
Channel: AI Engineer
Description: To validate settled data, Zed ran 10 frontier model predictions per example and measured Levenshtein distance to the final state. For 100,000 training e...

AI Engineer · Advanced ·🧠 Large Language Models ·6h ago

Skills: LLM Engineering90%Model Deployment70%

To validate settled data, Zed ran 10 frontier model predictions per example and measured Levenshtein distance to the final state. For 100,000 training examples that is a million frontier model requests, which is prohibitively expensive. The fix: Zeta 2's student model now approaches teacher quality, so they run it 50 times instead at negligible cost. Ben Conungle, edit predictions lead at Zed, walks through how this pipeline came together. The pipeline pulls opt in production edit traces, distills them through a frontier teacher, and routes bad predictions through a repair step before formatting for the student. The ideal training examples sit in the middle of the Levenshtein distance distribution: too close to the settled state is obvious, too far is noise. A metric called reversal ratio, how often the model undoes exactly what the user just typed, was the key diagnostic for catching bad model behavior before shipping.

Watch on YouTube ↗ (saves to browser)