Why building eval platforms is hard — Phil Hetzel, Braintrust

AI Engineer · Intermediate ·🛠️ AI Tools & Apps ·1w ago

Skills: RAG Evaluation90%AI Workflow Automation60%

An eval platform is not just a test runner. You are building shared definitions of "good," reliable data pipelines, labelling workflows, versioning, and trust in results across many teams and model changes. This session breaks down the hidden complexity, the common failure modes, and the design principles that make evals credible and usable in day-to-day engineering. Speaker info: - https://www.linkedin.com/in/philliphetzel/

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

More on: RAG Evaluation

View skill →

[Evals Workshop] Mastering AI Evaluation: From Playground to Production

[Evals Workshop] Mastering AI Evaluation: From Playground to Production

GenAI Interview Questions: LLM Evaluation Pipeline in Production #generativeai

GenAI Interview Questions: LLM Evaluation Pipeline in Production #generativeai

[Full Workshop] Building Metrics that actually work — David Karam, Pi Labs (fmr Google Search)

[Full Workshop] Building Metrics that actually work — David Karam, Pi Labs (fmr Google Search)

Build a RAG Evaluation Tool and Python Library

Build a RAG Evaluation Tool and Python Library

[VOD] First Look At Claude 3 - Can It Beat GPT-4?

[VOD] First Look At Claude 3 - Can It Beat GPT-4?

Advanced LLM Evaluation Techniques: Chapter 22

Advanced LLM Evaluation Techniques: Chapter 22

Weights & Biases

Related AI Lessons

The handoff is the workflow

The handoff between AI tools is the key to a successful workflow, not the tools themselves

Free AI Tools for Students in 2026 That Actually Save Time

Discover free AI tools that can save students time and increase productivity in 2026

I Built an AI Lead Generation Tool in 24 Hours That Captures Leads While I Sleep

Learn how to build an AI lead generation tool in under 24 hours to automate lead capture and boost sales

10 AI Productivity Tools to Supercharge Your Workflow in 2024

Discover 10 AI productivity tools to streamline your workflow and boost efficiency in 2024

How She Builds Trust-Driven Global Marketing

Digital Web Solutions