AI Dev 26 x SF | Ara Khan: Evals Are Broken Use Them Anyway

DeepLearningAI · Intermediate ·🤖 AI Agents & Automation ·2h ago
This talk by Cline's Ara Khan explains why they went from "evals are useless" to using them as a core part of my agent improvement loop. I share practical heuristics for interpreting, running, and creating evals, and why doing them anyway is better than pure "vibes".
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

Agent Portability Is the Next AI Lock-In Problem
Agent portability is becoming a concern in the AI market as companies focus on building better agents, potentially leading to lock-in problems
Medium · AI
Agent Portability Is the Next AI Lock-In Problem
Agent portability is becoming a concern in the AI market as companies focus on building better agents, highlighting the need for interoperability and standards
Medium · Deep Learning
Agent Portability Is the Next AI Lock-In Problem
Agent portability is becoming a major concern in the AI market as companies focus on building better agents, and understanding this issue is crucial for avoiding lock-in problems
Medium · LLM
Using Docker Compose for AI Agent Development
Learn to use Docker Compose for AI agent development and simplify your workflow
Medium · AI
Up next
Antigravity 2.0 + Codex + Agent OS Just Changed Everything
Julian Goldie SEO
Watch →