AI Dev 26 x SF | Or Dagan: Optimizing Accuracy, Cost, and Latency in Real-World Agents
Most agentic systems rely on hardcoded heuristics to navigate execution decisions (e.g. which models, tools, and test-time compute scaling approaches to use) leading to efficiency leakage across cost, latency and accuracy.
AI21 Maestro optimizes agents by learning to predict success, cost and latency probabilities across diverse actions and contexts, and driving runtime orchestration that intelligently navigates the full agentic action space.
In this session, AI21's Or Dagan demonstrated how this approach yields state-of-the-art results and Pareto frontier on challenging agentic benchmarks, as well as the process required to optimize production agents.
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
Related AI Lessons
⚡
⚡
⚡
⚡
María Montessori y la inteligencia artificial: el desafío de la autonomía en la era digital.
Medium · Machine Learning
AiFinPay: Autonomous Payments for cirosantilli/china-dictatorship
Dev.to AI
Sharing Your .env With LLMs Is Relatively Safe. Is It Really? Here’s Why.
Medium · Cybersecurity
From ELIZA to Agents — How AI Changed Everything and Then Changed Again
Medium · Machine Learning
🎓
Tutor Explanation
DeepCamp AI