Shipping complex AI applications — Braintrust & Trainline

AI Engineer · Intermediate ·🤖 AI Agents & Automation ·1w ago
Getting a prototype working is straightforward. Making it reliable in production, especially with multi-step agents, tool use, and real users is the hard part. In this hands-on workshop, you'll work through the core parts of building production-grade AI applications with Giran Moodley, Mayank Soni, and Oussama Hafferssas. Socials: - https://uk.linkedin.com/in/mayank-soni - https://x.com/OussamaHaff - https://www.linkedin.com/in/giran/ Timestamps 0:00 - Introduction and Welcome 4:07 - Workshop Overview and Agenda 4:39 - Understanding AI Engineering and Operational Challenges 9:55 - Introduction to Braintrust 12:56 - Experience from Trainline 28:35 - Building the Support Triage Agent (Overview) 33:57 - Basic Implementation: Single Shot Prompting 40:32 - Adding Local Tools for Determinism 41:30 - Implementing Specialist Stages (Agentic Flow) 46:19 - Instrumenting and Tracing the Application 56:43 - Evaluating AI Systems and Golden Data Sets 1:05:07 - Deploying and Managing AI in Production 1:13:58 - Online Scoring and Monitoring Production Logs 1:19:13 - Identifying and Remediating Failure Modes 1:33:05 - Key Takeaways and Summary 1:36:58 - Further Resources and Documentation
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

Chapters (16)

Introduction and Welcome
4:07 Workshop Overview and Agenda
4:39 Understanding AI Engineering and Operational Challenges
9:55 Introduction to Braintrust
12:56 Experience from Trainline
28:35 Building the Support Triage Agent (Overview)
33:57 Basic Implementation: Single Shot Prompting
40:32 Adding Local Tools for Determinism
41:30 Implementing Specialist Stages (Agentic Flow)
46:19 Instrumenting and Tracing the Application
56:43 Evaluating AI Systems and Golden Data Sets
1:05:07 Deploying and Managing AI in Production
1:13:58 Online Scoring and Monitoring Production Logs
1:19:13 Identifying and Remediating Failure Modes
1:33:05 Key Takeaways and Summary
1:36:58 Further Resources and Documentation
Up next
Inside Abridge: The AI Listening to 100 Million Doctor Visits — Abridge's Janie Lee & Chai Asawa
Latent Space
Watch →