Evaluating AI Systems | Trends in AI - May 2025

Zeta Alpha · Advanced ·🧠 Large Language Models ·10mo ago
Join us for the Zeta Alpha "Trends in AI" webinar on Friday, May 9th at 8 AM PST / 5 PM CEST, live from LAB42 in Amsterdam, and online from San Francisco and around the globe. This month, we'll cover everything related to AI evaluations - from public benchmarking of LLMs and relevance metrics for RAG to popular evaluation libraries and the nuances of using the LLM-as-a-Judge approach for automated, continuous assessment of AI system performance. As always, we'll discuss recent model releases like Gemini 2.5 Flash, GPT-4.1, o3 & o4-mini, Qwen 3, and more, along with the most notable developmen…
Watch on YouTube ↗ (saves to browser)
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Next Up
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)