Build real-time multimodal agents with Gemini and Pipecat

Google for Developers · Beginner ·🧠 Large Language Models ·15h ago
Chad Bailey from the Pipecat team walks through what's possible with the new Gemini 3 multimodal real-time model: flight search, lodging lookup, Google Search grounding, trip report generation, and a language tutor agent, all in a single voice conversation. Note: The public string for this model is gemini-3.1-flash-live. The string used in the video is for the Early Access Partner program and is now turned down. What's covered: Scaffolding a bot with the Pipecat CLI, configuring Gemini 3 with minimal thinking for lower latency, writing system prompts that hold up across long conversations, d…
Watch on YouTube ↗ (saves to browser)
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Next Up
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)