Build real-time multimodal agents with Gemini and Pipecat
Chad Bailey from the Pipecat team walks through what's possible with the new Gemini 3 multimodal real-time model: flight search, lodging lookup, Google Search grounding, trip report generation, and a language tutor agent, all in a single voice conversation.
Note: The public model string is gemini-3.1-flash-live. The string used in the video belonged to the Early Access Partner program and has since been shut down.
What's covered: Scaffolding a bot with the Pipecat CLI, configuring Gemini 3 with minimal thinking for lower latency, writing system prompts that hold up across long conversations, d…