Give Your Robot a Voice with Gemini Live

Google for Developers · Intermediate · 🧠 Large Language Models · 10h ago
Run Gemini 3.1 Flash Live on an open-source robot. Thor from Google DeepMind connects the Gemini Live API to Reachy Mini, an open-source robot from Hugging Face and Pollen Robotics. The demo shows real-time conversation, vision, multilingual switching, music generation with Lyria 3, Google Search grounding, and body movement synced to speech, all running through the Gemini Live API.
What's covered: setting up the Reachy Mini SDK and environment, integrating the Gemini Live API using Antigravity, configuring voices and system prompt profiles, enabling real-time weather via Google Search grounding, generatin…
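As a rough illustration of the "voices and system prompt profiles" and "Google Search grounding" items, the sketch below builds a Live API session config as a plain dictionary. It is not taken from the video's codebase: the profile names, prompt text, and the `build_live_config` helper are hypothetical, and the voice names are only examples of prebuilt voices. The dictionary layout follows the config shape the google-genai Python SDK accepts for Live sessions, as far as we know.

```python
# Hypothetical personality profiles; names and prompt text are illustrative only.
PROFILES = {
    "friendly": {
        "voice": "Puck",
        "prompt": "You are a cheerful desk robot. Keep replies short.",
    },
    "calm": {
        "voice": "Kore",
        "prompt": "You are a calm, concise assistant robot.",
    },
}


def build_live_config(profile_name: str, enable_search: bool = True) -> dict:
    """Return a Live API style config dict for one personality profile."""
    profile = PROFILES[profile_name]
    config = {
        # Ask the model to answer with spoken audio.
        "response_modalities": ["AUDIO"],
        # The system prompt sets the robot's personality.
        "system_instruction": profile["prompt"],
        # Select a prebuilt voice for speech output.
        "speech_config": {
            "voice_config": {
                "prebuilt_voice_config": {"voice_name": profile["voice"]}
            }
        },
    }
    if enable_search:
        # Google Search grounding, e.g. for live weather questions.
        config["tools"] = [{"google_search": {}}]
    return config


if __name__ == "__main__":
    print(build_live_config("calm"))
```

With the google-genai SDK installed, a dictionary like this would typically be passed as the `config` argument when opening a Live session (for example via `client.aio.live.connect(model=..., config=...)`). The model ID is left out here because the listing does not give it.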

Chapters (9)

0:00 Intro
1:39 Generating Moody Synth Music
3:18 Multilingual Modes: German, French & Chinese
4:40 Intro to the Open-Source Hardware & Gemini Live API
5:46 SDK & Virtual Environment Setup
7:37 Exploring the Codebase: System Prompts & Personalities
8:56 Under the Hood: Lyria 3 Music & Search Grounding
10:08 How Reachy Moves: Head Wobbler & Speech Sway
10:58 Final Thoughts & Community Use Cases