Gemini 2.0 and the evolution of agentic AI | Oriol Vinyals

Google DeepMind · Beginner · 🤖 AI Agents & Automation · 1y ago
In this episode, Hannah is joined by Oriol Vinyals, VP of Drastic Research and Gemini co-lead. They discuss the evolution of agents from single-task models to more general-purpose models capable of broader applications, like Gemini. Vinyals guides Hannah through the two-step process behind multimodal models: pre-training (imitation learning) and post-training (reinforcement learning). They discuss the complexities of scaling and the importance of innovation in architecture and training processes. They close with a whirlwind tour of some of the new agentic capabilities recently released by Google DeepMind.

Note: To see the full demos, unedited versions, and other videos related to Gemini 2.0, head to our Gemini playlist: https://www.youtube.com/playlist?list=PLqYmG7hTraZD8qyQmEfXrJMpGsQKk-LCY

Timecodes
00:00 Intro
02:30 Games and early AI agents
04:28 Weights
09:27 Architectures and the digital brain
10:24 Agentic behaviour
13:31 Digital body
14:09 Scaling
19:02 Data
20:59 Complex understanding and knowledge
25:14 Post training challenges
30:43 Reasoning
33:11 Planning
34:19 Systems 2
37:00 Memory
40:54 Gemini and agentic capabilities

Additional learning:
https://deepmind.google/
https://www.youtube.com/watch?v=lH74gNeryhQ&
https://youtu.be/64pndvbbokA?si=O9Ep7fD5eF5YUNYe

Thanks to everyone who made this possible, including but not limited to:
Presenter: Professor Hannah Fry
Series Producer: Dan Hardoon
Editor: Rami Tzabar, TellTale Studios
Commissioner & Producer: Emma Yousif
Music composition: Eleni Shaw
Camera Director and Video Editor: Bernardo Resende
Audio Engineer: Perry Rogantin
Video Studio Production: Nicholas Duke
Video Editor: Bilal Merhi
Video Production Design: James Barton
Visual Identity and Design: Eleanor Tomlinson
Commissioned by Google DeepMind

Subscribe to our channel https://www.youtube.com/@UCP7jMXSY2xbc3KCAE0MHQ-A
Find us on X https://twitter.com/GoogleDeepMind
Follow us on Instagram https://instagram.com/googlede

Chapters (15)

0:00 Intro
2:30 Games and early AI agents
4:28 Weights
9:27 Architectures and the digital brain
10:24 Agentic behaviour
13:31 Digital body
14:09 Scaling
19:02 Data
20:59 Complex understanding and knowledge
25:14 Post training challenges
30:43 Reasoning
33:11 Planning
34:19 Systems 2
37:00 Memory
40:54 Gemini and agentic capabilities