Gemini 2.0 and the evolution of agentic AI | Oriol Vinyals

Google DeepMind · Beginner ·🤖 AI Agents & Automation ·1y ago

Skills: Agent Foundations90%

In this episode, Hannah is joined by Oriol Vinyals, VP of Drastic Research and Gemini co-lead. They discuss the evolution of agents from single-task models to more general-purpose models capable of broader applications, like Gemini. Vinyals guides Hannah through the two-step process behind multi modal models: pre-training (imitation learning) and post-training (reinforcement learning). They discuss the complexities of scaling and the importance of innovation in architecture and training processes. They close on a quick whirlwind tour of some of the new agentic capabilities recently released by Google DeepMind. Note: To see the full demos, unedited versions, and other videos related to Gemini 2.0 head to our Gemini playlist: https://www.youtube.com/playlist?list=PLqYmG7hTraZD8qyQmEfXrJMpGsQKk-LCY Timecodes 00:00 Intro 02:30 Games and early AI agents 04:28 Weights 09:27 Architectures and the digital brain 10:24 Agentic behaviour 13:31 Digital body 14:09 Scaling 19:02 Data 20:59 Complex understanding and knowledge 25:14 Post training challenges 30:43 Reasoning 33:11 Planning 34:19 Systems 2 37:00 Memory 40:54 Gemini and agentic capabilities Additional learning: https://deepmind.google/ https://www.youtube.com/watch?v=lH74gNeryhQ& https://youtu.be/64pndvbbokA?si=O9Ep7fD5eF5YUNYe Thanks to everyone who made this possible, including but not limited to: Presenter: Professor Hannah Fry Series Producer: Dan Hardoon Editor: Rami Tzabar, TellTale Studios Commissioner & Producer: Emma Yousif Music composition: Eleni Shaw Camera Director and Video Editor: Bernardo Resende Audio Engineer: Perry Rogantin Video Studio Production: Nicholas Duke Video Editor: Bilal Merhi Video Production Design: James Barton Visual Identity and Design: Eleanor Tomlinson Commissioned by Google DeepMind — Subscribe to our channel https://www.youtube.com/@UCP7jMXSY2xbc3KCAE0MHQ-A Find us on X https://twitter.com/GoogleDeepMind Follow us on Instagram https://instagram.com/googlede

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Google DeepMind · Google DeepMind · 0 of 60

← Previous Next →

RL Course by David Silver - Lecture 8: Integrating Learning and Planning

RL Course by David Silver - Lecture 8: Integrating Learning and Planning

Google DeepMind

RL Course by David Silver - Lecture 1: Introduction to Reinforcement Learning

RL Course by David Silver - Lecture 1: Introduction to Reinforcement Learning

Google DeepMind

RL Course by David Silver - Lecture 2: Markov Decision Process

RL Course by David Silver - Lecture 2: Markov Decision Process

Google DeepMind

RL Course by David Silver - Lecture 5: Model Free Control

RL Course by David Silver - Lecture 5: Model Free Control

Google DeepMind

RL Course by David Silver - Lecture 6: Value Function Approximation

RL Course by David Silver - Lecture 6: Value Function Approximation

Google DeepMind

RL Course by David Silver - Lecture 4: Model-Free Prediction

RL Course by David Silver - Lecture 4: Model-Free Prediction

Google DeepMind

RL Course by David Silver - Lecture 3: Planning by Dynamic Programming

RL Course by David Silver - Lecture 3: Planning by Dynamic Programming

Google DeepMind

RL Course by David Silver - Lecture 10: Classic Games

RL Course by David Silver - Lecture 10: Classic Games

Google DeepMind

RL Course by David Silver - Lecture 7: Policy Gradient Methods

RL Course by David Silver - Lecture 7: Policy Gradient Methods

Google DeepMind

Google DeepMind: Ground-breaking AlphaGo masters the game of Go

Google DeepMind: Ground-breaking AlphaGo masters the game of Go

Google DeepMind

Match 1 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Match 1 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Google DeepMind

Match 2 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Match 2 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Google DeepMind

Match 1 15 min Summary - Google DeepMind Challenge Match

Match 1 15 min Summary - Google DeepMind Challenge Match

Google DeepMind

Match 3 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Match 3 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Google DeepMind

Match 2 15 Minute Summary - Google DeepMind Challenge Match 2016

Match 2 15 Minute Summary - Google DeepMind Challenge Match 2016

Google DeepMind

Match 3 15 Minute Summary - Google DeepMind Challenge Match 2016

Match 3 15 Minute Summary - Google DeepMind Challenge Match 2016

Google DeepMind

Match 4 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Match 4 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Google DeepMind

Match 4 15 Minute Summary - Google DeepMind Challenge Match 2016

Match 4 15 Minute Summary - Google DeepMind Challenge Match 2016

Google DeepMind

Match 5 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Match 5 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Google DeepMind

Match 5 15 Minute Summary - Google DeepMind Challenge Match 2016

Match 5 15 Minute Summary - Google DeepMind Challenge Match 2016

Google DeepMind

DQN SPACE INVADERS

DQN SPACE INVADERS

Google DeepMind

Google DeepMind

Asynchronous Methods for Deep Reinforcement Learning: Labyrinth

Asynchronous Methods for Deep Reinforcement Learning: Labyrinth

Google DeepMind

Asynchronous Methods for Deep Reinforcement Learning: MuJoCo

Asynchronous Methods for Deep Reinforcement Learning: MuJoCo

Google DeepMind

Asynchronous Methods for Deep Reinforcement Learning: TORCS

Asynchronous Methods for Deep Reinforcement Learning: TORCS

Google DeepMind

Differentiable neural computer family tree inference task

Differentiable neural computer family tree inference task

Google DeepMind

StarCraft II DeepMind feature layer API

StarCraft II DeepMind feature layer API

Google DeepMind

DeepMind Health – Partnership with the Royal Free London NHS Foundation Trust

DeepMind Health – Partnership with the Royal Free London NHS Foundation Trust

Google DeepMind

DeepMind Health – Michael Wise – a patient's journey

DeepMind Health – Michael Wise – a patient's journey

Google DeepMind

Streams – a platform for a digital NHS

Streams – a platform for a digital NHS

Google DeepMind

DeepMind Lab - Nav Maze Level 1

DeepMind Lab - Nav Maze Level 1

Google DeepMind

DeepMind Lab - Stairway to Melon Level

DeepMind Lab - Stairway to Melon Level

Google DeepMind

DeepMind Lab - Laser Tag Space Bounce Level (Hard)

DeepMind Lab - Laser Tag Space Bounce Level (Hard)

Google DeepMind

Exploring the mysteries of Go with AlphaGo and China's top players

Exploring the mysteries of Go with AlphaGo and China's top players

Google DeepMind

Demis Hassabis on AlphaGo: its legacy and the 'Future of Go Summit' in Wuzhen, China

Demis Hassabis on AlphaGo: its legacy and the 'Future of Go Summit' in Wuzhen, China

Google DeepMind

The Future of Go Summit: AlphaGo & Ke Jie match 1 moves analysis

The Future of Go Summit: AlphaGo & Ke Jie match 1 moves analysis

Google DeepMind

The Future of Go Summit: AlphaGo & Ke Jie match 2 moves analysis

The Future of Go Summit: AlphaGo & Ke Jie match 2 moves analysis

Google DeepMind

The Future of Go Summit: Pair Go moves analysis

The Future of Go Summit: Pair Go moves analysis

Google DeepMind

The Future of Go Summit: AlphaGo & Ke Jie match 3 moves analysis

The Future of Go Summit: AlphaGo & Ke Jie match 3 moves analysis

Google DeepMind

Emergence of Locomotion Behaviours in Rich Environments

Emergence of Locomotion Behaviours in Rich Environments

Google DeepMind

StarCraft II 'mini games' for AI research

StarCraft II 'mini games' for AI research

Google DeepMind

Trained and untrained agents play StarCraft II full 1vs1 game

Trained and untrained agents play StarCraft II full 1vs1 game

Google DeepMind

DeepMind open source PySC2 toolset for Starcraft II

DeepMind open source PySC2 toolset for Starcraft II

Google DeepMind

ICML 2017: Test of Time Award (Sylvain Gelly & David Silver)

ICML 2017: Test of Time Award (Sylvain Gelly & David Silver)

Google DeepMind

Ke Jie and DeepMind's Go Ambassador Fan Hui review the 3rd AlphaGo vs Ke Jie game

Ke Jie and DeepMind's Go Ambassador Fan Hui review the 3rd AlphaGo vs Ke Jie game

Google DeepMind

Ke Jie and DeepMind's Go Ambassador Fan Hui review the 1st AlphaGo vs Ke Jie game

Ke Jie and DeepMind's Go Ambassador Fan Hui review the 1st AlphaGo vs Ke Jie game

Google DeepMind

Ke Jie and DeepMind's Go Ambassador Fan Hui review the 2nd AlphaGo vs Ke Jie game

Ke Jie and DeepMind's Go Ambassador Fan Hui review the 2nd AlphaGo vs Ke Jie game

Google DeepMind

AlphaGo Zero: Discovering new knowledge

AlphaGo Zero: Discovering new knowledge

Google DeepMind

AlphaGo Zero: Starting from scratch

AlphaGo Zero: Starting from scratch

Google DeepMind

Defining principles for tech companies in the NHS: DeepMind Health's Collaborative Listening Summit

Defining principles for tech companies in the NHS: DeepMind Health's Collaborative Listening Summit

Google DeepMind

A systems neuroscience approach to building AGI - Demis Hassabis, Singularity Summit 2010

A systems neuroscience approach to building AGI - Demis Hassabis, Singularity Summit 2010

Google DeepMind

Retour de Rémi Munos en France et ouverture de DeepMind Paris

Retour de Rémi Munos en France et ouverture de DeepMind Paris

Google DeepMind

Grid cells - Caswell Barry, UCL

Grid cells - Caswell Barry, UCL

Google DeepMind

DeepMind Health Research and Moorfields Eye Hospital NHS Foundation Trust: What our research shows

DeepMind Health Research and Moorfields Eye Hospital NHS Foundation Trust: What our research shows

Google DeepMind

DeepMind Health Research and Moorfields Eye Hospital NHS Foundation Trust: A Patient's Story

DeepMind Health Research and Moorfields Eye Hospital NHS Foundation Trust: A Patient's Story

Google DeepMind

Deep Learning 3: Neural Networks Foundations

Deep Learning 3: Neural Networks Foundations

Google DeepMind

Deep Learning 5: Optimization for Machine Learning

Deep Learning 5: Optimization for Machine Learning

Google DeepMind

Deep Learning 8: Unsupervised learning and generative models

Deep Learning 8: Unsupervised learning and generative models

Google DeepMind

Reinforcement Learning 1: Introduction to Reinforcement Learning

Reinforcement Learning 1: Introduction to Reinforcement Learning

Google DeepMind

Deep Learning 2: Introduction to TensorFlow

Deep Learning 2: Introduction to TensorFlow

Google DeepMind

More on: Agent Foundations

View skill →

Build and Deploy an Agent with Reasoning Engine in Vertex AI

Adding a Phone Gateway to a Virtual Agent

From Zero to Working AI Agent in 60 Seconds

From Zero to Working AI Agent in 60 Seconds

Create An AI Agent With Replit That Automates Your Sales

Create An AI Agent With Replit That Automates Your Sales

Capstone: Autonomous Runway Detection for IoT

Capstone: Autonomous Runway Detection for IoT

AI Agents with Model Context Protocol & Typescript

AI Agents with Model Context Protocol & Typescript

Related AI Lessons

Build Your Own AI Dream Team: Craft a Multi-Agent Research Assistant in Python!

Learn to build a multi-agent research assistant in Python that can search, gather, and synthesize information to deliver structured reports

AI agent payments enable autonomous transactions using cryptocurrency and HTTP 4

Enable autonomous transactions using AI agents and cryptocurrency with the x402 standard and AiFinPay SDK

The future of Web3 is autonomous AI agents paying each other for services. With

Learn how autonomous AI agents can pay each other for services using cryptocurrency and HTTP 402 protocol, enabling a new era of Web3 transactions

AI agent payments enable autonomous transactions using cryptocurrency and HTTP 4

Learn how AI agent payments enable autonomous transactions using cryptocurrency and HTTP, and how to implement them using the AIFinPay SDK

Chapters (15)

Intro

2:30 Games and early AI agents

4:28 Weights

9:27 Architectures and the digital brain

10:24 Agentic behaviour

13:31 Digital body

14:09 Scaling

19:02 Data

20:59 Complex understanding and knowledge

25:14 Post training challenges

30:43 Reasoning

33:11 Planning

34:19 Systems 2

37:00 Memory

40:54 Gemini and agentic capabilities

Streamline Employee Onboarding with AI

The Information