AI safety…ok doomer | Anca Dragan

Google DeepMind · Beginner ·🛡️ AI Safety & Ethics ·1y ago

Skills: AI Alignment Basics80%AI Ethics & Policy70%

Building safe and capable models is one of the greatest challenges of our time. Can we make AI work for everyone? How do we prevent existential threats? Why is alignment so important? Join Professor Hannah Fry as she delves into these critical questions with Anca Dragan, lead for AI safety and alignment at Google DeepMind. Want to share feedback? Have a suggestion for a guest that we should have on next? Why not leave a review on YouTube and stay tuned for future episodes. Timecodes: 00:00 Introduction to Anca Dragan 02:16 Short and long term risks 04:35 Designing a safe bridge 05:36 Robotics 06:56 Human and AI interaction 12:33 The objective of alignment 14:30 Value alignment and recommendation systems 17:57 Ways to approach alignment with competing objectives 19:54 Deliberative alignment 22:24 Scalable oversight 23:33 Example of scalable oversight 26:14 What comes next? 27:20 Gemini 30:14 Long term risk and frontier safety framework 35:09 Importance of AI safety 38:02 Conclusion Further reading: https://deepmind.google/discover/blog/introducing-the-frontier-safety-framework/ https://arxiv.org/pdf/2403.13793 ___ Search for Google DeepMind: The Podcast on: Spotify: https://open.spotify.com/show/39fjU5Q5L5UecTCRMeqjwb Apple Podcasts: https://podcasts.apple.com/gb/podcast/google-deepmind-the-podcast/id1476316441 IHeartRadio: https://www.iheart.com/podcast/269-deepmind-the-podcast-48983807/ Thanks to everyone who made this possible, including but not limited to: Presenter: Professor Hannah Fry Series Producer: Dan Hardoon Editor: Rami Tzabar, TellTale Studios Commissioner & Producer: Emma Yousif Music composition: Eleni Shaw Camera Director and Video Editor: Tommy Bruce Audio Engineer: Perry Rogantin Video Studio Production: Nicholas Duke Video Editor: Bilal Merhi Video Production Design: James Barton Visual Identity and Design: Eleanor Tomlinson Commissioned by Google DeepMind ___ Subscribe to our channel https://www.youtube.com/@UCP7jMXSY2xbc3KCA

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Google DeepMind · Google DeepMind · 0 of 60

← Previous Next →

RL Course by David Silver - Lecture 8: Integrating Learning and Planning

RL Course by David Silver - Lecture 8: Integrating Learning and Planning

Google DeepMind

RL Course by David Silver - Lecture 1: Introduction to Reinforcement Learning

RL Course by David Silver - Lecture 1: Introduction to Reinforcement Learning

Google DeepMind

RL Course by David Silver - Lecture 2: Markov Decision Process

RL Course by David Silver - Lecture 2: Markov Decision Process

Google DeepMind

RL Course by David Silver - Lecture 5: Model Free Control

RL Course by David Silver - Lecture 5: Model Free Control

Google DeepMind

RL Course by David Silver - Lecture 6: Value Function Approximation

RL Course by David Silver - Lecture 6: Value Function Approximation

Google DeepMind

RL Course by David Silver - Lecture 4: Model-Free Prediction

RL Course by David Silver - Lecture 4: Model-Free Prediction

Google DeepMind

RL Course by David Silver - Lecture 3: Planning by Dynamic Programming

RL Course by David Silver - Lecture 3: Planning by Dynamic Programming

Google DeepMind

RL Course by David Silver - Lecture 10: Classic Games

RL Course by David Silver - Lecture 10: Classic Games

Google DeepMind

RL Course by David Silver - Lecture 7: Policy Gradient Methods

RL Course by David Silver - Lecture 7: Policy Gradient Methods

Google DeepMind

Google DeepMind: Ground-breaking AlphaGo masters the game of Go

Google DeepMind: Ground-breaking AlphaGo masters the game of Go

Google DeepMind

Match 1 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Match 1 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Google DeepMind

Match 2 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Match 2 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Google DeepMind

Match 1 15 min Summary - Google DeepMind Challenge Match

Match 1 15 min Summary - Google DeepMind Challenge Match

Google DeepMind

Match 3 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Match 3 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Google DeepMind

Match 2 15 Minute Summary - Google DeepMind Challenge Match 2016

Match 2 15 Minute Summary - Google DeepMind Challenge Match 2016

Google DeepMind

Match 3 15 Minute Summary - Google DeepMind Challenge Match 2016

Match 3 15 Minute Summary - Google DeepMind Challenge Match 2016

Google DeepMind

Match 4 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Match 4 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Google DeepMind

Match 4 15 Minute Summary - Google DeepMind Challenge Match 2016

Match 4 15 Minute Summary - Google DeepMind Challenge Match 2016

Google DeepMind

Match 5 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Match 5 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo

Google DeepMind

Match 5 15 Minute Summary - Google DeepMind Challenge Match 2016

Match 5 15 Minute Summary - Google DeepMind Challenge Match 2016

Google DeepMind

DQN SPACE INVADERS

DQN SPACE INVADERS

Google DeepMind

Google DeepMind

Asynchronous Methods for Deep Reinforcement Learning: Labyrinth

Asynchronous Methods for Deep Reinforcement Learning: Labyrinth

Google DeepMind

Asynchronous Methods for Deep Reinforcement Learning: MuJoCo

Asynchronous Methods for Deep Reinforcement Learning: MuJoCo

Google DeepMind

Asynchronous Methods for Deep Reinforcement Learning: TORCS

Asynchronous Methods for Deep Reinforcement Learning: TORCS

Google DeepMind

Differentiable neural computer family tree inference task

Differentiable neural computer family tree inference task

Google DeepMind

StarCraft II DeepMind feature layer API

StarCraft II DeepMind feature layer API

Google DeepMind

DeepMind Health – Partnership with the Royal Free London NHS Foundation Trust

DeepMind Health – Partnership with the Royal Free London NHS Foundation Trust

Google DeepMind

DeepMind Health – Michael Wise – a patient's journey

DeepMind Health – Michael Wise – a patient's journey

Google DeepMind

Streams – a platform for a digital NHS

Streams – a platform for a digital NHS

Google DeepMind

DeepMind Lab - Nav Maze Level 1

DeepMind Lab - Nav Maze Level 1

Google DeepMind

DeepMind Lab - Stairway to Melon Level

DeepMind Lab - Stairway to Melon Level

Google DeepMind

DeepMind Lab - Laser Tag Space Bounce Level (Hard)

DeepMind Lab - Laser Tag Space Bounce Level (Hard)

Google DeepMind

Exploring the mysteries of Go with AlphaGo and China's top players

Exploring the mysteries of Go with AlphaGo and China's top players

Google DeepMind

Demis Hassabis on AlphaGo: its legacy and the 'Future of Go Summit' in Wuzhen, China

Demis Hassabis on AlphaGo: its legacy and the 'Future of Go Summit' in Wuzhen, China

Google DeepMind

The Future of Go Summit: AlphaGo & Ke Jie match 1 moves analysis

The Future of Go Summit: AlphaGo & Ke Jie match 1 moves analysis

Google DeepMind

The Future of Go Summit: AlphaGo & Ke Jie match 2 moves analysis

The Future of Go Summit: AlphaGo & Ke Jie match 2 moves analysis

Google DeepMind

The Future of Go Summit: Pair Go moves analysis

The Future of Go Summit: Pair Go moves analysis

Google DeepMind

The Future of Go Summit: AlphaGo & Ke Jie match 3 moves analysis

The Future of Go Summit: AlphaGo & Ke Jie match 3 moves analysis

Google DeepMind

Emergence of Locomotion Behaviours in Rich Environments

Emergence of Locomotion Behaviours in Rich Environments

Google DeepMind

StarCraft II 'mini games' for AI research

StarCraft II 'mini games' for AI research

Google DeepMind

Trained and untrained agents play StarCraft II full 1vs1 game

Trained and untrained agents play StarCraft II full 1vs1 game

Google DeepMind

DeepMind open source PySC2 toolset for Starcraft II

DeepMind open source PySC2 toolset for Starcraft II

Google DeepMind

ICML 2017: Test of Time Award (Sylvain Gelly & David Silver)

ICML 2017: Test of Time Award (Sylvain Gelly & David Silver)

Google DeepMind

Ke Jie and DeepMind's Go Ambassador Fan Hui review the 3rd AlphaGo vs Ke Jie game

Ke Jie and DeepMind's Go Ambassador Fan Hui review the 3rd AlphaGo vs Ke Jie game

Google DeepMind

Ke Jie and DeepMind's Go Ambassador Fan Hui review the 1st AlphaGo vs Ke Jie game

Ke Jie and DeepMind's Go Ambassador Fan Hui review the 1st AlphaGo vs Ke Jie game

Google DeepMind

Ke Jie and DeepMind's Go Ambassador Fan Hui review the 2nd AlphaGo vs Ke Jie game

Ke Jie and DeepMind's Go Ambassador Fan Hui review the 2nd AlphaGo vs Ke Jie game

Google DeepMind

AlphaGo Zero: Discovering new knowledge

AlphaGo Zero: Discovering new knowledge

Google DeepMind

AlphaGo Zero: Starting from scratch

AlphaGo Zero: Starting from scratch

Google DeepMind

Defining principles for tech companies in the NHS: DeepMind Health's Collaborative Listening Summit

Defining principles for tech companies in the NHS: DeepMind Health's Collaborative Listening Summit

Google DeepMind

A systems neuroscience approach to building AGI - Demis Hassabis, Singularity Summit 2010

A systems neuroscience approach to building AGI - Demis Hassabis, Singularity Summit 2010

Google DeepMind

Retour de Rémi Munos en France et ouverture de DeepMind Paris

Retour de Rémi Munos en France et ouverture de DeepMind Paris

Google DeepMind

Grid cells - Caswell Barry, UCL

Grid cells - Caswell Barry, UCL

Google DeepMind

DeepMind Health Research and Moorfields Eye Hospital NHS Foundation Trust: What our research shows

DeepMind Health Research and Moorfields Eye Hospital NHS Foundation Trust: What our research shows

Google DeepMind

DeepMind Health Research and Moorfields Eye Hospital NHS Foundation Trust: A Patient's Story

DeepMind Health Research and Moorfields Eye Hospital NHS Foundation Trust: A Patient's Story

Google DeepMind

Deep Learning 3: Neural Networks Foundations

Deep Learning 3: Neural Networks Foundations

Google DeepMind

Deep Learning 5: Optimization for Machine Learning

Deep Learning 5: Optimization for Machine Learning

Google DeepMind

Deep Learning 8: Unsupervised learning and generative models

Deep Learning 8: Unsupervised learning and generative models

Google DeepMind

Reinforcement Learning 1: Introduction to Reinforcement Learning

Reinforcement Learning 1: Introduction to Reinforcement Learning

Google DeepMind

Deep Learning 2: Introduction to TensorFlow

Deep Learning 2: Introduction to TensorFlow

Google DeepMind

More on: AI Alignment Basics

View skill →

Interpretable machine learning applications: Part 5

Interpretable machine learning applications: Part 5

GenAI news from Weights & Biases CEO, Lukas Biewald

GenAI news from Weights & Biases CEO, Lukas Biewald

Weights & Biases

Responsible AI Winners, 2020 PyTorch Summer Hackathon

Responsible AI Winners, 2020 PyTorch Summer Hackathon

Near Real-Time Analytics to GenAI Centralized Observability | Amazon Web Services

Near Real-Time Analytics to GenAI Centralized Observability | Amazon Web Services

Amazon Web Services

Kiro Hooks | Event-Driven Automation for Your IDE | Amazon Web Services

Kiro Hooks | Event-Driven Automation for Your IDE | Amazon Web Services

Amazon Web Services

Get Started with Raven AGI

Get Started with Raven AGI

Related AI Lessons

When AI Gets It Wrong: The Hidden Security Risk of Hallucinations in Cybersecurity

Learn about the hidden security risk of AI hallucinations in cybersecurity and how to mitigate it

AI Can't Stop AI? Wrong Problem. Wrong Layer.

AI security issues require a layered approach, not a single solution

Dev.to · Cor E

The Hidden Cost of Every Query You Send

Learn how AI queries impact your electricity bill and the environment, and what you can do to reduce the hidden cost

Dev.to · Talal Ahmad

When AI Becomes Someone

Learn how AI intimacy can impact safety and why it matters for future AI development

Chapters (16)

Introduction to Anca Dragan

2:16 Short and long term risks

4:35 Designing a safe bridge

5:36 Robotics

6:56 Human and AI interaction

12:33 The objective of alignment

14:30 Value alignment and recommendation systems

17:57 Ways to approach alignment with competing objectives

19:54 Deliberative alignment

22:24 Scalable oversight

23:33 Example of scalable oversight

26:14 What comes next?

27:20 Gemini

30:14 Long term risk and frontier safety framework

35:09 Importance of AI safety

38:02 Conclusion

AI Management Essentials: Integrating ISO 42001 & ISO 23894