AI safety…ok doomer | Anca Dragan
Building safe and capable models is one of the greatest challenges of our time. Can we make AI work for everyone? How do we prevent existential threats? Why is alignment so important? Join Professor Hannah Fry as she delves into these critical questions with Anca Dragan, lead for AI safety and alignment at Google DeepMind.
Want to share feedback? Have a suggestion for a guest that we should have on next? Why not leave a review on YouTube and stay tuned for future episodes.
Timecodes:
00:00 Introduction to Anca Dragan
02:16 Short and long term risks
04:35 Designing a safe bridge
05:36 Robotics
06:56 Human and AI interaction
12:33 The objective of alignment
14:30 Value alignment and recommendation systems
17:57 Ways to approach alignment with competing objectives
19:54 Deliberative alignment
22:24 Scalable oversight
23:33 Example of scalable oversight
26:14 What comes next?
27:20 Gemini
30:14 Long term risk and frontier safety framework
35:09 Importance of AI safety
38:02 Conclusion
Further reading:
https://deepmind.google/discover/blog/introducing-the-frontier-safety-framework/
https://arxiv.org/pdf/2403.13793
___
Search for Google DeepMind: The Podcast on:
Spotify: https://open.spotify.com/show/39fjU5Q5L5UecTCRMeqjwb
Apple Podcasts: https://podcasts.apple.com/gb/podcast/google-deepmind-the-podcast/id1476316441
IHeartRadio: https://www.iheart.com/podcast/269-deepmind-the-podcast-48983807/
Thanks to everyone who made this possible, including but not limited to:
Presenter: Professor Hannah Fry
Series Producer: Dan Hardoon
Editor: Rami Tzabar, TellTale Studios
Commissioner & Producer: Emma Yousif
Music composition: Eleni Shaw
Camera Director and Video Editor: Tommy Bruce
Audio Engineer: Perry Rogantin
Video Studio Production: Nicholas Duke
Video Editor: Bilal Merhi
Video Production Design: James Barton
Visual Identity and Design: Eleanor Tomlinson
Commissioned by Google DeepMind
___
Subscribe to our channel https://www.youtube.com/@UCP7jMXSY2xbc3KCA
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
Playlist
Uploads from Google DeepMind · Google DeepMind · 0 of 60
← Previous
Next →
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
RL Course by David Silver - Lecture 8: Integrating Learning and Planning
Google DeepMind
RL Course by David Silver - Lecture 1: Introduction to Reinforcement Learning
Google DeepMind
RL Course by David Silver - Lecture 2: Markov Decision Process
Google DeepMind
RL Course by David Silver - Lecture 5: Model Free Control
Google DeepMind
RL Course by David Silver - Lecture 6: Value Function Approximation
Google DeepMind
RL Course by David Silver - Lecture 4: Model-Free Prediction
Google DeepMind
RL Course by David Silver - Lecture 3: Planning by Dynamic Programming
Google DeepMind
RL Course by David Silver - Lecture 10: Classic Games
Google DeepMind
RL Course by David Silver - Lecture 7: Policy Gradient Methods
Google DeepMind
Google DeepMind: Ground-breaking AlphaGo masters the game of Go
Google DeepMind
Match 1 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo
Google DeepMind
Match 2 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo
Google DeepMind
Match 1 15 min Summary - Google DeepMind Challenge Match
Google DeepMind
Match 3 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo
Google DeepMind
Match 2 15 Minute Summary - Google DeepMind Challenge Match 2016
Google DeepMind
Match 3 15 Minute Summary - Google DeepMind Challenge Match 2016
Google DeepMind
Match 4 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo
Google DeepMind
Match 4 15 Minute Summary - Google DeepMind Challenge Match 2016
Google DeepMind
Match 5 - Google DeepMind Challenge Match: Lee Sedol vs AlphaGo
Google DeepMind
Match 5 15 Minute Summary - Google DeepMind Challenge Match 2016
Google DeepMind
DQN SPACE INVADERS
Google DeepMind
DQN Breakout
Google DeepMind
Asynchronous Methods for Deep Reinforcement Learning: Labyrinth
Google DeepMind
Asynchronous Methods for Deep Reinforcement Learning: MuJoCo
Google DeepMind
Asynchronous Methods for Deep Reinforcement Learning: TORCS
Google DeepMind
Differentiable neural computer family tree inference task
Google DeepMind
StarCraft II DeepMind feature layer API
Google DeepMind
DeepMind Health – Partnership with the Royal Free London NHS Foundation Trust
Google DeepMind
DeepMind Health – Michael Wise – a patient's journey
Google DeepMind
Streams – a platform for a digital NHS
Google DeepMind
DeepMind Lab - Nav Maze Level 1
Google DeepMind
DeepMind Lab - Stairway to Melon Level
Google DeepMind
DeepMind Lab - Laser Tag Space Bounce Level (Hard)
Google DeepMind
Exploring the mysteries of Go with AlphaGo and China's top players
Google DeepMind
Demis Hassabis on AlphaGo: its legacy and the 'Future of Go Summit' in Wuzhen, China
Google DeepMind
The Future of Go Summit: AlphaGo & Ke Jie match 1 moves analysis
Google DeepMind
The Future of Go Summit: AlphaGo & Ke Jie match 2 moves analysis
Google DeepMind
The Future of Go Summit: Pair Go moves analysis
Google DeepMind
The Future of Go Summit: AlphaGo & Ke Jie match 3 moves analysis
Google DeepMind
Emergence of Locomotion Behaviours in Rich Environments
Google DeepMind
StarCraft II 'mini games' for AI research
Google DeepMind
Trained and untrained agents play StarCraft II full 1vs1 game
Google DeepMind
DeepMind open source PySC2 toolset for Starcraft II
Google DeepMind
ICML 2017: Test of Time Award (Sylvain Gelly & David Silver)
Google DeepMind
Ke Jie and DeepMind's Go Ambassador Fan Hui review the 3rd AlphaGo vs Ke Jie game
Google DeepMind
Ke Jie and DeepMind's Go Ambassador Fan Hui review the 1st AlphaGo vs Ke Jie game
Google DeepMind
Ke Jie and DeepMind's Go Ambassador Fan Hui review the 2nd AlphaGo vs Ke Jie game
Google DeepMind
AlphaGo Zero: Discovering new knowledge
Google DeepMind
AlphaGo Zero: Starting from scratch
Google DeepMind
Defining principles for tech companies in the NHS: DeepMind Health's Collaborative Listening Summit
Google DeepMind
A systems neuroscience approach to building AGI - Demis Hassabis, Singularity Summit 2010
Google DeepMind
Retour de Rémi Munos en France et ouverture de DeepMind Paris
Google DeepMind
Grid cells - Caswell Barry, UCL
Google DeepMind
DeepMind Health Research and Moorfields Eye Hospital NHS Foundation Trust: What our research shows
Google DeepMind
DeepMind Health Research and Moorfields Eye Hospital NHS Foundation Trust: A Patient's Story
Google DeepMind
Deep Learning 3: Neural Networks Foundations
Google DeepMind
Deep Learning 5: Optimization for Machine Learning
Google DeepMind
Deep Learning 8: Unsupervised learning and generative models
Google DeepMind
Reinforcement Learning 1: Introduction to Reinforcement Learning
Google DeepMind
Deep Learning 2: Introduction to TensorFlow
Google DeepMind
More on: AI Alignment Basics
View skill →Related AI Lessons
Chapters (16)
Introduction to Anca Dragan
2:16
Short and long term risks
4:35
Designing a safe bridge
5:36
Robotics
6:56
Human and AI interaction
12:33
The objective of alignment
14:30
Value alignment and recommendation systems
17:57
Ways to approach alignment with competing objectives
19:54
Deliberative alignment
22:24
Scalable oversight
23:33
Example of scalable oversight
26:14
What comes next?
27:20
Gemini
30:14
Long term risk and frontier safety framework
35:09
Importance of AI safety
38:02
Conclusion
🎓
Tutor Explanation
DeepCamp AI