Rethinking Pre-Training for Agentic AI [Aakanksha Chowdhery] - 759

TWIML AI Podcast · Advanced · 🤖 AI Agents & Automation · 3mo ago
Today, we're joined by Aakanksha Chowdhery, member of technical staff at Reflection, to explore the fundamental shifts required to build true agentic AI. While the industry has largely focused on post-training techniques to improve reasoning, Aakanksha draws on her experience leading pre-training efforts for Google’s PaLM and early Gemini models to argue that pre-training itself must be rethought to move beyond static benchmarks. We explore the limitations of next-token prediction for multi-step workflows and examine how attention mechanisms, loss objectives, and training data must evolve to s…

Chapters (11)

0:00 Introduction
2:26 Reflection
4:54 Limitations of post-training for building agents
7:31 Rethinking pre-training in agents
10:51 Scaling
11:27 Evolving attention mechanisms for agentic capabilities
12:39 Memory as a tool
14:13 Loss objectives and training data
15:50 Fine-tuning loss in agent performance
19:37 Training data
21:29 Augmenting dominant training da
