How does AI actually work? Transformers explained
How GPT and other large language models (LLMs) work. Transformers deep dive. #ai #llm #machinelearning #datascience #agi
Thanks to our sponsor Genspark. Try it for free https://bit.ly/4uM3PLS
Attention is all you need https://arxiv.org/html/1706.03762v7
0:00 Intro
0:33 The transformer model
1:30 Predicting the next word
2:30 Tokenization
5:06 Representing meaning
7:17 Positional encoding
9:17 Attention head
14:49 Genspark
16:35 Multiple heads
19:30 Add and norm
21:45 Feed forward neural net
24:08 Multiple decoder blocks
24:50 Final layer
27:03 Training the model
Newsletter: https://aisear…
Watch on YouTube ↗
(saves to browser)
Chapters (14)
Intro
0:33
The transformer model
1:30
Predicting the next word
2:30
Tokenization
5:06
Representing meaning
7:17
Positional encoding
9:17
Attention head
14:49
Genspark
16:35
Multiple heads
19:30
Add and norm
21:45
Feed forward neural net
24:08
Multiple decoder blocks
24:50
Final layer
27:03
Training the model
DeepCamp AI