Self Attention in Transformers | Transformers in Deep Learning
We dive deep into Self Attention in Transformers! Self attention is the key mechanism that lets models like BERT and GPT capture long-range dependencies within text, making them so powerful for NLP tasks. We break down how self attention works, walking through the math of how it turns input embeddings into new, context-aware word representations. Whether you're new to Transformers or looking to strengthen your understanding, this video gives a clear and accessible explanation with visuals and the complete mathematics.
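As a rough preview of the computation discussed in the video, here is a minimal NumPy sketch of scaled dot-product self attention; the function and variable names below are our own illustration, not taken from the video.

import numpy as np

def self_attention(X, Wq, Wk, Wv):
    # Project token embeddings into queries, keys, and values
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    # Similarity of every token with every other token, scaled by sqrt(d_k)
    scores = Q @ K.T / np.sqrt(K.shape[-1])
    # Row-wise softmax so each token's attention weights sum to 1
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    # Each new word representation is a weighted mix of all value vectors
    return weights @ V

# Toy example: 4 tokens with 8-dimensional embeddings (made-up sizes)
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))
Wq, Wk, Wv = (rng.normal(size=(8, 8)) for _ in range(3))
print(self_attention(X, Wq, Wk, Wv).shape)  # (4, 8)

Because every output row is a weighted combination of all the value vectors, each token's new representation can depend on every other token in the sequence, which is what lets the model capture long-range dependencies.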
➖➖➖➖➖➖➖➖➖➖➖➖➖➖➖
Timestamps:
…
Chapters (11)
0:00 Intro
1:13 The Problem
4:00 Self Attention Overview
6:04 Self Attention Mathematics - Part 1
19:20 Self Attention as Gravity
20:07 Problems with the equation
26:51 Self Attention Complete
31:18 Benefits of Self Attention
34:30 Recap of Self Attention
38:53 Self Attention in the form of matrix multiplication
42:39 Outro
DeepCamp AI