The curious case of LLama: How a leaked model sparked an open source AI revolution

DeepLearning Hero · Advanced ·🧠 Large Language Models ·2y ago
In this video, we discuss Llama an LLM developed by Meta that claims performance close to monolithic GPT-3. Llama has given birth to instruction finetuned open source models that are challenging ChatGPT. Meta's official announcement: https://ai.facebook.com/blog/large-language-model-llama-meta-ai/?ref=the-batch-deeplearning-ai Llama paper: https://arxiv.org/pdf/2302.13971.pdf Instruct GPT paper: https://arxiv.org/pdf/2203.02155.pdf Sparks of AGI (GPT-4 demo) paper: https://arxiv.org/pdf/2303.12712.pdf Chinchilla paper: https://arxiv.org/pdf/2203.15556.pdf RoPE paper: https://arxiv.org/pdf/210…
Watch on YouTube ↗ (saves to browser)

Chapters (11)

Introduction
0:37 OpenAI and secrecy
1:30 Instruction based finetuning
2:07 Introducing Llama
2:40 Mysterious leak of Llama
4:20 Motivation for Llama
5:50 Deepmind's Chinchilla scaling laws
8:32 Architecture of Llama
9:18 How RoPE works
11:37 Training and evaluating Llama
13:06 Conclusion
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Next Up
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)