The Winds of AI Winter (Q2 Four Wars of the AI Stack Recap)

Latent Space · Beginner ·🧠 Large Language Models ·1y ago

Key Takeaways

This video discusses the current state of AI, recapping Q2 2024, and the concept of AI winter, with a focus on LLMs

Original Description

Thank you for 1m downloads of the podcast and 2m readers of the Substack! 🎉 This is the audio discussion following The Winds of AI Winter essay that also serves as a recap of Q2 2024 in AI viewed through the lens of our Four Wars framework. Enjoy! 00:00:00 Intro Song by Suno.ai 00:02:01 Swyx and Alessio in Singapore 00:05:49 GPU Rich vs Poors: Frontier Labs 00:06:35 GPU Rich Frontier Models: Claude 3.5 00:10:37 GPU Rich helping Poors: Llama 3.1: The Synthetic Data Model 00:15:41 GPU Rich helping Poors: Frontier Labs Vibe Shift - Phi 3, Gemma 2 00:18:26 GPU Rich: Mistral Large 00:21:56 GPU Rich: Nvidia + FlashAttention 3 00:23:45 GPU Rich helping Poors: Noam Shazeer & Character.AI 00:28:14 GPU Poors: On Device LLMs: Mozilla Llamafile, Chrome (Gemini Nano), Apple Intelligence 00:35:33 Quality Data Wars: NYT vs The Atlantic lawyer up vs partner up 00:37:41 Quality Data Wars: Reddit, ScarJo, RIAA vs Udio & Suno 00:41:03 Quality Data Wars: Synthetic Data, Jagged Intelligence, AlphaProof 00:45:33 Multimodality War: ChatGPT Voice Mode, OpenAI demo at AIEWF 00:47:34 Multimodality War: Meta Llama 3 multimodality + Chameleon 00:50:54 Multimodality War: PaliGemma + CoPaliGemma 00:52:55 Renaming Rag/Ops War to LLM OS War 00:55:31 LLM OS War: Ops War: Prompt Management vs Gateway vs Observability 01:02:57 LLM OS War: BM42 Vector DB Wars, Memory Databases, GraphRAG 01:06:15 LLM OS War: Agent Tooling 01:08:26 LLM OS War: Agent Protocols 01:10:43 Trend: Commoditization of Intelligence 01:16:45 Trend: Vertical Service as Software, AI Employees, Brightwave, Dropzone 01:20:44 Trend: Benchmark Frontiers after MMLU 01:23:31 Crowdstrike will save us from Skynet

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Latent Space · Latent Space · 39 of 60

← Previous Next →

Ep 18: Petaflops to the People — with George Hotz of tinycorp

Ep 18: Petaflops to the People — with George Hotz of tinycorp

FlashAttention-2: Making Transformers 800% faster AND exact

FlashAttention-2: Making Transformers 800% faster AND exact

RWKV: Reinventing RNNs for the Transformer Era

RWKV: Reinventing RNNs for the Transformer Era

Generating your AI Media Empire - with Youssef Rizk of Wondercraft.ai

Generating your AI Media Empire - with Youssef Rizk of Wondercraft.ai

RAG is a hack - with Jerry Liu of LlamaIndex

RAG is a hack - with Jerry Liu of LlamaIndex

The End of Finetuning — with Jeremy Howard of Fast.ai

The End of Finetuning — with Jeremy Howard of Fast.ai

Why AI Agents Don't Work (yet) - with Kanjun Qiu of Imbue

Why AI Agents Don't Work (yet) - with Kanjun Qiu of Imbue

Powering your Copilot for Data - with Artem Keydunov from Cube.dev

Powering your Copilot for Data - with Artem Keydunov from Cube.dev

Beating GPT-4 with Open Source Models - with Michael Royzen of Phind

Beating GPT-4 with Open Source Models - with Michael Royzen of Phind

The State of Silicon and the GPU Poors - with Dylan Patel of SemiAnalysis

The State of Silicon and the GPU Poors - with Dylan Patel of SemiAnalysis

The "Normsky" architecture for AI coding agents — with Beyang Liu + Steve Yegge of SourceGraph

The "Normsky" architecture for AI coding agents — with Beyang Liu + Steve Yegge of SourceGraph

The AI-First Graphics Editor - with Suhail Doshi of Playground AI

The AI-First Graphics Editor - with Suhail Doshi of Playground AI

The Accidental AI Canvas - with Steve Ruiz of tldraw

The Accidental AI Canvas - with Steve Ruiz of tldraw

The Origin and Future of RLHF: the secret ingredient for ChatGPT - with Nathan Lambert

The Origin and Future of RLHF: the secret ingredient for ChatGPT - with Nathan Lambert

The Four Wars of the AI Stack - Dec 2023 Recap

The Four Wars of the AI Stack - Dec 2023 Recap

The State of AI in production — with David Hsu of Retool

The State of AI in production — with David Hsu of Retool

Building an open AI company - with Ce and Vipul of Together AI

Building an open AI company - with Ce and Vipul of Together AI

Truly Serverless Infra for AI Engineers - with Erik Bernhardsson of Modal

Truly Serverless Infra for AI Engineers - with Erik Bernhardsson of Modal

A Brief History of the Open Source AI Hacker - with Ben Firshman of Replicate

A Brief History of the Open Source AI Hacker - with Ben Firshman of Replicate

Open Source AI is AI we can Trust — with Soumith Chintala of Meta AI

Open Source AI is AI we can Trust — with Soumith Chintala of Meta AI

Making Transformers Sing - with Mikey Shulman of Suno

Making Transformers Sing - with Mikey Shulman of Suno

A Comprehensive Overview of Large Language Models - Latent Space Paper Club

A Comprehensive Overview of Large Language Models - Latent Space Paper Club

Why Google failed to make GPT-3 -- with David Luan of Adept

Why Google failed to make GPT-3 -- with David Luan of Adept

Personal AI Meetup - Bee, BasedHardware, LangChain LangFriend, Deepgram EmilyAI

Personal AI Meetup - Bee, BasedHardware, LangChain LangFriend, Deepgram EmilyAI

Supervise the Process of AI Research — with Jungwon Byun and Andreas Stuhlmüller of Elicit

Supervise the Process of AI Research — with Jungwon Byun and Andreas Stuhlmüller of Elicit

Breaking down the OG GPT Paper by Alec Radford

Breaking down the OG GPT Paper by Alec Radford

High Agency Pydantic over VC Backed Frameworks — with Jason Liu of Instructor

High Agency Pydantic over VC Backed Frameworks — with Jason Liu of Instructor

This World Does Not Exist — Joscha Bach, Karan Malhotra, Rob Haisfield (WorldSim, WebSim, Liquid AI)

This World Does Not Exist — Joscha Bach, Karan Malhotra, Rob Haisfield (WorldSim, WebSim, Liquid AI)

LLM Asia Paper Club Survey Round

LLM Asia Paper Club Survey Round

How to train a Million Context LLM — with Mark Huang of Gradient.ai

How to train a Million Context LLM — with Mark Huang of Gradient.ai

How AI is Eating Finance - with Mike Conover of Brightwave

How AI is Eating Finance - with Mike Conover of Brightwave

How To Hire AI Engineers (ft. James Brady and Adam Wiggins of Elicit)

How To Hire AI Engineers (ft. James Brady and Adam Wiggins of Elicit)

State of the Art: Training 70B LLMs on 10,000 H100 clusters

State of the Art: Training 70B LLMs on 10,000 H100 clusters

The 10,000x Yolo Researcher Metagame — with Yi Tay of Reka

The 10,000x Yolo Researcher Metagame — with Yi Tay of Reka

Training Llama 2, 3 & 4: The Path to Open Source AGI — with Thomas Scialom of Meta AI

Training Llama 2, 3 & 4: The Path to Open Source AGI — with Thomas Scialom of Meta AI

[LLM Paper Club] Llama 3.1 Paper: The Llama Family of Models

[LLM Paper Club] Llama 3.1 Paper: The Llama Family of Models

Synthetic data + tool use for LLM improvements 🦙

Synthetic data + tool use for LLM improvements 🦙

RLHF vs SFT to break out of local maxima 📈

RLHF vs SFT to break out of local maxima 📈

The Winds of AI Winter (Q2 Four Wars of the AI Stack Recap)

The Winds of AI Winter (Q2 Four Wars of the AI Stack Recap)

Segment Anything 2: Memory + Vision = Object Permanence — with Nikhila Ravi and Joseph Nelson

Segment Anything 2: Memory + Vision = Object Permanence — with Nikhila Ravi and Joseph Nelson

Answer.ai & AI Magic with Jeremy Howard

Answer.ai & AI Magic with Jeremy Howard

Is finetuning GPT4o worth it?

Is finetuning GPT4o worth it?

Personal benchmarks vs HumanEval - with Nicholas Carlini of DeepMind

Personal benchmarks vs HumanEval - with Nicholas Carlini of DeepMind

Building AGI with OpenAI's Structured Outputs API

Building AGI with OpenAI's Structured Outputs API

Q* for model distillation 🍓

Q* for model distillation 🍓

Finetuning LoRAs on BILLIONS of tokens 🤖

Finetuning LoRAs on BILLIONS of tokens 🤖

Cursor UX team is CRACKED 💻

Cursor UX team is CRACKED 💻

Choosing the BEST OpenAI model 🏆

Choosing the BEST OpenAI model 🏆

How will OpenAI voice mode change API design?

How will OpenAI voice mode change API design?

STEALING OpenAI models data 🥷

STEALING OpenAI models data 🥷

[Paper Club] 🍓 On Reasoning: Q-STaR and Friends!

[Paper Club] 🍓 On Reasoning: Q-STaR and Friends!

[Paper Club] Writing in the Margins: Chunked Prefill KV Caching for Long Context Retrieval

[Paper Club] Writing in the Margins: Chunked Prefill KV Caching for Long Context Retrieval

The Ultimate Guide to Prompting - with Sander Schulhoff from LearnPrompting.org

The Ultimate Guide to Prompting - with Sander Schulhoff from LearnPrompting.org

llm.c's Origin and the Future of LLM Compilers - Andrej Karpathy at CUDA MODE

llm.c's Origin and the Future of LLM Compilers - Andrej Karpathy at CUDA MODE

Prompt Engineer is NOT a job 📝

Prompt Engineer is NOT a job 📝

Prompt Mining LLMs for better prompts ⛏️

Prompt Mining LLMs for better prompts ⛏️

The six pillars of few-shot prompting 🔧

The six pillars of few-shot prompting 🔧

Language Agents: From Reasoning to Acting — with Shunyu Yao of OpenAI, Harrison Chase of LangGraph

Language Agents: From Reasoning to Acting — with Shunyu Yao of OpenAI, Harrison Chase of LangGraph

[Paper Club] Who Validates the Validators? Aligning LLM-Judges with Humans (w/ Eugene Yan)

[Paper Club] Who Validates the Validators? Aligning LLM-Judges with Humans (w/ Eugene Yan)

Can you separate intelligence and knowledge?

Can you separate intelligence and knowledge?

Related AI Lessons

I taught myself to code 5 months ago and built an autonomous AI red-team tester — testyourllm.com

A piano teacher with no coding background built an autonomous AI red-team tester in 5 months, which successfully broke Llama 3.3 70B on the first try

Reddit r/artificial

ChatGPT vs Claude vs Gemini in 2026: Honest Comparison

Learn how ChatGPT, Claude, and Gemini compare in 2026 and which one is best for specific tasks

LLMs Do Not Know Your Life

LLMs provide internet-average advice that may not apply to individual circumstances, highlighting the importance of critical thinking and human judgment

Progress for Machines, Obedience for People

Learn to critically evaluate the impact of technology on society and distinguish between progress for machines and obedience for people, understanding the importance of responsible AI development and deployment.

Chapters (25)

Intro Song by Suno.ai

2:01 Swyx and Alessio in Singapore

5:49 GPU Rich vs Poors: Frontier Labs

6:35 GPU Rich Frontier Models: Claude 3.5

10:37 GPU Rich helping Poors: Llama 3.1: The Synthetic Data Model

15:41 GPU Rich helping Poors: Frontier Labs Vibe Shift - Phi 3, Gemma 2

18:26 GPU Rich: Mistral Large

21:56 GPU Rich: Nvidia + FlashAttention 3

23:45 GPU Rich helping Poors: Noam Shazeer & Character.AI

28:14 GPU Poors: On Device LLMs: Mozilla Llamafile, Chrome (Gemini Nano), Apple Intell

35:33 Quality Data Wars: NYT vs The Atlantic lawyer up vs partner up

37:41 Quality Data Wars: Reddit, ScarJo, RIAA vs Udio & Suno

41:03 Quality Data Wars: Synthetic Data, Jagged Intelligence, AlphaProof

45:33 Multimodality War: ChatGPT Voice Mode, OpenAI demo at AIEWF

47:34 Multimodality War: Meta Llama 3 multimodality + Chameleon

50:54 Multimodality War: PaliGemma + CoPaliGemma

52:55 Renaming Rag/Ops War to LLM OS War

55:31 LLM OS War: Ops War: Prompt Management vs Gateway vs Observability

1:02:57 LLM OS War: BM42 Vector DB Wars, Memory Databases, GraphRAG

1:06:15 LLM OS War: Agent Tooling

1:08:26 LLM OS War: Agent Protocols

1:10:43 Trend: Commoditization of Intelligence

1:16:45 Trend: Vertical Service as Software, AI Employees, Brightwave, Dropzone

1:20:44 Trend: Benchmark Frontiers after MMLU

1:23:31 Crowdstrike will save us from Skynet

5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems

Dave Ebbelaar (LLM Eng)