Hierarchical Reasoning Model: Substance or Hype?
Free resources (reading list + visuals): https://www.patreon.com/c/JuliaTurc
HRM paper: https://arxiv.org/abs/2506.21734
Yacine's YouTube channel: https://www.youtube.com/@deeplearningexplained
In this video, we dive into the Hierarchical Reasoning Model (HRM), a new architecture from Sapient Intelligence that challenges scaling as the only way to advance AI. With just 27M parameters, about 1,000 training examples, and no pretraining, HRM still manages to place on the notoriously difficult ARC-AGI leaderboard, right next to models from OpenAI and Anthropic.
Together with Yacine Mahdid (neuroscience researcher & ML practitioner), we'll explore:
• Why vanilla Transformers plateau on tasks like Sudoku and maze solving
• How latent recurrence and hierarchical loops give HRM more reasoning depth
• The neuroscience inspiration (theta-gamma coupling in the hippocampus)
• HRM's controversial evaluation on ARC-AGI: was it a breakthrough or bending the rules?
• What this means for the future of reasoning in AI models
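The hierarchical loop mentioned above can be sketched in a few lines: a slow "H" module updates once per cycle while a fast "L" module iterates many times within each cycle. This is a minimal toy illustration, not the paper's code; the state sizes, update rules, and loop counts are simplified assumptions.

```python
import numpy as np

# Toy sketch of HRM-style nested latent recurrence (illustrative only;
# the update rules and dimensions here are simplified assumptions).
rng = np.random.default_rng(0)
D = 8                                 # latent state dimension (arbitrary)
W_h = rng.normal(size=(D, D)) * 0.1   # "H" (slow, abstract) module weights
W_l = rng.normal(size=(D, D)) * 0.1   # "L" (fast, detailed) module weights
x = rng.normal(size=D)                # stand-in for the embedded puzzle input

def step(z, W):
    """One recurrent update of a latent state (tanh keeps it bounded)."""
    return np.tanh(W @ z)

z_h = np.zeros(D)   # high-level state: updated once per outer cycle
z_l = np.zeros(D)   # low-level state: updated many times per cycle

N_CYCLES, T_INNER = 4, 8   # hierarchical loop: N slow cycles x T fast steps
for n in range(N_CYCLES):
    for t in range(T_INNER):
        # L iterates quickly, conditioned on the input and the current H state
        z_l = step(z_l + z_h + x, W_l)
    # H updates slowly, summarizing what L settled on this cycle
    z_h = step(z_h + z_l, W_h)

print(z_h.shape)  # prints (8,)
```

The point of the nesting is that effective depth is N_CYCLES × T_INNER recurrent steps, far deeper than a fixed Transformer stack, while gradients in HRM are approximated rather than backpropagated through every step (the video's "backprop-through-time approximation" chapter).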
Timestamps:
00:00 Introducing HRM
01:23 Why Sudoku breaks Transformers
03:07 Recurrence via Chain-of-Thought
04:22 HRM: bird's eye view
06:30 Latent recurrence
08:23 The neuroscience backing
11:43 The H and L modules
12:32 Backprop-through-time approximation
13:48 The outer loop
19:31 Training data augmentation
22:59 Evaluation on Sudoku
24:07 Evaluation on ARC-AGI