Hierarchical Reasoning Model: Substance or Hype?

Julia Turc · Advanced · 📄 Research Papers Explained · 8mo ago
📚 Free resources (reading list + visuals): https://www.patreon.com/c/JuliaTurc
📃 HRM paper: https://arxiv.org/abs/2506.21734
▶️ Yacine's YouTube channel: https://www.youtube.com/@deeplearningexplained

In this video, we dive into the Hierarchical Reasoning Model (HRM), a new architecture from Sapient Intelligence that challenges scaling as the only way to advance AI. With only 27M parameters, 1000 training examples, and no pretraining, HRM still manages to place on the notoriously difficult ARC-AGI leaderboard, right next to models from OpenAI and Anthropic.

Together with Yacine Mahdid (neuroscience researcher & ML practitioner), we'll explore:
• Why vanilla Transformers plateau on tasks like Sudoku and Maze solving
• How latent recurrence and hierarchical loops give HRM more reasoning depth
• The neuroscience inspiration (theta–gamma coupling in the hippocampus 🧠)
• HRM's controversial evaluation on ARC-AGI: was it a breakthrough or bending the rules?
• What this means for the future of reasoning in AI models

Timestamps:
00:00 Introducing HRM
01:23 Why Sudoku breaks Transformers
03:07 Recurrence via Chain-of-Thought
04:22 HRM: bird's eye view
06:30 Latent recurrence
08:23 The neuroscience backing
11:43 The H and L modules
12:32 Backprop-through-time approximation
13:48 The outer loop
19:31 Training data augmentation
22:59 Evaluation on Sudoku
24:07 Evaluation on ARC-AGI
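
To make the "hierarchical loops" idea from the list above a little more concrete, here is a minimal toy sketch of a slow high-level (H) state that updates once for every few fast low-level (L) steps. Everything in it is an illustrative assumption: the function names, the scalar states, the fixed weights, and the loop counts are hypothetical and are not taken from the paper or the video. It only shows the nesting pattern, not HRM's actual modules or training procedure.

```python
import math


def low_step(z_l, z_h, x):
    # Hypothetical fast (L) update: mixes the low-level state,
    # the current high-level state, and the input.
    return math.tanh(0.5 * z_l + 0.3 * z_h + 0.2 * x)


def high_step(z_h, z_l):
    # Hypothetical slow (H) update: folds the latest low-level
    # state into the high-level state.
    return math.tanh(0.6 * z_h + 0.4 * z_l)


def hierarchical_forward(x, n_cycles=2, t_steps=3):
    """Run n_cycles high-level updates, each preceded by t_steps low-level updates."""
    z_h, z_l = 0.0, 0.0
    low_updates = 0
    high_updates = 0
    for _ in range(n_cycles):
        for _ in range(t_steps):
            z_l = low_step(z_l, z_h, x)
            low_updates += 1
        z_h = high_step(z_h, z_l)
        high_updates += 1
    return z_h, low_updates, high_updates


if __name__ == "__main__":
    z_h, n_low, n_high = hierarchical_forward(1.0)
    print(f"final H state: {z_h:.4f}, L updates: {n_low}, H updates: {n_high}")
```

In this toy version the recurrence happens entirely in the latent states `z_h` and `z_l`, with no intermediate tokens emitted; that is the sense in which the depth comes from looping rather than from adding parameters.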

