Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

📰 Hugging Face Blog
Published 31 Jan 2025
Read full article → ← Back to News