MiniCPM4.1-8B: First Open-Source Reasoning LLM with Trainable Sparse Attention

OpenBMB · Advanced · 🧠 Large Language Models · 6mo ago
🚀 Introducing MiniCPM4.1-8B: First Open-Source Reasoning LLM with Trainable Sparse Attention

✅ Strong Reasoning Capability: Surpasses similar-sized models on 15 tasks!
✅ Fast Generation: 3x decoding speedup for reasoning
✅ Efficient Architecture: Trainable sparse attention, frequency-ranked speculative decoding

Download Models:
Huggingface: https://huggingface.co/openbmb/MiniCPM4.1-8B
Github: https://github.com/OpenBMB/MiniCPM
Technical Report: https://arxiv.org/pdf/2506.07900