MiniCPM4 - Efficient Edge-Side Large Model

OpenBMB ยท Advanced ยท๐Ÿง  Large Language Models ยท8mo ago
๐Ÿš€ MiniCPM4 is here! 5x faster on end devices ๐Ÿ”ฅ โœจ What's new: ๐Ÿ—๏ธ Efficient Model Architecture - InfLLM v2 -- Trainable Sparse Attention Mechanism ๐Ÿง  Efficient Learning Algorithms - Model Wind Tunnel 2.0 -- Efficient Predictable Scaling - BitCPM -- Ultimate Ternary Quantization ๐Ÿ“š High-Quality Training Data - UltraClean -- High-quality Pre-training Data Filtering and Generation - UltraChat v2 -- High-quality Supervised Fine-tuning Data Generation โšก Efficient Inference System: - CPM.cu -- Light Lightweight and Efficient CUDA Inference Framework - ArkInfer -- Cross-Platform Deployment Frameworโ€ฆ
Watch on YouTube โ†— (saves to browser)
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Next Up
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)