MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs
📰 ArXiv cs.AI
arXiv:2602.12705v4 Announce Type: replace-cross Abstract: We present MedXIAOHE, a medical vision-language foundation model designed to advance general-purpose medical understanding and reasoning in real-world clinical applications. MedXIAOHE achieves state-of-the-art performance across diverse medical benchmarks and surpasses leading closed-source multimodal systems on multiple capabilities. To achieve this, we propose an entity-aware continual pretraining framework that organizes heterogeneous
DeepCamp AI