Doctorina MedBench: End-to-End Evaluation of Agent-Based Medical AI

📰 ArXiv cs.AI

arXiv:2603.25821v1 Announce Type: cross Abstract: We present Doctorina MedBench, a comprehensive evaluation framework for agent-based medical AI based on the simulation of realistic physician-patient interactions. Unlike traditional medical benchmarks that rely on solving standardized test questions, the proposed approach models a multi-step clinical dialogue in which either a physician or an AI system must collect medical history, analyze attached materials (including laboratory reports, images

Published 30 Mar 2026

Read full paper → ← Back to News