Benchmarking and Adapting On-Device LLMs for Clinical Decision Support

📰 ArXiv cs.AI

arXiv:2601.03266v2 Announce Type: replace-cross Abstract: Large language models (LLMs) have rapidly advanced in clinical decision-making, yet the deployment of proprietary systems is hindered by privacy concerns and reliance on cloud-based infrastructure. Open-source alternatives allow local inference but often have large model sizes that limit their use in resource-constrained clinical settings. Here, we benchmark on-device LLMs from the gpt-oss (20b, 120b), Qwen3.5 (9B, 27B, 35B), and Gemma 4

Published 29 Apr 2026
Read full paper → ← Back to Reads