Large Language Models are Powerful Electronic Health Record Encoders

📰 ArXiv cs.AI

arXiv:2502.17403v5 Announce Type: replace-cross Abstract: Electronic Health Records (EHRs) offer considerable potential for clinical prediction, but their complexity and heterogeneity challenge traditional machine learning. Domain-specific EHR foundation models trained on unlabeled EHR data have shown improved predictive accuracy and generalization. However, their development is constrained by limited data access and site-specific vocabularies. We convert EHR data into plain text by replacing me

Published 15 Apr 2026

Read full paper → ← Back to Reads