Representation geometry shapes task performance in vision-language modeling for CT enterography

📰 ArXiv cs.AI

arXiv:2604.13021v1 Announce Type: cross Abstract: Computed tomography (CT) enterography is a primary imaging modality for assessing inflammatory bowel disease (IBD), yet the representational choices that best support automated analysis of this modality are unknown. We present the first study of vision-language transfer learning on abdominal CT enterography and identify two main findings. First, mean pooling of slice embeddings gives better categorical disease assessment (59.2\% three-class accur

Published 15 Apr 2026
Read full paper → ← Back to Reads