Representation geometry shapes task performance in vision-language modeling for CT enterography
📰 ArXiv cs.AI
arXiv:2604.13021v1 Announce Type: cross Abstract: Computed tomography (CT) enterography is a primary imaging modality for assessing inflammatory bowel disease (IBD), yet the representational choices that best support automated analysis of this modality are unknown. We present the first study of vision-language transfer learning on abdominal CT enterography and identify two main findings. First, mean pooling of slice embeddings gives better categorical disease assessment (59.2\% three-class accur
DeepCamp AI