To See or To Please: Uncovering Visual Sycophancy and Split Beliefs in VLMs
📰 ArXiv cs.AI
arXiv:2603.18373v2. Abstract: When VLMs answer correctly, do they genuinely rely on visual information or exploit language shortcuts? We introduce the Tri-Layer Diagnostic Framework, which disentangles hallucination sources via three metrics: Latent Anomaly Detection (perceptual awareness), Visual Necessity Score (visual dependency, measured via KL divergence), and Competition Score (conflict between visual grounding and instruction following). Using counterfactual in
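The abstract says only that the Visual Necessity Score measures visual dependency via KL divergence; it does not give the exact formula. As a rough illustration, the sketch below assumes the score compares the model's answer distribution with the image present against the distribution with the image removed or blanked. The function names (`softmax`, `kl_divergence`, `visual_necessity`), the direction of the divergence, and the choice of reference condition are all assumptions made for this example, not the paper's definition.

```python
import math


def softmax(logits):
    """Convert raw logits to a probability distribution (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) in nats for two discrete distributions of equal length."""
    if len(p) != len(q):
        raise ValueError("distributions must have the same length")
    # eps guards against log(0) when q assigns zero mass to an outcome.
    return sum(
        pi * math.log((pi + eps) / (qi + eps))
        for pi, qi in zip(p, q)
        if pi > 0
    )


def visual_necessity(logits_with_image, logits_without_image):
    """Hypothetical score: how far the answer distribution moves when the
    image is taken away. Assumed form, not taken from the paper."""
    p = softmax(logits_with_image)
    q = softmax(logits_without_image)
    return kl_divergence(p, q)


# Identical logits: the image changed nothing, so the score is zero.
same = visual_necessity([1.0, 2.0, 3.0], [1.0, 2.0, 3.0])

# Very different logits: the answer depends heavily on the image.
different = visual_necessity([5.0, 0.0, 0.0], [0.0, 0.0, 5.0])
```

Under this reading, a score near zero would suggest the model could have produced the same answer from the text alone, while a large score would suggest the image materially shaped the output.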