Prefill-Time Intervention for Mitigating Hallucination in Large Vision-Language Models
📰 ArXiv cs.AI
arXiv:2604.25642v1 Announce Type: cross Abstract: Large Vision-Language Models (LVLMs) have achieved remarkable progress in visual-textual understanding, yet their reliability is critically undermined by hallucinations, i.e., the generation of factually incorrect or inconsistent responses. While recent studies using steering vectors demonstrated promise in reducing hallucinations, a notable challenge remains: they inadvertently amplify the severity of residual hallucinations. We attribute this t
DeepCamp AI