DO-Bench: An Attributable Benchmark for Diagnosing Object Hallucination in Vision-Language Models

📰 ArXiv cs.AI

arXiv:2604.22822v1 Announce Type: cross Abstract: Object level hallucination remains a central reliability challenge for vision language models (VLMs), particularly in binary object existence verification. Existing benchmarks emphasize aggregate accuracy but rarely disentangle whether errors stem from perceptual limitations or from the influence of contextual textual priors, leaving underlying failure mechanisms ambiguous. We introduce DO-Bench, a controlled diagnostic benchmark that isolates th

Published 28 Apr 2026
Read full paper → ← Back to Reads