Variational Visual Question Answering for Uncertainty-Aware Selective Prediction

📰 ArXiv cs.AI

arXiv:2505.09591v3 (announce type: replace-cross). Abstract: Despite remarkable progress in recent years, Vision Language Models (VLMs) remain prone to overconfidence and hallucinations on tasks such as Visual Question Answering (VQA) and Visual Reasoning. Bayesian methods can potentially improve reliability by helping models predict selectively, that is, respond only when they are sufficiently confident. Unfortunately, such approaches can be costly and ineffective for large models, and ther
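To make the idea of selective prediction in the abstract concrete, here is a minimal sketch in which a model answers only when its confidence clears a threshold and abstains otherwise. It assumes the confidence is simply the highest answer probability; this is a common baseline, not the variational method the paper proposes. The function name `selective_predict` and the `threshold` parameter are hypothetical names chosen for illustration.

```python
from typing import Optional, Sequence


def selective_predict(
    probs: Sequence[float],
    answers: Sequence[str],
    threshold: float = 0.5,
) -> Optional[str]:
    """Return the most probable answer, or None (abstain) if confidence is too low.

    Assumption: confidence is the maximum answer probability. This is an
    illustrative baseline, not the paper's approach.
    """
    if not probs or len(probs) != len(answers):
        raise ValueError("probs and answers must be non-empty and the same length")
    # Index of the highest-probability candidate answer.
    best = max(range(len(probs)), key=lambda i: probs[i])
    # Answer only when the model is sufficiently confident; otherwise abstain.
    return answers[best] if probs[best] >= threshold else None


if __name__ == "__main__":
    candidates = ["cat", "dog", "bird"]
    print(selective_predict([0.7, 0.2, 0.1], candidates, threshold=0.6))
    print(selective_predict([0.4, 0.35, 0.25], candidates, threshold=0.6))
```

Raising the threshold trades coverage (how often the model answers) for lower risk on the questions it does answer, which is the trade-off selective prediction is meant to control.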

Published 14 Apr 2026
