VidAudio-Bench: Benchmarking V2A and VT2A Generation across Four Audio Categories

📰 ArXiv cs.AI

arXiv:2604.10542v1 Announce Type: cross Abstract: Video-to-Audio (V2A) generation is essential for immersive multimedia experiences, yet its evaluation remains underexplored. Existing benchmarks typically assess diverse audio types under a unified protocol, overlooking the fine-grained requirements of distinct audio categories. To address this gap, we propose VidAudio-Bench, a multi-task benchmark for V2A evaluation with four key features: (1) Broad Coverage: It encompasses four representative a

Published 14 Apr 2026
Read full paper → ← Back to Reads