VVS: Accelerating Speculative Decoding for Visual Autoregressive Generation via Partial Verification Skipping

📰 ArXiv cs.AI

arXiv:2511.13587v2 Announce Type: replace-cross Abstract: Visual autoregressive (AR) generation models have demonstrated strong potential for image generation, yet their next-token-prediction paradigm introduces considerable inference latency. Although speculative decoding (SD) has been proven effective for accelerating visual AR models, its "draft one step, then verify one step" paradigm prevents a direct reduction in the number of forward passes, limiting its acceleration potential. Motivated

Published 25 Apr 2026
Read full paper → ← Back to Reads