VVS: Accelerating Speculative Decoding for Visual Autoregressive Generation via Partial Verification Skipping
📰 ArXiv cs.AI
arXiv:2511.13587v2 Announce Type: replace-cross Abstract: Visual autoregressive (AR) generation models have demonstrated strong potential for image generation, yet their next-token-prediction paradigm introduces considerable inference latency. Although speculative decoding (SD) has been proven effective for accelerating visual AR models, its "draft one step, then verify one step" paradigm prevents a direct reduction in the number of forward passes, limiting its acceleration potential. Motivated
DeepCamp AI