Channel-wise Vector Quantization

📰 ArXiv cs.AI

arXiv:2605.26089v1 (announce type: cross)

Abstract: We present Channel-wise Vector Quantization (CVQ), a novel image tokenization paradigm that replaces patch-wise tokens with channel-wise tokens. Unlike conventional vector quantization, which assigns a discrete token to each patch feature vector, CVQ quantizes each channel of the feature map. This formulation represents an image as discrete levels of visual details, rather than as a grid of spatial patches. Based on CVQ, we introduce a new visual
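
The contrast the abstract draws between patch-wise and channel-wise tokens can be illustrated with a minimal NumPy sketch. It is not the paper's implementation. The excerpt does not say how a channel is represented before lookup, how large the codebooks are, or how they are trained, so everything here is an assumption for illustration: the function names (`nearest_code`, `patch_wise_tokens`, `channel_wise_tokens`), the random codebooks, the choice to flatten each channel into an `H*W` vector, and the nearest-neighbour lookup under squared Euclidean distance.

```python
import numpy as np


def nearest_code(vectors, codebook):
    """Index of the nearest codebook row for each row of `vectors`."""
    # Squared Euclidean distances, shape (num_vectors, num_codes).
    dists = ((vectors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=-1)
    return dists.argmin(axis=1)


def patch_wise_tokens(feat, codebook):
    """Conventional VQ: one token per spatial position.

    feat: (C, H, W) feature map; codebook: (K, C).
    Returns an (H, W) grid of token indices.
    """
    C, H, W = feat.shape
    vectors = feat.reshape(C, H * W).T  # one C-dim vector per position
    return nearest_code(vectors, codebook).reshape(H, W)


def channel_wise_tokens(feat, codebook):
    """Channel-wise VQ as assumed here: one token per channel.

    feat: (C, H, W) feature map; codebook: (K, H*W).
    Returns C token indices, one per channel.
    """
    C, H, W = feat.shape
    vectors = feat.reshape(C, H * W)  # one (H*W)-dim vector per channel
    return nearest_code(vectors, codebook)


# Hypothetical sizes and random data, chosen only to show the shapes.
rng = np.random.default_rng(0)
C, H, W, K = 8, 4, 4, 16
feat = rng.normal(size=(C, H, W))
patch_codebook = rng.normal(size=(K, C))
channel_codebook = rng.normal(size=(K, H * W))

patch_tokens = patch_wise_tokens(feat, patch_codebook)
channel_tokens = channel_wise_tokens(feat, channel_codebook)
print(patch_tokens.shape, channel_tokens.shape)  # (4, 4) (8,)
```

Under these assumptions the patch-wise tokenizer produces a token count tied to spatial resolution (`H*W`), while the channel-wise tokenizer produces a count tied to the number of channels (`C`), which is one way to read the abstract's description of an image as a set of levels rather than a spatial grid.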

Published 26 May 2026