Two-Dimensional Quantization for Geometry-Aware Audio Coding

📰 ArXiv cs.AI

arXiv:2512.01537v2 Announce Type: replace-cross Abstract: Recent neural audio codecs have achieved impressive reconstruction quality, typically relying on quantization methods such as Residual Vector Quantization (RVQ), Vector Quantization (VQ) and Finite Scalar Quantization (FSQ). However, these quantization techniques limit the geometric structure of the latent space, make it harder to capture correlations between features leading to inefficiency in representation learning, codebook utilizatio

Published 18 May 2026
Read full paper → ← Back to Reads