SAME: A Semantically-Aligned Music Autoencoder

arXiv cs.AI

arXiv:2605.18613v1. Announce Type: cross.

Abstract: Latent representations are at the heart of most modern generative models. In the audio domain, they are typically produced by a neural-audio-codec autoencoder. In this work, we introduce SAME (Semantically-Aligned Music autoEncoder), an autoencoder for stereo music and general audio that reaches a 4096$\times$ temporal compression ratio while maintaining reconstruction quality and downstream generative performance. We achieve this by com

Published 19 May 2026
