Interpreting Video Representations with Spatio-Temporal Sparse Autoencoders

📰 ArXiv cs.AI

arXiv:2604.03919v1 Announce Type: cross Abstract: We present the first systematic study of Sparse Autoencoders (SAEs) on video representations. Standard SAEs decompose video into interpretable, monosemantic features but destroy temporal coherence: hard TopK selection produces unstable feature assignments across frames, reducing autocorrelation by 36%. We propose spatio-temporal contrastive objectives and Matryoshka hierarchical grouping that recover and even exceed raw temporal coherence. The co

Published 7 Apr 2026

Read full paper → ← Back to News