Interpreting Video Representations with Spatio-Temporal Sparse Autoencoders
📰 ArXiv cs.AI
arXiv:2604.03919v1 Announce Type: cross Abstract: We present the first systematic study of Sparse Autoencoders (SAEs) on video representations. Standard SAEs decompose video into interpretable, monosemantic features but destroy temporal coherence: hard TopK selection produces unstable feature assignments across frames, reducing autocorrelation by 36%. We propose spatio-temporal contrastive objectives and Matryoshka hierarchical grouping that recover and even exceed raw temporal coherence. The co
DeepCamp AI