DisCa: Accelerating Video Diffusion Transformers with Distillation-Compatible Learnable Feature Caching

📰 ArXiv cs.AI

arXiv:2602.05449v3 Announce Type: replace-cross Abstract: While diffusion models have achieved great success in the field of video generation, this progress is accompanied by a rapidly escalating computational burden. Among the existing acceleration methods, Feature Caching is popular due to its training-free property and considerable speedup performance, but it inevitably faces semantic and detail drop with further compression. Another widely adopted method, training-aware step-distillation, th

Published 21 Apr 2026

Read full paper → ← Back to Reads