DisCa: Accelerating Video Diffusion Transformers with Distillation-Compatible Learnable Feature Caching
📰 ArXiv cs.AI
arXiv:2602.05449v3 Announce Type: replace-cross Abstract: While diffusion models have achieved great success in the field of video generation, this progress is accompanied by a rapidly escalating computational burden. Among the existing acceleration methods, Feature Caching is popular due to its training-free property and considerable speedup performance, but it inevitably faces semantic and detail drop with further compression. Another widely adopted method, training-aware step-distillation, th
DeepCamp AI