Seeking Physics in Diffusion Noise

📰 ArXiv cs.AI

Researchers find that video diffusion models can encode signals predictive of physical plausibility, allowing for partial separation of plausible and implausible videos in feature space

advanced Published 27 Mar 2026

Action Steps

Analyze intermediate denoising representations of a pretrained Diffusion Transformer (DiT)
Probe mid-layer feature space across noise levels to identify separability of physically plausible and implausible videos
Investigate whether separability can be attributed to visual quality or generator identity
Explore recoverable physics-related cues in frozen diffusion models

Who Needs to Know This

AI engineers and researchers working on computer vision and diffusion models can benefit from this study, as it provides insights into the capabilities and limitations of these models

Key Insight

💡 Diffusion models can capture physically plausible signals, even in noise