How I Understand Diffusion Models

Jia-Bin Huang · Beginner ·🎨 Image & Video AI ·2y ago

About this lesson

Diffusion models are powerful generative models that enable many successful applications like image, video, and 3D generation from texts. In this tutorial, I share my understanding of the diffusion model basics, including training, guidance, resolution, and speed. Below are some other great resources to learn more about diffusion models. ===== Slides ===== Here are the slides used in this video Training: https://bit.ly/3WudEPH Guidance: https://bit.ly/3wedCky Resolution: https://bit.ly/4bqxHmo Speed: https://bit.ly/4bpJzoJ ===== Tutorials ===== [CVPR 2022 Tutorial] Denoising Diffusion-based Generative Modeling: Foundations and Applications https://cvpr2022-tutorial-diffusion-models.github.io/ [CVPR 2023 Tutorial] Denoising Diffusion Models: A Generative Learning Big Bang https://cvpr2023-tutorial-diffusion-models.github.io/ [A short course by DeepLearning.AI] How Diffusion Models Work https://www.youtube.com/watch?v=obdVesVsGQI ===== Training ===== [Sohl-Dickstein et al. 2015] Deep Unsupervised Learning using Nonequilibrium Thermodynamics https://arxiv.org/abs/1503.03585 [Ho et al. 2020]: Denoising Diffusion Probabilistic Models https://arxiv.org/abs/2006.11239 [Luo 2022] Understanding Diffusion Models: A Unified Perspective https://arxiv.org/abs/2208.11970 [Karras et al. 2022] Elucidating the design space of diffusion-based generative models https://arxiv.org/abs/2206.00364 [Karras et al. 2023] Analyzing and Improving the Training Dynamics of Diffusion Models https://arxiv.org/abs/2312.02696 ===== Guidance ===== [Dhariwal and Nichol 2021] Diffusion Models Beat GANs on Image Synthesis https://arxiv.org/abs/2105.05233 [Ho and Salimans 2022] Classifier-Free Diffusion Guidance https://arxiv.org/abs/2207.12598 [Sander Dieleman 2022] Guidance: a cheat code for diffusion models https://sander.ai/2022/05/26/guidance.html [Sander Dieleman 2023] The geometry of diffusion guidance https://sander.ai/2023/08/28/geometry.html ===== Resolution ===== [Ho et al.

Original Description

Diffusion models are powerful generative models that enable many successful applications like image, video, and 3D generation from texts. In this tutorial, I share my understanding of the diffusion model basics, including training, guidance, resolution, and speed. Below are some other great resources to learn more about diffusion models. ===== Slides ===== Here are the slides used in this video Training: https://bit.ly/3WudEPH Guidance: https://bit.ly/3wedCky Resolution: https://bit.ly/4bqxHmo Speed: https://bit.ly/4bpJzoJ ===== Tutorials ===== [CVPR 2022 Tutorial] Denoising Diffusion-based Generative Modeling: Foundations and Applications https://cvpr2022-tutorial-diffusion-models.github.io/ [CVPR 2023 Tutorial] Denoising Diffusion Models: A Generative Learning Big Bang https://cvpr2023-tutorial-diffusion-models.github.io/ [A short course by DeepLearning.AI] How Diffusion Models Work https://www.youtube.com/watch?v=obdVesVsGQI ===== Training ===== [Sohl-Dickstein et al. 2015] Deep Unsupervised Learning using Nonequilibrium Thermodynamics https://arxiv.org/abs/1503.03585 [Ho et al. 2020]: Denoising Diffusion Probabilistic Models https://arxiv.org/abs/2006.11239 [Luo 2022] Understanding Diffusion Models: A Unified Perspective https://arxiv.org/abs/2208.11970 [Karras et al. 2022] Elucidating the design space of diffusion-based generative models https://arxiv.org/abs/2206.00364 [Karras et al. 2023] Analyzing and Improving the Training Dynamics of Diffusion Models https://arxiv.org/abs/2312.02696 ===== Guidance ===== [Dhariwal and Nichol 2021] Diffusion Models Beat GANs on Image Synthesis https://arxiv.org/abs/2105.05233 [Ho and Salimans 2022] Classifier-Free Diffusion Guidance https://arxiv.org/abs/2207.12598 [Sander Dieleman 2022] Guidance: a cheat code for diffusion models https://sander.ai/2022/05/26/guidance.html [Sander Dieleman 2023] The geometry of diffusion guidance https://sander.ai/2023/08/28/geometry.html ===== Resolution ===== [Ho et al.
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

FREE AI Sin City Photo Generator — Turn Any Photo Into High-Contrast Noir Art (2026)
Transform any photo into a Sin City-inspired high-contrast noir art using a free AI generator
Dev.to AI
Google makes Gemini’s personalized image generation free for all US users
Google's Gemini personalized image generation is now free for all US users, allowing them to generate images informed by their Google data
The Next Web AI
Gemini’s personalized AI image generation is now free for U.S. users
Gemini's AI image generation is now free for U.S. users, allowing for personalized images based on user interests and data
TechCrunch AI
WebP's Compression Secret: How a 1MB PNG Becomes a 200KB WebP
Learn how WebP compresses images more efficiently than PNG and JPEG, and why it matters for web development
Dev.to · swift king
Up next
OpenAI Kills Sora then Descends into Chaos
ColdFusion
Watch →