OpenAI GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models

Name: OpenAI GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
Uploaded: 2021-12-29T02:31:56Z
Channel: Aleksa Gordić - The AI Epiphany

Aleksa Gordić - The AI Epiphany · Beginner · 🎨 Image & Video AI · 4y ago

Skills: Multimodal LLMs 90% · Fine-tuning LLMs 80% · CV Basics 70% · Modern CV Models 70% · Generative CV 70%

❤️ Become The AI Epiphany Patreon ❤️ https://www.patreon.com/theaiepiphany
👨‍👩‍👧‍👦 Join our Discord community 👨‍👩‍👧‍👦 https://discord.gg/peBrCpheKE

In this video I cover a new paper from OpenAI - "GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models" where they combine diffusion models with transformers to outperform their older DALL-E model.

▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
✅ GLIDE paper: https://arxiv.org/abs/2112.10741
✅ GLIDE code: https://github.com/openai/glide-text2im

Learning about diffusion models

Papers:
✅ Seminal (2015): https://arxiv.org/pdf/1503.03585.pdf
✅ DDPM (2020): https://arxiv.org/pdf/2006.11239.pdf
✅ OpenAI (1): https://arxiv.org/pdf/2102.09672.pdf
✅ OpenAI (2): https://arxiv.org/pdf/2105.05233.pdf

Blogs:
✅ Score-based models: https://yang-song.github.io/blog/2021/score/
✅ Diffusion models: https://lilianweng.github.io/lil-log/2021/07/11/diffusion-models.html

▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
⌚️ Timetable:
00:00 Intro to GLIDE - results
04:00 Intro to diffusion models
07:10 Inpainting and other awesome results
11:05 Diffusion models in depth
20:45 VAE inspired loss
31:30 GLIDE pipeline (diffusion + transformers)
34:15 Guided diffusion
38:00 Classifier-free guidance
42:25 CLIP guidance
45:25 Comparison with other models
48:30 Safety considerations
49:25 Failure cases
51:40 Outro

▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
💰 BECOME A PATREON OF THE AI EPIPHANY ❤️

If these videos, GitHub projects, and blogs help you, consider helping me out by supporting me on Patreon!

The AI Epiphany - https://www.patreon.com/theaiepiphany
One-time donation - https://www.paypal.com/paypalme/theaiepiphany

Huge thank you to these AI Epiphany patreons:
Eli Mahler
Kulsoom Abdullah
Petar Veličković

▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
💼 LinkedIn - https://www.linkedin.com/in/aleksagordic/
🐦 Twitter - https://twitter.com/gordic_aleksa
👨‍👩‍👧‍👦 Discord - https://discord.gg/peBrCpheKE
📺 YouTube - https://www.youtube.com/c/TheAIEpiphany/
📚 M
