Stable diffusion 2.0 Released

Sebastian Kamph · Beginner ·🎨 Image & Video AI ·3y ago

Skills: Multimodal LLMs 90% · Prompt Craft 80% · CV Basics 70% · Modern CV Models 70%

Key Takeaways

The video discusses the release of Stable Diffusion 2.0, a text-to-image model from Stability AI with improved image quality and a native 768 by 768 resolution, plus new features such as a 4x super-resolution upscaler diffusion model and a depth-to-image model. The model is trained on a filtered dataset and optimized to run on a single GPU. The presenter has heard of lower VRAM requirements but has not tested this himself.

Full Transcript

Hello friends, so great news: the king is back. Stable Diffusion 2.0 is out, and that means no "super Stable Diffusion 2.0", no fake 2.0, but the real, official 2.0 of Stable Diffusion, made by Stability AI. So let's check that out. Oh, and if you don't care about Stable Diffusion 2.0 and just came here for a joke, I got you: five ants rented an apartment with five other ants. They're now tenants.

All right, so we're here at Stability AI's blog, and we have here the Stable Diffusion 2.0 release. This is great news, everyone. We had a 1.4, we had a 1.5, and now we've made a jump to 2.0. This is Stability AI, which Emad is the face of. "It is our pleasure to announce the open source release of Stable Diffusion version 2. The original Stable Diffusion version 1 release, led by CompVis, changed the nature of open source AI models and spawned hundreds of other models and innovations all over the world. It had one of the fastest climbs to 10,000 GitHub stars of any software, rocketing through 33,000 stars in less than two months." So that was just some history. Here's the team working on it.

"Stable Diffusion 2.0 delivers a number of big improvements and features versus the original version 1 release, so let's dive in and take a look at them." Yeah, that's exactly what we're going to do. Here's an example image.

So we have new text-to-image diffusion models, and right off the bat, what this means is that you're not going to get a new checkpoint file that you can just download from Hugging Face, like what happened with 1.5. With a new diffusion model, every user interface is going to need to make some changes. So at the point of recording this, it's not possible to put this into AUTOMATIC1111 or whatever user interface you use right now. But when you're seeing this, maybe, because there have already been pull requests to update some of the most popular ones. So give it a day or two and we're probably going to be well updated.

It says here: "The Stable Diffusion 2.0 release includes robust text-to-image models trained using a brand new text encoder developed by LAION with support from Stability AI, which greatly improves the quality of the generated images compared to earlier version 1 releases. The text-to-image models in this release can generate images with default resolutions of both 512 by 512 pixels and 768 by 768." So this is great news. Now you have a native higher resolution. I mean, of course you could do a high resolution previously, but doing it natively means that the model has been trained, or fine-tuned really, on images that are 768 by 768. You can start with higher resolution images and then move upwards from there.

"These models are trained on an aesthetic subset of the LAION-5B dataset created by the DeepFloyd team at Stability AI, which is then further filtered to remove adult content using LAION's NSFW filter." So this is good news for some and bad news for some. You're going to have a not-safe-for-work filter, which is great for professional use and just general family friendliness. For some it's going to be a limitation, but I'm sure there are going to be workarounds for people that need that. I think generally, for professional use, which is what I mostly use the AI for, this is actually a good feature, because there's been a limit on what you can do, especially in a professional environment. For example, as a YouTuber streaming, you can't live render anything, because anything can pop up and I'm just going to get banned. That's not great.

Okay, here are some examples of images produced in the native 768 by 768.

Another new feature: super-resolution upscaler diffusion models. "Stable Diffusion 2.0 also includes an upscaler diffusion model that enhances the resolution of images by a factor of four," and here's an example of that. The model is upscaling an image that is 128 by 128 into the higher resolution of 512 by 512. They say here that, combined with the text-to-image models, which can get the images up to, like we talked about earlier, 768, it can now generate images with resolutions of 2048 by 2048, or even higher. I think this is a solid number until it starts losing a lot of detail. This is kind of cool.

The depth-to-image diffusion model. Now, Deforum has been working a little bit with depth mapping and MiDaS, but it hasn't been that widely used in regular Stable Diffusion models previously. Now they have implemented that, and they call it depth-to-image. Basically, you can have a depth map, and this can be an example of a depth map, and then your text-to-image or depth-to-image generations will base their results on that. As you can see, all the results here are based upon this depth map. Think of it as image-to-image, but more advanced, really. It says: "Depth-to-image can offer all sorts of new creative applications, delivering transformations that look radically different from the original but which still preserve the coherence and depth of that image." You can see the white image here, the one that pops out. That's the depth image, and then it generates out from that.

So that's really cool, and that's going to help especially in professional use. Everyone says, "Oh, Midjourney V4, that's so great." Yeah, you can get great images, but it's so very limited. You need to be able to control the AI generation, not 100 percent, but close to it, if you're going to use this in a professional environment. The demands and specs are so specific that you just can't deliver "okay, here's the beautiful image, it's done". I mean, it works with Facebook and Instagram and stuff like that, and some uses, but most of the time you need to be super specific. Stable Diffusion is the king of that and has never been dethroned in that regard.

An updated inpainting diffusion model. Now, inpainting was improved a little bit in 1.5, and I hope to see a much bigger improvement here in 2.0. Inpainting has been improved, but it has been one of the weaker aspects of Stable Diffusion so far, and that's also taking professional use into account. Inpainting is an extremely powerful tool. Let's say you're working with a composition or something that you have previously, like a folder from a photo shoot. You can just quickly inpaint a little bit. That's going to transform the whole business.

"And again, just like the first iteration of Stable Diffusion, we worked hard to optimize the model to run on a single GPU." As I've understood it, it doesn't say so specifically, but I've heard talk about lower VRAM requirements all around. We'll see about that. I haven't tested it thoroughly yet. "We wanted to make it accessible to as many people as possible from the very start. We've already seen that when millions of people get their hands on these models, they collectively create some truly amazing things. This is the power of open source: tapping the vast potential of millions of talented people who might not have the resources to train a state-of-the-art model, but who have the ability to do something incredible with one. This new release, along with its powerful new features like depth-to-image and higher resolution upscaling capabilities, will serve as the foundation of countless applications and enable an explosion of new creative potential." Well, yeah, I don't doubt that.

Okay, so for more details, here's the GitHub link. I'm going to put all the links in the description below, so check that out. But remember, as of this recording, you can't just download the checkpoint and start working. Give it a few hours and most tools, I assume, will be updated for this, because this is huge news. And again, this is the real, official 2.0. Now, if you are a programmer or know coding very well, you can get this to work. There's information on GitHub on how to manually start it up.

We're quickly going to dive into the GitHub as well. So here's Stable Diffusion 2.0 again, and here are some example images. "This repo contains Stable Diffusion models trained from scratch." This is important: they have been trained from scratch, and hopefully with experience and knowledge from the previous ones. So this is again a huge update, and it "will be continuously updated with new checkpoints". You're going to have the new 512 by 512 model, and you're going to have the fine-tuned 768 by 768 model, so you can choose which one you want to use.

Some of the news here: we talked about the new upscaling and the new depth map, and we saw some examples of that. Here's another one. Here's a text-guided... okay, that was the inpainting model. We saw that previously.

Basically, if you know what you're doing, you can update an existing latent diffusion environment, and you're going to need to run conda and install diffusers here. xformers, again, is available, and you should use it. It's going to lower your VRAM requirements by a lot and speed up the renders as well. Then you just need to run the compile step here.

This is just a graph comparing the different models. The new 2.0 is the blue line here. I'm not going to delve too much into that. Some text-to-image examples, which you can check out by yourself. I'm not going to delve into this. Most of this was in the blog post already, but again, I'm going to post links down below.

So there you have it: Stable Diffusion 2.0. To summarize, what you have is a brand new latent space trained on larger images, so you're going to get native high resolution at 768 by 768, better compositions, a new upscaler, depth-to-image as a new feature, the not-safe-for-work filter, all that good jazz. So thanks for tuning in. I hope this will be available when you watch the video, and if not, well, just wait a few hours and I'm sure it will be updated in most interfaces. Have a good one. See ya.
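The resolution figures quoted from the blog post can be checked with a few lines of arithmetic. The sketch below is not Stability AI's code and does not run any model. `upscaled_size` is a hypothetical helper, and the only thing it assumes is what the announcement states: the upscaler multiplies each side of the image by four.

```python
from typing import Tuple


def upscaled_size(width: int, height: int, factor: int = 4) -> Tuple[int, int]:
    """Return the output size when each side is multiplied by `factor`.

    Hypothetical helper for illustration only; the default factor of 4
    is the figure quoted in the release announcement.
    """
    if width <= 0 or height <= 0 or factor <= 0:
        raise ValueError("width, height and factor must be positive")
    return width * factor, height * factor


# Example from the blog post: a 128x128 input becomes 512x512.
low_res = upscaled_size(128, 128)

# A 512x512 generation reaches the 2048x2048 figure mentioned in the video.
from_512 = upscaled_size(512, 512)

# Starting from the native 768x768 model gives the "or even higher" case.
from_768 = upscaled_size(768, 768)

print(low_res, from_512, from_768)
```

Because both sides grow by the same factor, the pixel count grows by the factor squared, so a 4x upscale produces 16 times as many pixels as the input.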

Original Description

Stable diffusion 2.0 has been released by StabilityAI. Let's check it out.
https://stability.ai/blog/stable-diffusion-v2-release
https://github.com/Stability-AI/stablediffusion
Support me on Patreon to get access to unique perks! https://www.patreon.com/sebastiankamph
Ultimate Stable diffusion guide: https://youtu.be/DHaL56P6f5M
Ultimate Animation guide in Stable diffusion: https://youtu.be/lztn6qLc9UE
How to fix live render preview: https://youtu.be/_4rY0oPbUYA
CHAPTERS
0:00 Introduction
0:20 Dadjoke
0:33 Stable diffusion 2.0 Release
9:46 Stable diffusion 2.0 Github
11:32 Closing words & summary



The video introduces Stable Diffusion 2.0, a text-to-image synthesis model that generates higher-quality images at improved resolution, with features such as a super-resolution upscaler diffusion model and a depth-to-image model. The model is trained on a filtered dataset and optimized to run on a single GPU, reportedly with lower VRAM requirements. By watching this video, viewers can learn how to craft effective text prompts, use prompt engineering techniques, and apply computer vision techniques to image generation.

Key Takeaways

Install Stable Diffusion 2.0
Load the pre-trained model
Craft effective text prompts
Use the super resolution upscaler diffusion model to enhance image resolution
Apply the depth to image model for advanced image generation
Fine-tune the model for specific use cases
Use the text-guided inpainting model to edit images

💡 The key insight of this video is that Stable Diffusion 2.0 is a powerful text-to-image synthesis model that can generate high-quality images with improved resolution and features such as super resolution upscaler diffusion model and depth to image model.


Chapters (5)

0:00 Introduction

0:20 Dadjoke

0:33 Stable diffusion 2.0 Release

9:46 Stable diffusion 2.0 Github

11:32 Closing words & summary
