Stable diffusion 2.0 Released
Key Takeaways
The video walks through the release of Stable Diffusion 2.0, a text-to-image model from Stability AI. The release brings improved image quality, a native 768 by 768 resolution alongside 512 by 512, a super-resolution upscaler diffusion model (4x), a depth-to-image model, and an updated inpainting model. The models are trained on a filtered dataset and optimized to run on a single GPU; the presenter has heard of lower VRAM requirements but had not tested this at the time of recording.
Full Transcript
Hello friends, so great news: the king is back. Stable Diffusion 2.0 is out, and that means no Super Stable Diffusion 2.0, no fake 2.0, the real official 2.0 of Stable Diffusion made by Stability AI. So let's check that out. Oh, and if you don't care about Stable Diffusion 2.0 and just came here for a joke, I got you: five ants rented an apartment with five other ants. They're now tenants.

All right, so we're here at Stability AI's blog and we have here the Stable Diffusion 2.0 release. This is great news, everyone. We had a 1.4, we had a 1.5, and now we've made a jump to 2.0. And this is Stability AI, which Emad is the face of. "It is our pleasure to announce the open source release of Stable Diffusion version 2. The original Stable Diffusion version 1, led by CompVis, changed the nature of open source AI models and spawned hundreds of other models and innovations all over the world. It had one of the fastest climbs to 10,000 GitHub stars of any software, rocketing through 33,000 stars in less than two months." So that was just some history. Here's the team working on it. "Stable Diffusion 2.0 delivers a number of big improvements and features versus the original version 1 release, so let's dive in and take a look at them." Yeah, that's exactly what we're going to do.

Here's an example image. So we have new text-to-image diffusion models, and right off the bat, what this means is you're not going to get a new checkpoint file that you can just download from Hugging Face, like what happened in 1.5. With a new diffusion model, every user interface is going to need to make some changes. So at the point of recording this, it's not possible to put this into Automatic1111 or whatever user interface you use right now. But when you're seeing this, maybe, because there have already been pull requests to update some of the most popular ones. So give it a day or two and we're probably going to be well updated.

It says here that the Stable Diffusion 2.0 release includes robust text-to-image models trained using a brand new text encoder, developed by LAION with support from Stability AI, which greatly improves the quality of the generated images compared to earlier version 1 releases. The text-to-image models in this release can generate images with default resolutions of both 512 by 512 pixels and 768 by 768. So this is great news. Now you have a native higher resolution. I mean, of course you could do a high resolution previously, but doing it natively means that the model has been trained, or fine-tuned really, on images that are 768 by 768. You can start with higher resolution images and then move upwards from there.

These models are trained on an aesthetic subset of the LAION-5B dataset created by the DeepFloyd team at Stability AI, which is then further filtered to remove adult content using LAION's NSFW filter. So this is good news for some and bad news for some. You're going to have a not-safe-for-work filter, which is great for professional use and just general family friendliness. For some it's going to be a limitation, but I'm sure there are going to be workarounds for people that need that. I think generally for professional use, which is what I mostly use the AI for, this is actually a good feature, because there's been some sort of limitation on what you can do, especially in a professional environment. For example, as a YouTuber streaming, you can't live render anything, because anything can pop up and I'm just going to get banned. That's not great. Here are some examples of images produced in the native 768 by 768.

Another new feature: super-resolution upscaler diffusion models. So Stable Diffusion 2.0 also includes an upscaler diffusion model that enhances the resolution of images by a factor of four, and here's an example of that. The model is upscaling an image that is 128 by 128 into the higher resolution of 512 by 512.
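As a quick check of the numbers quoted here, the factor of four applies to each side of the image, so the pixel count grows by sixteen. The sketch below is only arithmetic; the function names are hypothetical and nothing in it calls the actual upscaler model.

```python
def upscaled_size(width: int, height: int, factor: int = 4) -> tuple[int, int]:
    """Return the output size when each side is multiplied by `factor`."""
    return width * factor, height * factor


def pixel_ratio(factor: int = 4) -> int:
    """Return how many times more pixels the upscaled image contains."""
    return factor * factor


if __name__ == "__main__":
    # 128x128 is the example from the blog post; 512x512 is a default
    # text-to-image resolution mentioned in the transcript.
    for side in (128, 512):
        out_w, out_h = upscaled_size(side, side)
        print(f"{side}x{side} -> {out_w}x{out_h}")
    print(f"pixel count grows by a factor of {pixel_ratio()}")
```

Starting from a 512 by 512 image, a single 4x pass gives the 2048 by 2048 figure quoted in the announcement.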
So they say here that, combined with the text-to-image models, which can get images up to 768 like we talked about earlier, it can now generate images at 2048 by 2048, or even higher. I think this is a solid number until it starts, you know, losing a lot of detail. This is kind of cool.

The depth-to-image diffusion model. Now, Deforum has been working a little bit with depth mapping and MiDaS, but it hasn't been that widely used in regular Stable Diffusion models previously. Now they have implemented that, and they call it depth-to-image. Basically, you can have a depth map, and this here can be an example of a depth map, and then your text-to-image or depth-to-image generations will base their results on that. As you can see, all the results here are based upon this depth map. Think of it as image-to-image, but more advanced, really. And it says depth-to-image can offer all sorts of new creative applications, delivering transformations that look radically different from the original but which still preserve the coherence and depth of that image. You can see the white image here, the one that pops out. That's the depth image, and then it generates out from that.

So that's really cool, and that's going to help especially in professional use. Everyone says, oh, Midjourney V4, that's so great. Yeah, you can get great images, but it's so very limited. You need to be able to control the AI generation, not 100 percent, but close to it, if you're going to use this in a professional environment. The demands and specs are so specific that you just can't deliver "okay, here's the beautiful image" and say it's done. I mean, it works for Facebook and Instagram and stuff like that, and some uses, but most of the time you need to be super specific. Stable Diffusion is the king of that and has never been dethroned in that regard.

An updated inpainting diffusion model. Now, inpainting was improved a little bit in 1.5, and I hope to see much better improvement here in 2.0. Inpainting has been improved, but it has been one of the weaker aspects of Stable Diffusion so far, and that's also taking professional use into account. Inpainting is an extremely powerful tool. Let's say you're working with a composition or something that you had previously, like a folder from a photo shoot. You can just quickly inpaint a little bit. That's going to transform the whole business.

"And again, just like the first iteration of Stable Diffusion, we worked hard to optimize the model to run on a single GPU." So as I've understood it, it doesn't say so specifically, but I've heard talk about lower VRAM requirements all around. We'll see about that. I haven't tested it thoroughly yet. "We wanted to make it accessible to as many people as possible from the very start. We've already seen that when millions of people get their hands on these models, they collectively create some truly amazing things. This is the power of open source: tapping the vast potential of millions of talented people who might not have the resources to train a state-of-the-art model, but who have the ability to do something incredible with one. This new release, along with its powerful new features like depth-to-image and higher resolution upscaling capabilities, will serve as the foundation of countless applications and enable an explosion of new creative potential." Well, yeah, I don't doubt that.

Okay, so for more details, here's the GitHub link. I'm going to put all the links in the description below, so check that out. But remember, as of this recording, you can't just download the checkpoint and start working. Give it a few hours and most tools, I assume, will be updated for this, because this is huge news. And again, this is the real official 2.0. Now, if you are a programmer or know coding very well, you can get this to work. There's information on GitHub on how to manually start it up.

We're quickly going to dive into the GitHub as well. So here's Stable Diffusion 2.0 again, and here are some example images. "This repo contains Stable Diffusion models trained from scratch," and this is important: they have been trained from scratch, hopefully with experience and knowledge from the previous ones. So this is again a huge update, and it will be continuously updated with new checkpoints. You're going to have the new 512 by 512 model and you're going to have the fine-tuned 768 by 768 model, so you can choose which one you want to use. Some of the news here: we talked about the new upscaling and the new depth map, and we saw some examples of that. Here's another one. Here's a text-guided, okay, that was the inpainting model. Yeah, we saw that previously.

So basically, if you know what you're doing, you can update an existing latent diffusion environment. You're going to need to run conda and install diffusers here. xformers, again, is available, and you should use it. It's going to lower your VRAM requirements by a lot and speed up the renders as well. Then you just need to run the compiling step here. This is just a graph comparing the different models, so the new 2.0 is the blue line here. I'm not going to delve too much into that. Some text-to-image examples. You can check this out by yourself. I'm not going to delve into this; most of this was in the blog post already, but again, I'm going to post links down below.

So there you have it: Stable Diffusion 2.0. To summarize, what you have is a brand new latent space trained on larger images, so you're going to get native high resolution at 768 by 768, better compositions, a new upscaler, depth-to-image as a new feature, the not-safe-for-work filter, all that good jazz. So thanks for tuning in. I hope this will be available when you watch the video, and if not, just wait a few hours and I'm sure it will be updated in most interfaces. Have a good one. See ya.
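The depth-to-image feature described in the transcript conditions generation on a depth map, which is a grid holding one distance value per pixel. The following is a minimal sketch of one common preprocessing step, rescaling raw depth values into the range [-1, 1] so that maps from different scenes are comparable. The function name and the toy values are hypothetical, and whether the released model preprocesses depth in exactly this way is an assumption, not something the video or blog post states.

```python
def normalize_depth(depth: list[list[float]]) -> list[list[float]]:
    """Rescale a 2D depth map linearly so its values span [-1, 1].

    Hypothetical helper for illustration only; it is not taken from the
    Stable Diffusion repository.
    """
    flat = [value for row in depth for value in row]
    lowest, highest = min(flat), max(flat)
    if highest == lowest:
        # A completely flat map carries no depth information; map it to 0.
        return [[0.0 for _ in row] for row in depth]
    span = highest - lowest
    return [[(value - lowest) / span * 2.0 - 1.0 for value in row] for row in depth]


if __name__ == "__main__":
    # Toy 2x2 depth map; the raw units are arbitrary.
    toy_depth = [[0.0, 5.0], [10.0, 5.0]]
    for row in normalize_depth(toy_depth):
        print(row)
```

Only the relative ordering of near and far pixels survives this rescaling, which fits the transcript's point that results keep the depth and coherence of the original while everything else can change.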
Original Description
Stable diffusion 2.0 has been released by StabilityAI. Let's check it out.
https://stability.ai/blog/stable-diffusion-v2-release
https://github.com/Stability-AI/stablediffusion
Support me on Patreon to get access to unique perks!
https://www.patreon.com/sebastiankamph
Ultimate Stable diffusion guide:
https://youtu.be/DHaL56P6f5M
Ultimate Animation guide in Stable diffusion:
https://youtu.be/lztn6qLc9UE
How to fix live render preview:
https://youtu.be/_4rY0oPbUYA
CHAPTERS
0:00 Introduction
0:20 Dadjoke
0:33 Stable diffusion 2.0 Release
9:46 Stable diffusion 2.0 Github
11:32 Closing words & summary