Stable Diffusion vs Flux vs OmniGen vs SANA
A comparison between different free Text-to-Image models running on a local machine, covering Stable Diffusion (1.5, SDXL, 3.5), Flux.1 (Schnell, Dev), Omnigen and the new, super-fast SANA-model published by NVIDIA, rating their image-quality, prompt-adherence, speed and VRAM-requirements.
I've tried to make the comparisons as fair and unbiased as possible, using the Google-Parti-Prompts method. So I created 1,391 images (107 per model) in 11 challenges and 12 categories and rated their quality. Still, it's a personal view but I will leave a link where you can download the rating sheets with …
Watch on YouTube ↗
(saves to browser)
Chapters (19)
Intro, Models
1:54
Prompts & Benchmarks (Google Parti Prompts)
2:24
Hardware, Rating-process, Rating methods
4:19
Stable Diffusion 1.5 - Juggernaut 1.5
5:20
SDXL - Juggernaut XL
6:00
SDXL-LCM - SDXL_Vanilla
6:42
SDXL-Lightning - Juggernaut XL Lightning
7:29
SDXL-Hyper - Boltning Realistic Hyper
8:06
SDXL-Turbo - TurbovisionXL
8:38
Flux.1 Schnell
10:12
Flux.1 Dev
11:19
Stable Diffusion 3.5 Medium
12:19
Stable Diffusion 3.5 Large
13:24
Stable Diffusion 3.5 Large Turbo
14:15
OmniGen
15:24
SANA
16:32
Recommendations
17:26
Download links (Models, Gumroad)
17:45
Outro
DeepCamp AI