Applied AI

Image & Video AI

Stable Diffusion, Midjourney, DALL-E, Sora, ControlNet and AI video generation

2,600
lessons
Skills in this topic
View full skill map →
Image Generation Basics
beginner
Generate photorealistic and stylised images with prompts
Advanced Image Generation
intermediate
Use ControlNet for pose/depth-guided generation
AI Video Generation
advanced
Generate a 10-second video clip from a text prompt
All Reads (128) Articles (20)Blog Posts (81)Tutorials (24)News (3)
Dev.to AI 🎨 Image & Video AI ⚡ AI Lesson 2mo ago
How to Create Product Photos with GPT Image 2
If you sell physical products, one of the hardest creative problems is producing enough visuals that are actually usable across product pages, ads, and social.
GPT Image2 vs. Nano Banana2: The New Battle for Visual AI Supremacy
Medium · ChatGPT 🎨 Image & Video AI ⚡ AI Lesson 2mo ago
GPT Image2 vs. Nano Banana2: The New Battle for Visual AI Supremacy
For the past year, Nano Banana2 has quietly built a reputation as the gold standard in AI image generation — especially among designers… Continue reading on Med
Medium · ChatGPT 🎨 Image & Video AI ⚡ AI Lesson 2mo ago
Unlocking the Power of AI-Generated Images: ChatGPT’s Latest Upgrade
The world of artificial intelligence has witnessed significant advancements in recent years, with one of the most notable developments… Continue reading on Medi
When Preprocessing Helps — and When It Hurts: Why Your Image Classification Model’s Accuracy Varies
Medium · Deep Learning 🎨 Image & Video AI ⚡ AI Lesson 2mo ago
When Preprocessing Helps — and When It Hurts: Why Your Image Classification Model’s Accuracy Varies
From 65% to 87% accuracy on CIFAR-10 using Convolutional Neural Networks — and what went wrong along the way. Continue reading on Level Up Coding »
Tired of Sorting Generated Images? I Built a Flask Tool.(For Mac code)
Medium · Python 🎨 Image & Video AI ⚡ AI Lesson 2mo ago
Tired of Sorting Generated Images? I Built a Flask Tool.(For Mac code)
If you’re generating images with ComfyUI, you probably already know the real problem isn’t generation — it’s cleanup. Continue reading on Medium »
How to Create Soft Cinematic Light in Midjourney
Medium · AI 🎨 Image & Video AI ⚡ AI Lesson 2mo ago
How to Create Soft Cinematic Light in Midjourney
Most Midjourney images feel too harsh or artificial, even with great prompts, because the light is wrong. Continue reading on Medium »
Dev.to AI 🎨 Image & Video AI ⚡ AI Lesson 2mo ago
Why Every AI Image Generator Fails at Text (And One That Finally Doesn't)
Why Every AI Image Generator Fails at Text (And One That Finally Doesn't) If you've spent any time with AI image generators, you've probably run into the same f
Dev.to AI 🎨 Image & Video AI ⚡ AI Lesson 2mo ago
ERNIE-Image: A Text-to-Image Model Built for Posters, Comics, and Text-Rich Visual Content
Introduction As text-to-image models continue to evolve, most improvements have focused on visual quality—higher resolution, better textures, and more photoreal
TIFF in 2026: what I learned researching the format nobody uses on the web
Dev.to · Serhii Kalyna 🎨 Image & Video AI ⚡ AI Lesson 2mo ago
TIFF in 2026: what I learned researching the format nobody uses on the web
I'm building a free image converter. One day I looked at my landing page for /tiff-to-webp and...
Denoising
Towards AI 🎨 Image & Video AI ⚡ AI Lesson 2mo ago
Denoising
Author(s): Sefa Bilicier Originally published on Towards AI. Introduction Have you ever taken a photo in low light and noticed those grainy, discolored spots th
ArXiv cs.AI 🎨 Image & Video AI 📄 Paper ⚡ AI Lesson 2mo ago
SANA I2I: A Text Free Flow Matching Framework for Paired Image to Image Translation with a Case Study in Fetal MRI Artifact Reduction
arXiv:2604.00298v1 Announce Type: cross Abstract: We propose SANA-I2I, a text-free high-resolution image-to-image generation framework that extends the SANA fam
ArXiv cs.AI 🎨 Image & Video AI 📄 Paper ⚡ AI Lesson 2mo ago
Science-T2I: Addressing Scientific Illusions in Image Synthesis
arXiv:2504.13129v2 Announce Type: replace-cross Abstract: Current image generation models produce visually compelling but scientifically implausible images, exp
ArXiv cs.AI 🎨 Image & Video AI 📄 Paper ⚡ AI Lesson 3mo ago
MELT: Improve Composed Image Retrieval via the Modification Frequentation-Rarity Balance Network
arXiv:2603.29291v1 Announce Type: cross Abstract: Composed Image Retrieval (CIR) uses a reference image and a modification text as a query to retrieve a target
ArXiv cs.AI 🎨 Image & Video AI 📄 Paper ⚡ AI Lesson 3mo ago
ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks
arXiv:2603.27862v1 Announce Type: cross Abstract: Advances in diffusion, autoregressive, and hybrid models have enabled high-quality image synthesis for tasks s
ArXiv cs.AI 🎨 Image & Video AI 📄 Paper 3mo ago
Image Generation Models: A Technical History
arXiv:2603.07455v2 Announce Type: replace-cross Abstract: Image generation has advanced rapidly over the past decade, yet the literature seems fragmented across
Stop One User From Hogging Your Laravel Queue
Dev.to · Yan Gus 🎨 Image & Video AI 3mo ago
Stop One User From Hogging Your Laravel Queue
The Problem I Ran Into I was building an AI image generation service. Users submit...
How I Built a Chrome Extension to Automate Grok AI Video Generation
Dev.to · Tomson Lee 🎨 Image & Video AI 3mo ago
How I Built a Chrome Extension to Automate Grok AI Video Generation
If you've used Grok AI for image and video generation, you've probably hit the same frustrations I...
Stable Diffusion WebUI vs ComfyUI: Compared
Dev.to · selfhosting.sh 🎨 Image & Video AI 3mo ago
Stable Diffusion WebUI vs ComfyUI: Compared
Quick Verdict For beginners and casual image generation, Stable Diffusion WebUI...
Comparing AI Video Generators at Scale: Latency, Quality, and Cost Tradeoffs
Dev.to · SIKOUTRIS 🎨 Image & Video AI 3mo ago
Comparing AI Video Generators at Scale: Latency, Quality, and Cost Tradeoffs
AI video generation went from a novelty to a legitimate production tool in 2025. Sora, Runway Gen-3,...
Benchmarking AI Image Generators: Building an Automated Visual Quality Pipeline
Dev.to · SIKOUTRIS 🎨 Image & Video AI 3mo ago
Benchmarking AI Image Generators: Building an Automated Visual Quality Pipeline
How do you objectively compare the output of Midjourney, DALL-E 3, Stable Diffusion, and Flux? That...
Prompt to Video App API
Dev.to · shrey vijayvargiya 🎨 Image & Video AI 3mo ago
Prompt to Video App API
Prompt to Video App Building AI video generation app Tags: Remotion, Honojs, Nextjs,...
How People Actually Use AI Image Generation: Data from 4,900+ Users
Dev.to · Dylan HUANG 🎨 Image & Video AI 3mo ago
How People Actually Use AI Image Generation: Data from 4,900+ Users
Real usage data reveals that 75% of AI image generation is editing, not creation. Analysis of user behavior across 54 countries.
LoRA and FT Are Unnecessary: How to Approach Distilled Models
Dev.to · soy 🎨 Image & Video AI 3mo ago
LoRA and FT Are Unnecessary: How to Approach Distilled Models
Introduction Fine-tuning (FT) a distilled model is either ineffective or leads to...
How I Built an AI Image Generation Platform That Reached 48K+ Users
Dev.to · Adib Ghamri 🎨 Image & Video AI 3mo ago
How I Built an AI Image Generation Platform That Reached 48K+ Users
Every developer dreams of building something that takes off. For me, that dream became NanoGenArt —...
How to Copy Any Pose in ComfyUI and Fix AI Skin
Dev.to · Esha Sharma 🎨 Image & Video AI 3mo ago
How to Copy Any Pose in ComfyUI and Fix AI Skin
This is a summarized guide. For the full JSON workflow and download files, check the original...
Hunyuan Video 720p on RTX 3090: Full On-Premise AI Media Pipeline E2E
Dev.to · Jörg Fuchs 🎨 Image & Video AI 3mo ago
Hunyuan Video 720p on RTX 3090: Full On-Premise AI Media Pipeline E2E
Running AI video generation on consumer hardware - here is our full E2E pipeline that generates...
🛠️ I Built a One-Click ComfyUI Setup for RTX 5090 on Windows — No WSL2, No Docker
Dev.to · GeneLab_999 🎨 Image & Video AI 4mo ago
🛠️ I Built a One-Click ComfyUI Setup for RTX 5090 on Windows — No WSL2, No Docker
I bought an RTX 5090. 32GB VRAM. The most powerful consumer GPU on the planet. Then I tried to run...
Stable Diffusion vs Midjourney vs DALL-E 3: AI Image Generation Compared
Dev.to · arenasbob2024-cell 🎨 Image & Video AI 4mo ago
Stable Diffusion vs Midjourney vs DALL-E 3: AI Image Generation Compared
AI image generation has gone from novelty to professional tool in under three years. In 2025, three...
Mini Tip of the Day - Preloading the License into the Docker IRIS Image
Dev.to · InterSystems Developer 🎨 Image & Video AI 4mo ago
Mini Tip of the Day - Preloading the License into the Docker IRIS Image
Who hasn't been developing a beautiful example using a Docker IRIS image and had the image generation...
How to automate OG image generation for every blog post
Dev.to · Custodia-Admin 🎨 Image & Video AI 4mo ago
How to automate OG image generation for every blog post
Generate unique Open Graph images for every blog post automatically using PageBolt API — no design tool needed.
# 🧬 Math as an Organism: Why Generative Art Beats Grinding DSA
Dev.to · Vinay Daggupati 🎨 Image & Video AI 4mo ago
# 🧬 Math as an Organism: Why Generative Art Beats Grinding DSA
There’s something truly magical about watching mathematics come to life. Not in a dusty classroom or...
Tiny Diffusion
Dev.to · Unica2804 🎨 Image & Video AI 4mo ago
Tiny Diffusion
Have you ever wondered how the diffusion model works? I also wondered about it for a long time. It's...
I Tested Seedance 2.0 vs Sora 2 with Identical Prompts — Here Are the Real Results
Dev.to · EvoLink 🎨 Image & Video AI 4mo ago
I Tested Seedance 2.0 vs Sora 2 with Identical Prompts — Here Are the Real Results
Seedance 2.0 vs Sora 2: Real API Tests with Identical Prompts (2026) Most "Seedance vs...
Automate OG Image Generation for Your Website (No Design Skills Needed)
Dev.to · GrabShot 🎨 Image & Video AI 4mo ago
Automate OG Image Generation for Your Website (No Design Skills Needed)
Every time you share a link on Twitter, Slack, or Discord, the preview image matters. A good OG image...
From Static Assets to Dynamic Synthesis: Mastering DALL-E 3 and Vercel AI SDK in Next.js
Dev.to · Programming Central 🎨 Image & Video AI 4mo ago
From Static Assets to Dynamic Synthesis: Mastering DALL-E 3 and Vercel AI SDK in Next.js
Imagine a web application where the visuals aren't pre-baked assets sitting on a CDN, but are...
Cartoon Universe: How to Teach a Diffusion Model Some Manners
Dev.to · Luca Visciola 🎨 Image & Video AI 4mo ago
Cartoon Universe: How to Teach a Diffusion Model Some Manners
Let’s clarify something before we begin. Cartoon Universe is not a model. It is not magic...
Neural bicameral LoRA Decoupling logic style
Dev.to · Thyago Carvalho 🎨 Image & Video AI 4mo ago
Neural bicameral LoRA Decoupling logic style
1. The Era of the Generalist Giant In the current landscape of AI, we rely heavily on...
Day 4: Generating Animated GIFs with Go 🎨
Dev.to · Rohan Nilatkar 🎨 Image & Video AI 4mo ago
Day 4: Generating Animated GIFs with Go 🎨
Today’s focus in my Go journey was moving beyond the console and into binary image generation. I’ve...
6 Pitfalls of Dynamic OG Image Generation on Cloudflare Workers (Satori + resvg-wasm)
Dev.to · DeVoresyah ArEst 🎨 Image & Video AI 4mo ago
6 Pitfalls of Dynamic OG Image Generation on Cloudflare Workers (Satori + resvg-wasm)
A deep-dive into the real issues we hit generating dynamic Open Graph images on Cloudflare Workers with Satori and resvg-wasm — and how we solved each one.
Seedance 2.0: How ByteDance's Dual-Branch Architecture Changes AI Video Generation
Dev.to · Jessie J 🎨 Image & Video AI 4mo ago
Seedance 2.0: How ByteDance's Dual-Branch Architecture Changes AI Video Generation
ByteDance released Seedance 2.0 in February 2026, and its architecture makes some genuinely...
Midjourney Alternative for Professionals
Dev.to · Kristjan Retter 🎨 Image & Video AI 4mo ago
Midjourney Alternative for Professionals
Midjourney was one of the first great generative models, and it remains an incredible tool for...
Inside the Prompt Black Markets: The Underground Trade of Proprietary Prompts for Midjourney, GPTs, and DALL-E 3
Dev.to · VelocityAI 🎨 Image & Video AI 4mo ago
Inside the Prompt Black Markets: The Underground Trade of Proprietary Prompts for Midjourney, GPTs, and DALL-E 3
You've seen the images. They're stunning, hyper-specific, and unlike anything you can generate with...
Dev 006 - Stable Diffusion, GPU Too Old
Dev.to · James 🎨 Image & Video AI 4mo ago
Dev 006 - Stable Diffusion, GPU Too Old
I'm convinced it's just a short time more until we see sprite sheets being generated entirely with...
Real-time, open source, AI video generation is here and here's what you can build with it
Dev.to · Vibor Cipan 🎨 Image & Video AI 4mo ago
Real-time, open source, AI video generation is here and here's what you can build with it
I think that most people think of AI video as "type a prompt, wait 30 seconds, get a clip." Sure,...
Dev 03 - Asset Creation with Photoshop Pattern View
Dev.to · James 🎨 Image & Video AI 5mo ago
Dev 03 - Asset Creation with Photoshop Pattern View
I am still very hopeful that I will be able to utilize Stable Diffusion for more texture generation,...
Dev 02 - Stable Diffusion - Pixel Art
Dev.to · James 🎨 Image & Video AI 5mo ago
Dev 02 - Stable Diffusion - Pixel Art
I spent a day getting Stable Diffusion installed locally so I can use it to generate pixel sprites...
Slashing torch.compile Warmup & LoRA Swapping Times with Pruna
Dev.to · Sara Han 🎨 Image & Video AI 5mo ago
Slashing torch.compile Warmup & LoRA Swapping Times with Pruna
PyTorch introduced torch.compile, a powerful feature that significantly boosts performance by...
How I Use Google Veo 3.1 & Sora API Without Breaking the Bank
Dev.to · Emmanuel Mumba 🎨 Image & Video AI 5mo ago
How I Use Google Veo 3.1 & Sora API Without Breaking the Bank
Working with video APIs can be tricky. Between juggling API keys, figuring out the right endpoints,...