Applied AI

Image & Video AI

Stable Diffusion, Midjourney, DALL-E, Sora, ControlNet and AI video generation

2,618 lessons
Skills in this topic
Image Generation Basics (beginner): Generate photorealistic and stylised images with prompts
Advanced Image Generation (intermediate): Use ControlNet for pose/depth-guided generation
AI Video Generation (advanced): Generate a 10-second video clip from a text prompt

Showing 30 reads from curated sources

How to Write Better AI Image Prompts for Midjourney (With Examples That Actually Work)
Medium · ChatGPT 🎨 Image & Video AI ⚡ AI Lesson 2d ago
If you use Midjourney, DALL·E, or Stable Diffusion, you’ve probably experienced this:
Image to Video AI: The Complete Workflow Playbook That Actually Produces Results
Medium · AI 🎨 Image & Video AI ⚡ AI Lesson 3d ago
From source image preparation to model selection, prompt architecture, quality control, and platform delivery: a practical guide for…
Image Harvest v1.0.2: Internationalization, Free Pro Trial & Quality-of-Life Improvements
Dev.to · kyriewen 🎨 Image & Video AI ⚡ AI Lesson 3d ago
A month ago I launched Image Harvest — a privacy-first Chrome extension that extracts and...
Medium · Deep Learning 🎨 Image & Video AI ⚡ AI Lesson 4d ago
Pix2Pix: Image-to-Image Translation using Conditional GANs
Image-to-image translation is one of the most fascinating applications of deep learning. Instead of generating images purely from random…
Image Captioning API: Auto-Generate Alt Text and Descriptions
Dev.to · Om Prakash 🎨 Image & Video AI ⚡ AI Lesson 1w ago
Fast image-to-text API with three caption styles: concise alt-tags, SEO-rich descriptions, and detailed narration. 8 credits per call.
Long video generation blog: Six Approaches, One Decision
Dev.to · Atlas Cloud 🎨 Image & Video AI ⚡ AI Lesson 1w ago
A few months ago we set ourselves a deceptively simple goal: produce coherent, high-quality video...
Optimizing Lossless Image Compression (No Loss of Detail) for High-Resolution Data
Medium · Data Science 🎨 Image & Video AI ⚡ AI Lesson 1w ago
As the need for precise visual quality grows, high-resolution digital image processing…
The Complete Guide to Programmatic Image Generation
Dev.to · Iteration Layer 🎨 Image & Video AI ⚡ AI Lesson 2w ago
From Puppeteer to layer-based APIs — how to generate images programmatically at scale. Methods, patterns, and code.
I Tested 25 AI Headshot Generators. Here Are 9 That Actually Look Real (2026 Guide)
Medium · AI 🎨 Image & Video AI ⚡ AI Lesson 2w ago
A hands-on comparison of the most realistic AI headshot generators for LinkedIn and professional use.
Dev.to AI 🎨 Image & Video AI ⚡ AI Lesson 2w ago
Gemini Stalling? Optimize Performance with Google Workspace Login & Usage Management
Gemini Image Generation Stalling? Understanding Usage Limits in Google Workspace Have you ever experienced a sudden halt in Gemini's image generation, right in
I Built a Watermark Remover — Here’s What I Actually Learned
Dev.to · Eric Cheung 🎨 Image & Video AI ⚡ AI Lesson 2w ago
I'd generate an image with Gemini, like it, want to drop it into a draft or mockup — and there was...
Dev.to AI 🎨 Image & Video AI ⚡ AI Lesson 2w ago
Creating AI Videos in Portuguese: Complete Guide 2026
In recent years, artificial intelligence (AI) has transformed the way we create and consume content. One of the areas that has seen significant growth…
Dev.to AI 🎨 Image & Video AI ⚡ AI Lesson 2w ago
How I Created Custom AI Images for My Indie Project in Under an Hour – And You Can Too!
I was knee-deep in my latest indie app build, a productivity tracker for remote workers, when I hit a wall: I needed custom icons and banners to make it pop, bu
Dev.to AI 🎨 Image & Video AI ⚡ AI Lesson 2w ago
(The Senses) Image Generation & Media
The Catalyst Field note: Nano Banana Pro and reactive image gen I hit a real workflow failure mode: a proactive image stack (Nano Banana Pro) that would spontan
Achieving Consistent Cinematic Color Grading for Your Image Series
Dev.to · Om Prakash 🎨 Image & Video AI ⚡ AI Lesson 2w ago
If you’ve spent any time working with visual media—whether it’s for a personal travel blog, an...
Dev.to AI 🎨 Image & Video AI ⚡ AI Lesson 3w ago
How to Create Product Photos with GPT Image 2
If you sell physical products, one of the hardest creative problems is producing enough visuals that are actually usable across product pages, ads, and social.
GPT Image2 vs. Nano Banana2: The New Battle for Visual AI Supremacy
Medium · ChatGPT 🎨 Image & Video AI ⚡ AI Lesson 3w ago
For the past year, Nano Banana2 has quietly built a reputation as the gold standard in AI image generation, especially among designers…
Medium · ChatGPT 🎨 Image & Video AI ⚡ AI Lesson 3w ago
Unlocking the Power of AI-Generated Images: ChatGPT’s Latest Upgrade
The world of artificial intelligence has witnessed significant advancements in recent years, with one of the most notable developments…
When Preprocessing Helps — and When It Hurts: Why Your Image Classification Model’s Accuracy Varies
Medium · Deep Learning 🎨 Image & Video AI ⚡ AI Lesson 3w ago
From 65% to 87% accuracy on CIFAR-10 using Convolutional Neural Networks, and what went wrong along the way.
Tired of Sorting Generated Images? I Built a Flask Tool.(For Mac code)
Medium · Python 🎨 Image & Video AI ⚡ AI Lesson 3w ago
If you’re generating images with ComfyUI, you probably already know the real problem isn’t generation; it’s cleanup.
How to Create Soft Cinematic Light in Midjourney
Medium · AI 🎨 Image & Video AI ⚡ AI Lesson 3w ago
Most Midjourney images feel too harsh or artificial, even with great prompts, because the light is wrong.
Dev.to AI 🎨 Image & Video AI ⚡ AI Lesson 3w ago
Why Every AI Image Generator Fails at Text (And One That Finally Doesn't)
If you've spent any time with AI image generators, you've probably run into the same f
Dev.to AI 🎨 Image & Video AI ⚡ AI Lesson 3w ago
ERNIE-Image: A Text-to-Image Model Built for Posters, Comics, and Text-Rich Visual Content
Introduction As text-to-image models continue to evolve, most improvements have focused on visual quality—higher resolution, better textures, and more photoreal
TIFF in 2026: what I learned researching the format nobody uses on the web
Dev.to · Serhii Kalyna 🎨 Image & Video AI ⚡ AI Lesson 1mo ago
I'm building a free image converter. One day I looked at my landing page for /tiff-to-webp and...
Denoising
Towards AI 🎨 Image & Video AI ⚡ AI Lesson 1mo ago
Author(s): Sefa Bilicier. Originally published on Towards AI. Introduction: Have you ever taken a photo in low light and noticed those grainy, discolored spots th
ArXiv cs.AI 🎨 Image & Video AI 📄 Paper ⚡ AI Lesson 1mo ago
SANA-I2I: A Text-Free Flow Matching Framework for Paired Image-to-Image Translation with a Case Study in Fetal MRI Artifact Reduction
arXiv:2604.00298v1 Announce Type: cross Abstract: We propose SANA-I2I, a text-free high-resolution image-to-image generation framework that extends the SANA fam
ArXiv cs.AI 🎨 Image & Video AI 📄 Paper ⚡ AI Lesson 1mo ago
Science-T2I: Addressing Scientific Illusions in Image Synthesis
arXiv:2504.13129v2 Announce Type: replace-cross Abstract: Current image generation models produce visually compelling but scientifically implausible images, exp
ArXiv cs.AI 🎨 Image & Video AI 📄 Paper ⚡ AI Lesson 1mo ago
MELT: Improve Composed Image Retrieval via the Modification Frequentation-Rarity Balance Network
arXiv:2603.29291v1 Announce Type: cross Abstract: Composed Image Retrieval (CIR) uses a reference image and a modification text as a query to retrieve a target
ArXiv cs.AI 🎨 Image & Video AI 📄 Paper ⚡ AI Lesson 1mo ago
ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks
arXiv:2603.27862v1 Announce Type: cross Abstract: Advances in diffusion, autoregressive, and hybrid models have enabled high-quality image synthesis for tasks s
ArXiv cs.AI 🎨 Image & Video AI 📄 Paper 1mo ago
Image Generation Models: A Technical History
arXiv:2603.07455v2 Announce Type: replace-cross Abstract: Image generation has advanced rapidly over the past decade, yet the literature seems fragmented across