Fine-Tuning Qwen-Image-Edit and Using Wan 2.2 to Generate Multiple Actors
Links + Notes: https://www.oxen.ai/blog
Join Fine-Tune Fridays: https://oxen.ai/community
Discord: https://discord.com/invite/s3tBEn7Ptg
Use Oxen AI: https://oxen.ai/
Oxen.ai offers one-click fine-tuning, or will fine-tune models for you! Built on top of the world's best data versioning tool, we offer tools to automate model evals, generate synthetic data, and effortlessly fine-tune models.
--
Chapters
0:00 The Task: Generating Conan O'Brien interviewing Will Smith
2:04 Base Model Results and Early Fine-Tunes
3:13 The Problem: Video models aren't good at multi-person generations
6:50 Can we just prompt Nano Banana instead of fine-tuning?
9:43 Why fine-tune?
11:17 What could a higher-quality production pipeline look like?
14:50 Step 1: Masking
16:04 Enter DINOv3
21:28 Fine-tuning Qwen-Image-Edit to fill in masked images
26:12 Implementing our Wan 2.2 ComfyUI Workflow
28:13 Questions
31:40 Tweaking our ComfyUI flow
36:05 Moment of truth! Final generation
36:54 Question
38:15 Implementing our Qwen-Image-Edit LoRA in ComfyUI
43:24 Conclusion
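The pipeline the chapters walk through starts by masking out each actor (here, a segmentation model like DINOv3 produces the mask) and then asking a fine-tuned Qwen-Image-Edit model to fill the masked region back in. A minimal sketch of that first masking step, assuming images are numpy arrays; the `apply_mask` helper and the gray fill value are illustrative, not the workflow's actual code:

```python
import numpy as np

def apply_mask(image: np.ndarray, mask: np.ndarray, fill_value: int = 127) -> np.ndarray:
    """Gray out the masked region of an H x W x 3 image.

    `mask` is an H x W boolean array (True = region to regenerate),
    e.g. a person segmentation from a model like DINOv3. The result is
    the kind of input an inpainting/edit model is asked to fill in.
    """
    masked = image.copy()
    masked[mask] = fill_value  # boolean indexing broadcasts across RGB channels
    return masked

# Toy example: a 4x4 RGB image with the top-left 2x2 block masked out.
img = np.arange(4 * 4 * 3, dtype=np.uint8).reshape(4, 4, 3)
mask = np.zeros((4, 4), dtype=bool)
mask[:2, :2] = True
out = apply_mask(img, mask)
```

The masked image and the binary mask are then passed together to the edit model, so only the grayed-out region is regenerated while the rest of the frame stays pixel-identical.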