Fine-Tuning Qwen-Image-Edit and Using Wan 2.2 to Generate Multiple Actors

Oxen · Intermediate · 🎨 Image & Video AI · 7mo ago
Links + Notes 📝 https://www.oxen.ai/blog
Join Fine-Tune Fridays 🔧 https://oxen.ai/community
Discord 🗿 https://discord.com/invite/s3tBEn7Ptg
Use Oxen AI 🐂 https://oxen.ai/

Oxen.ai offers one-click fine-tuning, or fine-tunes models for you! Built on top of the world's best data versioning tool, we offer tools to automate model evals, generate synthetic data, and effortlessly fine-tune models.

Chapters
0:00 The Task: Generating Conan O'Brien interviewing Will Smith
2:04 Base Model Results and Early Fine-Tunes
3:13 The Problem: Video models aren't good at multi-person generations
6:50 Can we just prompt Nano Banana instead of fine-tuning?
9:43 Why fine-tune?
11:17 What could a higher quality production pipeline look like?
14:50 Step 1: Masking
16:04 Enter DINOv3
21:28 Fine-tuning Qwen-Image-Edit to fill in masked images
26:12 Implementing our Wan 2.2 ComfyUI Workflow
28:13 Questions
31:40 Tweaking our ComfyUI flow
36:05 Moment of truth! Final generation
36:54 Question
38:15 Implementing our Qwen-Image-Edit LoRA in ComfyUI
43:24 Conclusion

