Training video generation with Wan 2.2: Conan Oโ€™Brien and Will Smith character consistency

Oxen ยท Advanced ยท๐ŸŽจ Image & Video AI ยท8mo ago
Links + Notes ๐Ÿ“ https://www.oxen.ai/blog Join Fine-Tune Fridays ๐Ÿ”ง https://oxen.ai/community Discord ๐Ÿ—ฟ https://discord.com/invite/s3tBEn7Ptg Use Oxen AI ๐Ÿ‚ https://oxen.ai/ Oxen.ai offers one click fine-tuning or fine-tunes for you! Built on top of the worlds best data versioning tool, we offer tools to automate model evals, generate synthetic data, and effortlessly fine-tune models. -- Chapters 0:00 Is it Wan like โ€œAnneโ€ or โ€œwonโ€? 0:55 The Wan suite of models 1:10 Wan 2.1โ€™s model architecture and research paper 3:50 Wan 2.2 video improvements from Wan 2.1 5:35 Our fine-tuning goal: Conan Oโ€™Brien interviewing Will Smith whoโ€™s wearing a Denver Broncos shirt 7:30 Base model results 8:55 Wan 2.2โ€™s model architecture 12:55 Fine-tuning: How we created our data 17:12 Fine-tuning: How we fine-tuned each Wan model 19:22 Question: How many images do you need? 20:24 Question: Did we use musubi-tuner? 20:40 Question: How to train camera panning 22:45 Fine-tuning: Comparing images as we fine-tune 29:37 Bringing our Will Smith fine-tuned model to Comfyui 42:00 Configuring Comfyui to run our fine-tuned model 47:28 Question: Does the image input format matter? 48:40 Loading our Conan Oโ€™Brien fine-tuned model on Comfyui 57:45 Question: How are the LoRAs loaded into the pipeline 58:40 Final Results: Conan interviewing Will Smith
Watch on YouTube โ†— (saves to browser)
Sign in to unlock AI tutor explanation ยท โšก30

Related AI Lessons

โšก
How to Write Better AI Image Prompts for Midjourney (With Examples That Actually Work)
Learn to write effective AI image prompts for Midjourney with actionable examples and techniques
Medium ยท ChatGPT
โšก
Image to Video AI: The Complete Workflow Playbook That Actually Produces Results
Learn a step-by-step workflow for image-to-video AI that produces results, from preparation to delivery
Medium ยท AI
โšก
Image Harvest v1.0.2: Internationalization, Free Pro Trial & Quality-of-Life Improvements
Learn about Image Harvest v1.0.2, a Chrome extension with internationalization, free pro trial, and quality-of-life improvements, and how to utilize it for privacy-first image extraction
Dev.to ยท kyriewen
โšก
Pix2Pix: Image-to-Image Translation using Conditional GANs
Learn how to use Pix2Pix for image-to-image translation with conditional GANs, a powerful technique for generating realistic images
Medium ยท Deep Learning

Chapters (19)

Is it Wan like โ€œAnneโ€ or โ€œwonโ€?
0:55 The Wan suite of models
1:10 Wan 2.1โ€™s model architecture and research paper
3:50 Wan 2.2 video improvements from Wan 2.1
5:35 Our fine-tuning goal: Conan Oโ€™Brien interviewing Will Smith whoโ€™s wearing a Denv
7:30 Base model results
8:55 Wan 2.2โ€™s model architecture
12:55 Fine-tuning: How we created our data
17:12 Fine-tuning: How we fine-tuned each Wan model
19:22 Question: How many images do you need?
20:24 Question: Did we use musubi-tuner?
20:40 Question: How to train camera panning
22:45 Fine-tuning: Comparing images as we fine-tune
29:37 Bringing our Will Smith fine-tuned model to Comfyui
42:00 Configuring Comfyui to run our fine-tuned model
47:28 Question: Does the image input format matter?
48:40 Loading our Conan Oโ€™Brien fine-tuned model on Comfyui
57:45 Question: How are the LoRAs loaded into the pipeline
58:40 Final Results: Conan interviewing Will Smith
Up next
Krea 2 makes Diffusion FUN Again!
MattVidPro
Watch โ†’