The Future Of Online Shopping
Key Takeaways
The video discusses the VOGUE: Try-On by StyleGAN Interpolation Optimization research paper, which proposes a solution for virtual try-on in online shopping using AI, specifically StyleGAN 2 and latent space representation.
Full Transcript
i think there is nothing harder than finding the right size and the right look from online shopping and it is often easy to get the wrong size or style and you have to return it for the one that you want but then it takes like another month for this whole process before you can get your item which is really annoying so to propose a solution for this there are some ai researchers out there who have dedicated their time to solve this problem which is also known as virtual trions when given a pair of images we will want to apply for example the jeans onto the other image and in order to do that it will require us to synthesize high resolution images and the best image synthesizer right now is style gen 2. so a group of researchers from google mit ai lab and udub published this paper called vogue virtual try on by building upon stylegen 2 with various other algorithms to create this photorealistic try on ai so it looks really natural and high quality when you see the garment projected onto the person it is really flexible in projecting the garment too it not only is capable of being projected onto different poses but also can be applied to different directions of the body different body postures and even project short sleeves onto a person who has long sleeves on even when the arms were not shown from the input image it is able to synthesize consistent skin tones just by determining it from the hand and the neck really really impressive the same goes for projecting the pens too it gets the right waist level and even generates buttons that were not present in the referencing image not only that it is also able to generate a pocket where the hand can fit into accurately too i would definitely not have noticed this because it was just too natural such small details able being recognized by the ai is just surprising however if you look closely you can tell that the original image and the resulting image is slightly different in the region that is not supposed to change and this is actually because it is the way of generating these images that makes it look different to put it simply it's like music categorization map where in different directions you move the slow lead genres changes they are all still music of course but they sound gradually different when you move from the border of one category to another to put it into the perspective of vogue try on instead of music we now have clothes and so the music genres are like the type of clothes from hip hop to classical is like from short sleeves to long sleeves and the garments inside the image is now being represented by having thoughts on the categorization map and in a more technical term this map is called the latent space it exists in between the encoder and the decoder and it represents the image data in the most compressed form because it is so compressed by slightly changing its values of this latent space representation can affect the generation when it is being decoded at the end and instead of just being a compressed image data this ai focused on the garments instead of like the rgb values or other less relevant details in the good old days we have never dreamed about controlling this latent space because it is a very complex form of data and tuning it was basically impossible so after a lot of research papers in the recent years made by a lot of different ai researchers we are now able to control the latent space representations like we never had before and you can see that these kind of tasks has been shown a lot in ai face generation where we get the input phase latent space representation and we can then adjust the representation to get different hair lengths mustache gender age and even more while preserving the original look so this technique has been applied into this research paper with modifications and additions such as image segmentation for different regions of the body from the top the bottom the neck the chin to the hands and so this is how the ai can recognize where the arms are underneath the clothes and generate human skin over the arm instead of just clothes however there are also improvements to be made to this paper and it is far from perfect yet so far this ai is only capable of clothes with plain colors or simple patterns anything more complex than that such as a logo cannot be easily transformed into accurate latent space representation therefore in the generation would turn out distorted too and if you also look closely at the patterns or the buttons they are also not completely perfect and to exactly imitate try on they need to also figure out how to represent different clothes sizes but just by looking at the progress this paper has made against the older research papers i am pretty optimistic that this can turn into a very helpful tool in the near future and this video is sponsored by infinite red infinite red consulting handles your mobile web and ai needs if you are looking for someone to build your app visit with the link down in the description thank you guys for watching as usual and a big shout out to mark schfin and many other patreons that support my work through patreon join my discord and follow my twitter if you haven't and i'll see you all in the next one
Original Description
I forgot this research paper did not release their codes! So you cannot test it out personally. The results may also be cherry picked too, so just keep that in mind.
VOGUE: Try-On by StyleGAN Interpolation Optimization
[Project Page] https://vogue-try-on.github.io/
[Paper] http://arxiv.org/abs/2101.02285
[Interactive Demo] https://vogue-try-on.github.io/demo_rewrite.html
Today's Sponsor is Infinite Red
Infinite Red consulting handles your mobile, web, and AI needs
Check it out here: https://bit.ly/2UwddmM
This video is supported by the kind Patrons:
🙏Marc Schwyn, Mazen Alotaibi, Sascha Henrichs, Jake Disco, Peter Davidowicz
Support me on Patreon if you hope to see more:
https://www.patreon.com/bycloud
or by becoming a member instead (same perks!):
https://www.youtube.com/channel/UCgfe2ooZD3VJPB6aJAnuQng/join
[Discord] https://discord.gg/NhJZGtH
[Twitter] https://twitter.com/bycloudai
[Patreon] https://www.patreon.com/bycloud
[Music] Steaminwaffles - Blissful Negligence
[Profile Art] https://twitter.com/bynicalcynical
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
Playlist
Uploads from bycloud · bycloud · 36 of 60
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
▶
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
Can Deepfake work on Anime?
bycloud
AI that Can Copy Voices
bycloud
Live Action Is Terrible So AI Turned It Back Into Anime
bycloud
2 AIs Enhance Anime to 4K 240FPS, but is it good?
bycloud
IRL to Anime With Cartoonization AI
bycloud
How Does AI Generated Songs Sound Like? [OpenAI Jukebox]
bycloud
AI Makes Any Images Cinematic [3D Photo Inpainting]
bycloud
AI Generates Anime Faces, And It's Getting Even Better [StyleGAN2]
bycloud
Tech Behind The Meme: Dame Da Ne AI - Single Image Deepfake
bycloud
AI Generates New Light Source for Images [PaintingLight]
bycloud
Depixelizing Doom Guy? Mona Lisa in Real Life? The "Upscaling" AI: PULSE
bycloud
Image Completion AI - Predict Pixels Just Like Text Predictions [Image-GPT]
bycloud
AI Generates 3D Human Model from 2D Image [PIFuHD - FacebookAI]
bycloud
AI Assisted Masking - Save Your Precious Time Right Now [AE Rotobrush 2]
bycloud
This AI Reconstruct Real Life Objects From Just Images [NeRF]
bycloud
Image Restoration AI - Upscale and Restore Faces with DFDNet
bycloud
Best Image Colorization AI 2020
bycloud
Image Decomposition AI - Edit Highlights and Textures Easily [Appearance Eraser]
bycloud
Deepfake With Audio Only [Wav2Lip]
bycloud
Copy IRL, Paste on your PC [AR Cut & Paste]
bycloud
This AI Transform Faces into Hyper-Realistic Cartoon Characters [Toonify]
bycloud
This AI Restores Old Photos with Damages Automatically!
bycloud
Anime Filter with AI - Snapchat vs. TikTok
bycloud
AI Reduces Bandwidth Problems for Video Calls [NVIDIA Maxine]
bycloud
AI Motion Capture - Track Your Hands & Body WITHOUT Bodysuit [FrankMocap]
bycloud
AI Converts Cartoon Characters To Real Life [Pixel2Style2Pixel]
bycloud
AI Sky Replacement with SkyAR
bycloud
Better Than DAIN? NEW BEST Tool for Boosting Video's FPS with AI [RIFE/Flowframes]
bycloud
AI That Paints Anything Stroke By Stroke
bycloud
What Happens When AI Robots Design Themselves
bycloud
Deepfake Movements with 1 image ONLY [Liquid Warping GAN]
bycloud
ANYTHING can be a "Green Screen" Now [Real-Time High-Resolution Background Matting]
bycloud
AI Transform any Image into Sketch or Line Art [ArtLine]
bycloud
AI That Could Soon Replace Vector Artists [DALL-E]
bycloud
Photoshop Detector AI Is Useless
bycloud
The Future Of Online Shopping
bycloud
How The Future of Image Search Would Look Like
bycloud
Everyone Can Make 3D Animations Easily Now! [Monster Mash]
bycloud
3D Video Stabilization with AI [NSFF]
bycloud
OpenAI’s Sarcastic Chat Bot [GPT-3 API Beta]
bycloud
You Describe & AI Photoshops Faces For You [StyleCLIP]
bycloud
You Only Need Audio To Deepfake Now! Might look slightly cursed tho [PCAVS]
bycloud
This AI Transfers Anime Back Into Sketch [Anime2Sketch]
bycloud
AI Learns To Play CS:GO By Watching Humans Play!
bycloud
How AI Fixes The Horrendous CR7 Statue
bycloud
Best Vocal Isolation & Instrumental Extraction 2021 [lalal.ai vs Spleeter]
bycloud
Face Enhance AI Restores Extremely Blurry Faces [GPEN]
bycloud
AI That Only Needs 1 Image To Deepfake [SimSwap]
bycloud
The Amazing AI Behind the TikTok JoJo Pose Challenge [BoostMonocularDepth + 3DP]
bycloud
StyleGAN3!? - What AI Actually Sees When Generating Faces [Alias-Free GAN]
bycloud
AI generated art goes brrrrr [VQGAN+CLIP]
bycloud
AI That Doodles Any Given Description
bycloud
Best AI Motion Capture 2021 - OpenPose vs DeepMotion
bycloud
Anime Image Enhance AI Has Gone To The Next Level [Real-ESRGAN]
bycloud
This Video's Voice Is Entirely Made From Audio Deepfake
bycloud
I Can’t Sing So I Cloned My Voice w/ AI To Cover Goodbye Sengen (English Cover)
bycloud
Best Background Removal - AIs Removes BG Without Green Screen And It's Amazing. [RVM]
bycloud
How I Deepfaked VTuber Gawr Gura with AI
bycloud
AI Magic Removal - Removes ANYTHING & Inpaints For You [LaMa]
bycloud
I Did NOT Expect AI Anime Filter To Be This Good [AnimeGANv2]
bycloud
More on: Reading ML Papers
View skill →Related AI Lessons
⚡
⚡
⚡
⚡
I Spent Weeks Looking for a Research Gap Before I Realized I Was Searching the Wrong Way
Medium · AI
ICMI 2026 Reviews [D]
Reddit r/MachineLearning
Workshop submission for main conference paper under review [D]
Reddit r/MachineLearning
Kept context-switching between arxiv, OpenReview, GitHub, and HuggingFace for every paper, so I built this. Chrome extension + website with everything inline, plus citation graph + SPECTER2 neighbors. 3M papers, free, feedback welcome [P]
Reddit r/MachineLearning
🎓
Tutor Explanation
DeepCamp AI