The Future Of Online Shopping

bycloud · Advanced ·📄 Research Papers Explained ·5y ago

Skills: Reading ML Papers90%CV Basics80%

Key Takeaways

The video discusses the VOGUE: Try-On by StyleGAN Interpolation Optimization research paper, which proposes a solution for virtual try-on in online shopping using AI, specifically StyleGAN 2 and latent space representation.

Full Transcript

i think there is nothing harder than finding the right size and the right look from online shopping and it is often easy to get the wrong size or style and you have to return it for the one that you want but then it takes like another month for this whole process before you can get your item which is really annoying so to propose a solution for this there are some ai researchers out there who have dedicated their time to solve this problem which is also known as virtual trions when given a pair of images we will want to apply for example the jeans onto the other image and in order to do that it will require us to synthesize high resolution images and the best image synthesizer right now is style gen 2. so a group of researchers from google mit ai lab and udub published this paper called vogue virtual try on by building upon stylegen 2 with various other algorithms to create this photorealistic try on ai so it looks really natural and high quality when you see the garment projected onto the person it is really flexible in projecting the garment too it not only is capable of being projected onto different poses but also can be applied to different directions of the body different body postures and even project short sleeves onto a person who has long sleeves on even when the arms were not shown from the input image it is able to synthesize consistent skin tones just by determining it from the hand and the neck really really impressive the same goes for projecting the pens too it gets the right waist level and even generates buttons that were not present in the referencing image not only that it is also able to generate a pocket where the hand can fit into accurately too i would definitely not have noticed this because it was just too natural such small details able being recognized by the ai is just surprising however if you look closely you can tell that the original image and the resulting image is slightly different in the region that is not supposed to change and this is actually because it is the way of generating these images that makes it look different to put it simply it's like music categorization map where in different directions you move the slow lead genres changes they are all still music of course but they sound gradually different when you move from the border of one category to another to put it into the perspective of vogue try on instead of music we now have clothes and so the music genres are like the type of clothes from hip hop to classical is like from short sleeves to long sleeves and the garments inside the image is now being represented by having thoughts on the categorization map and in a more technical term this map is called the latent space it exists in between the encoder and the decoder and it represents the image data in the most compressed form because it is so compressed by slightly changing its values of this latent space representation can affect the generation when it is being decoded at the end and instead of just being a compressed image data this ai focused on the garments instead of like the rgb values or other less relevant details in the good old days we have never dreamed about controlling this latent space because it is a very complex form of data and tuning it was basically impossible so after a lot of research papers in the recent years made by a lot of different ai researchers we are now able to control the latent space representations like we never had before and you can see that these kind of tasks has been shown a lot in ai face generation where we get the input phase latent space representation and we can then adjust the representation to get different hair lengths mustache gender age and even more while preserving the original look so this technique has been applied into this research paper with modifications and additions such as image segmentation for different regions of the body from the top the bottom the neck the chin to the hands and so this is how the ai can recognize where the arms are underneath the clothes and generate human skin over the arm instead of just clothes however there are also improvements to be made to this paper and it is far from perfect yet so far this ai is only capable of clothes with plain colors or simple patterns anything more complex than that such as a logo cannot be easily transformed into accurate latent space representation therefore in the generation would turn out distorted too and if you also look closely at the patterns or the buttons they are also not completely perfect and to exactly imitate try on they need to also figure out how to represent different clothes sizes but just by looking at the progress this paper has made against the older research papers i am pretty optimistic that this can turn into a very helpful tool in the near future and this video is sponsored by infinite red infinite red consulting handles your mobile web and ai needs if you are looking for someone to build your app visit with the link down in the description thank you guys for watching as usual and a big shout out to mark schfin and many other patreons that support my work through patreon join my discord and follow my twitter if you haven't and i'll see you all in the next one

Original Description

I forgot this research paper did not release their codes! So you cannot test it out personally. The results may also be cherry picked too, so just keep that in mind. VOGUE: Try-On by StyleGAN Interpolation Optimization [Project Page] https://vogue-try-on.github.io/ [Paper] http://arxiv.org/abs/2101.02285 [Interactive Demo] https://vogue-try-on.github.io/demo_rewrite.html Today's Sponsor is Infinite Red Infinite Red consulting handles your mobile, web, and AI needs Check it out here: https://bit.ly/2UwddmM This video is supported by the kind Patrons: 🙏Marc Schwyn, Mazen Alotaibi, Sascha Henrichs, Jake Disco, Peter Davidowicz Support me on Patreon if you hope to see more: https://www.patreon.com/bycloud or by becoming a member instead (same perks!): https://www.youtube.com/channel/UCgfe2ooZD3VJPB6aJAnuQng/join [Discord] https://discord.gg/NhJZGtH [Twitter] https://twitter.com/bycloudai [Patreon] https://www.patreon.com/bycloud [Music] Steaminwaffles - Blissful Negligence [Profile Art] https://twitter.com/bynicalcynical

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from bycloud · bycloud · 36 of 60

← Previous Next →

Can Deepfake work on Anime?

Can Deepfake work on Anime?

AI that Can Copy Voices

AI that Can Copy Voices

Live Action Is Terrible So AI Turned It Back Into Anime

Live Action Is Terrible So AI Turned It Back Into Anime

2 AIs Enhance Anime to 4K 240FPS, but is it good?

2 AIs Enhance Anime to 4K 240FPS, but is it good?

IRL to Anime With Cartoonization AI

IRL to Anime With Cartoonization AI

How Does AI Generated Songs Sound Like? [OpenAI Jukebox]

How Does AI Generated Songs Sound Like? [OpenAI Jukebox]

AI Makes Any Images Cinematic [3D Photo Inpainting]

AI Makes Any Images Cinematic [3D Photo Inpainting]

AI Generates Anime Faces, And It's Getting Even Better [StyleGAN2]

AI Generates Anime Faces, And It's Getting Even Better [StyleGAN2]

Tech Behind The Meme: Dame Da Ne AI - Single Image Deepfake

Tech Behind The Meme: Dame Da Ne AI - Single Image Deepfake

AI Generates New Light Source for Images [PaintingLight]

AI Generates New Light Source for Images [PaintingLight]

Depixelizing Doom Guy? Mona Lisa in Real Life? The "Upscaling" AI: PULSE

Depixelizing Doom Guy? Mona Lisa in Real Life? The "Upscaling" AI: PULSE

Image Completion AI - Predict Pixels Just Like Text Predictions [Image-GPT]

Image Completion AI - Predict Pixels Just Like Text Predictions [Image-GPT]

AI Generates 3D Human Model from 2D Image [PIFuHD - FacebookAI]

AI Generates 3D Human Model from 2D Image [PIFuHD - FacebookAI]

AI Assisted Masking - Save Your Precious Time Right Now [AE Rotobrush 2]

AI Assisted Masking - Save Your Precious Time Right Now [AE Rotobrush 2]

This AI Reconstruct Real Life Objects From Just Images [NeRF]

This AI Reconstruct Real Life Objects From Just Images [NeRF]

Image Restoration AI - Upscale and Restore Faces with DFDNet

Image Restoration AI - Upscale and Restore Faces with DFDNet

Best Image Colorization AI 2020

Best Image Colorization AI 2020

Image Decomposition AI - Edit Highlights and Textures Easily [Appearance Eraser]

Image Decomposition AI - Edit Highlights and Textures Easily [Appearance Eraser]

Deepfake With Audio Only [Wav2Lip]

Deepfake With Audio Only [Wav2Lip]

Copy IRL, Paste on your PC [AR Cut & Paste]

Copy IRL, Paste on your PC [AR Cut & Paste]

This AI Transform Faces into Hyper-Realistic Cartoon Characters [Toonify]

This AI Transform Faces into Hyper-Realistic Cartoon Characters [Toonify]

This AI Restores Old Photos with Damages Automatically!

This AI Restores Old Photos with Damages Automatically!

Anime Filter with AI - Snapchat vs. TikTok

Anime Filter with AI - Snapchat vs. TikTok

AI Reduces Bandwidth Problems for Video Calls [NVIDIA Maxine]

AI Reduces Bandwidth Problems for Video Calls [NVIDIA Maxine]

AI Motion Capture - Track Your Hands & Body WITHOUT Bodysuit [FrankMocap]

AI Motion Capture - Track Your Hands & Body WITHOUT Bodysuit [FrankMocap]

AI Converts Cartoon Characters To Real Life [Pixel2Style2Pixel]

AI Converts Cartoon Characters To Real Life [Pixel2Style2Pixel]

AI Sky Replacement with SkyAR

AI Sky Replacement with SkyAR

Better Than DAIN? NEW BEST Tool for Boosting Video's FPS with AI [RIFE/Flowframes]

Better Than DAIN? NEW BEST Tool for Boosting Video's FPS with AI [RIFE/Flowframes]

AI That Paints Anything Stroke By Stroke

AI That Paints Anything Stroke By Stroke

What Happens When AI Robots Design Themselves

What Happens When AI Robots Design Themselves

Deepfake Movements with 1 image ONLY [Liquid Warping GAN]

Deepfake Movements with 1 image ONLY [Liquid Warping GAN]

ANYTHING can be a "Green Screen" Now [Real-Time High-Resolution Background Matting]

ANYTHING can be a "Green Screen" Now [Real-Time High-Resolution Background Matting]

AI Transform any Image into Sketch or Line Art [ArtLine]

AI Transform any Image into Sketch or Line Art [ArtLine]

AI That Could Soon Replace Vector Artists [DALL-E]

AI That Could Soon Replace Vector Artists [DALL-E]

Photoshop Detector AI Is Useless

Photoshop Detector AI Is Useless

The Future Of Online Shopping

The Future Of Online Shopping

How The Future of Image Search Would Look Like

How The Future of Image Search Would Look Like

Everyone Can Make 3D Animations Easily Now! [Monster Mash]

Everyone Can Make 3D Animations Easily Now! [Monster Mash]

3D Video Stabilization with AI [NSFF]

3D Video Stabilization with AI [NSFF]

OpenAI’s Sarcastic Chat Bot [GPT-3 API Beta]

OpenAI’s Sarcastic Chat Bot [GPT-3 API Beta]

You Describe & AI Photoshops Faces For You [StyleCLIP]

You Describe & AI Photoshops Faces For You [StyleCLIP]

You Only Need Audio To Deepfake Now! Might look slightly cursed tho [PCAVS]

You Only Need Audio To Deepfake Now! Might look slightly cursed tho [PCAVS]

This AI Transfers Anime Back Into Sketch [Anime2Sketch]

This AI Transfers Anime Back Into Sketch [Anime2Sketch]

AI Learns To Play CS:GO By Watching Humans Play!

AI Learns To Play CS:GO By Watching Humans Play!

How AI Fixes The Horrendous CR7 Statue

How AI Fixes The Horrendous CR7 Statue

Best Vocal Isolation & Instrumental Extraction 2021 [lalal.ai vs Spleeter]

Best Vocal Isolation & Instrumental Extraction 2021 [lalal.ai vs Spleeter]

Face Enhance AI Restores Extremely Blurry Faces [GPEN]

Face Enhance AI Restores Extremely Blurry Faces [GPEN]

AI That Only Needs 1 Image To Deepfake [SimSwap]

AI That Only Needs 1 Image To Deepfake [SimSwap]

The Amazing AI Behind the TikTok JoJo Pose Challenge [BoostMonocularDepth + 3DP]

The Amazing AI Behind the TikTok JoJo Pose Challenge [BoostMonocularDepth + 3DP]

StyleGAN3!? - What AI Actually Sees When Generating Faces [Alias-Free GAN]

StyleGAN3!? - What AI Actually Sees When Generating Faces [Alias-Free GAN]

AI generated art goes brrrrr [VQGAN+CLIP]

AI generated art goes brrrrr [VQGAN+CLIP]

AI That Doodles Any Given Description

AI That Doodles Any Given Description

Best AI Motion Capture 2021 - OpenPose vs DeepMotion

Best AI Motion Capture 2021 - OpenPose vs DeepMotion

Anime Image Enhance AI Has Gone To The Next Level [Real-ESRGAN]

Anime Image Enhance AI Has Gone To The Next Level [Real-ESRGAN]

This Video's Voice Is Entirely Made From Audio Deepfake

This Video's Voice Is Entirely Made From Audio Deepfake

I Can’t Sing So I Cloned My Voice w/ AI To Cover Goodbye Sengen (English Cover)

I Can’t Sing So I Cloned My Voice w/ AI To Cover Goodbye Sengen (English Cover)

Best Background Removal - AIs Removes BG Without Green Screen And It's Amazing. [RVM]

Best Background Removal - AIs Removes BG Without Green Screen And It's Amazing. [RVM]

How I Deepfaked VTuber Gawr Gura with AI

How I Deepfaked VTuber Gawr Gura with AI

AI Magic Removal - Removes ANYTHING & Inpaints For You [LaMa]

AI Magic Removal - Removes ANYTHING & Inpaints For You [LaMa]

I Did NOT Expect AI Anime Filter To Be This Good [AnimeGANv2]

I Did NOT Expect AI Anime Filter To Be This Good [AnimeGANv2]

The video discusses a research paper on virtual try-on using AI, which can synthesize high-resolution images and project garments onto a person, allowing for photorealistic try-on. The paper uses StyleGAN 2 and latent space representation to achieve this.

Key Takeaways

Understand the problem of virtual try-on in online shopping
Learn about StyleGAN 2 and its applications
Study latent space representation and its role in image synthesis
Apply knowledge of computer vision and image processing to understand the paper's contributions

💡 The paper's use of latent space representation allows for flexible and photorealistic garment projection, but there are still limitations and improvements to be made.

🔒 Pro feature: Ask AI to explain this lesson →

More on: Reading ML Papers

View skill →

Automatic Literature Review with GPT-3 - I embedded and indexed all of arXiv into a search engine!

Automatic Literature Review with GPT-3 - I embedded and indexed all of arXiv into a search engine!

Marcos Lopez Caniego - ESASky's JupyterLab widget| JupyterCon 2020

Marcos Lopez Caniego - ESASky's JupyterLab widget| JupyterCon 2020

Obsidian Zotero Integration Plugin | Streamline Your Research Paper Workflow 📝️

Obsidian Zotero Integration Plugin | Streamline Your Research Paper Workflow 📝️

This FULLY FREE Research Agent can BUILD Reports in Minutes!!!

This FULLY FREE Research Agent can BUILD Reports in Minutes!!!

Claude 3.7 Sonnet API | Build a Research Assistant

Claude 3.7 Sonnet API | Build a Research Assistant

I Built An Obsidian AI Research Assistant with Oz...

I Built An Obsidian AI Research Assistant with Oz...

Related AI Lessons

I Spent Weeks Looking for a Research Gap Before I Realized I Was Searching the Wrong Way

Learn how to effectively find research gaps by changing your approach, a crucial skill for AI researchers and academics

ICMI 2026 Reviews [D]

Learn how to interpret ICMI 2026 reviews and improve your paper's acceptance chances

Reddit r/MachineLearning

Workshop submission for main conference paper under review [D]

Learn how to navigate submitting a paper to a non-archival workshop before the final decision of a main conference like ECCV

Reddit r/MachineLearning

Kept context-switching between arxiv, OpenReview, GitHub, and HuggingFace for every paper, so I built this. Chrome extension + website with everything inline, plus citation graph + SPECTER2 neighbors. 3M papers, free, feedback welcome [P]

Streamline your research with a new Chrome extension and website that integrates 3M papers from arxiv, OpenReview, GitHub, and HuggingFace, including citation graphs and SPECTER2 neighbors, and provide feedback to improve it

Reddit r/MachineLearning

Beyond Big Vendors: ERP Systems Explained #shorts

Digital Transformation with Eric Kimberling