Learn to Draw Real People using AI: Unveiling Future of Image-to-Image Translation

What's AI by Louis-François Bouchard · Beginner ·📄 Research Papers Explained ·6y ago

Key Takeaways

The video discusses the DeepFaceDrawing paper, which presents an image-to-image translation technique for generating high-quality face images from rough or incomplete sketches using a deep learning framework. The technique uses a two-subnetwork architecture, including a component embedding module and a feature mapping and synthesis module, to learn a space of plausible face images and synthesize a face image from an input sketch.

Full Transcript

you can now generate high-quality face images from ruff or even incomplete sketches with zero drawing skills using this image to image translation technique if your drawing skills are as bad as mine you can even adjust how much the eyes mouth and nose will affect the final image let's see if it really works and how they did it recent deep image to image translation techniques allo fast generation of face images from freehand sketches however existing solutions tend to overfit two sketches thus requiring professional consistent sketches as inputs show you Chen and I'll just share the paper called deep face drawing deep generation of face image from sketches to address this issue their key ID is to implicitly model the shape space of plausible face images and synthesize a face image in the space to approximate an input sketch not worry if you are lost we will cover this in the video there system allows user with little or no training in drawing to produce high-quality face images from ruff are even incomplete freehand sketches the method even faithfully respects user intentions in input strokes which serve more like soft constraints to get image synthesis they essentially use input sketches as soft constraints and is thus able to produce high-quality face images even from these rough sketches most of such deep learning based solutions for sketch to image translation often take input sketches almost fixed and attempt to infer the missing texture or shading information between strokes to some extent their problems are formulated more like reconstruction problems with input sketches as hard constraints since the often trained their networks from pairs of real images and their corresponding edge maps due to the data-driven nature the you cry or test sketches with quality similar to the edge maps of real images to synthesize realistic face images however such sketches are difficult to make especially for users with little training enjoying to address this issue the key ID is two implicitly learn a space of plausible face catches from real face catcher images and find the closest point in the space to approximate an input sketch this way sketches can be used more like stuff constraints to guide the image synthesis and as you can see the results are amazing as illustrated the deep learning framework takes a sketch image as input and generates a high quality facial image the network's architecture consists of two sub networks the first sub network is the component embedding module which is responsible for learning feature and beddings of individual face components using separate auto encoder networks the step turns component sketches into semantically meaningful feature vectors using a non - encoder architecture that separately learns five feature descriptors from the face cache data namely for left eye right eye nose mouth and reminder a reminder image corresponding to the reminder component is the same as the original sketch image but with the eyes nose and mouth removed the second sub Network consists of two sub modules the feature mapping and the ml synthesis although FM looks similar to the decoding part of see using the coding models converting feature vectors to spatial feature Maps it improves the information flow and thus provide more flexibility to fuse individual phase components for higher quality synthesis results the feature maps of individual phase components are then combined according to the face structure and finally pass to the is for face image synthesis which converts them to a realistic face image using a conditional Gann architecture which takes the feature Maps as input to a generator with the generation guided by a discriminator if you are not familiar with the gun architecture I suggest you to watch the video I made introducing them with this complex architecture they adapted a two-stage training strategy to train their network in stage 1 only the C module is trained by using component sketches to train five individual auto-encoders for filter embeddings the training is done in a self supervised manner which are covered in a previous video and I linked in stage 2 they fix the parameters at the Train component encoders and train the entire network with the unknown parameters in the FM and is modules together in an end-to-end manner to assess users especially those with little training and drawing we provided a shadow guidance caching which is shown in this video given a current sketch it finds the 10 most similar sketch component images the fun component images are then blended as shadow and placed at the corresponding components position for sketching guidance as you can see on the left initially when the canvas is empty the shadow is more blurry the shadow is updated instantly for every new input stroke the synthesized image is displayed in the window on the right users may choose to update the synthesized image instantly or trigger a convert command of course users with good drawing skills tend to trust their own drawings more dentals with little training enjoying so they provided a slider for each component type to control the blending weights between a sketch component and its refined version controlling the degree of interpolation between the sketch you made and the final version that is shown for either the eyes nose or mouth both qualitative and quantitative evaluations show the superior generation ability of their system to existing and alternative solutions just take a moment to look at these amazing results in comparison with the alternatives creating realistic human face images from scratch benefits various applications including face morphing face copy/paste criminal investigation character design it's casual training and more due to their simplicity conciseness and ease of use sketches are often used to depict desired faces which makes this new paper extremely relevant of course this was just a simple overview of the new image to image translation technique that allows fast generation of face images from freehand sketches I strongly recommend to read their paper and check their video demo both are linked in the description if you want me to cover a specific term in next video leave it in the comments below please leave a like if you learn something and subscribe to the channel to not miss any further terms clearly explain [Music]

Original Description

This week's new paper is DeepFaceDrawing. Ask any questions or remarks you have in the comments, I will gladly answer to everything! Subscribe to not miss any AI news and terms clearly vulgarized! #SketchToImage #DeepFaceDrawing #DeepLearning GANs: https://www.youtube.com/watch?v=ZnpZsiy_p2M The paper, code and video of the DeepFaceDrawing is available on their page: http://geometrylearning.com/DeepFaceDrawing/ Share this to someone who needs to learn more about Artificial Intelligence! Spread knowledge, not germs! Join Our Discord channel, Learn AI Together: https://discord.gg/SVse4Sr Follow me for more AI content! Instagram: https://www.instagram.com/whats_ai/ LinkedIn: www.linkedin.com/in/whats-ai Twitter: https://twitter.com/Whats_AI Facebook: https://www.facebook.com/whats.artificial.intelligence/ The best courses to start and progress in AI: https://www.omologapps.com/whats-ai 0:00 Hey! Tap the Thumbs Up button and Subscribe to help me. You'll learn a lot of cool stuff, I promise. 0:41 Paper explained 5:40 Results Song credit: https://soundcloud.com/mattis-rodrigue/sans-titre
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from What's AI by Louis-François Bouchard · What's AI by Louis-François Bouchard · 35 of 60

1 What is Artificial intelligence? | Artificial Intelligence terms explained for everyone 1
What is Artificial intelligence? | Artificial Intelligence terms explained for everyone 1
What's AI by Louis-François Bouchard
2 What is Machine Learning? | Introduction to ML for beginners in a minute 2
What is Machine Learning? | Introduction to ML for beginners in a minute 2
What's AI by Louis-François Bouchard
3 What is Deep Learning | Introduction to DL for beginners in a minute 3
What is Deep Learning | Introduction to DL for beginners in a minute 3
What's AI by Louis-François Bouchard
4 What is Supervised Learning | Machine Learning basics explained for beginners 4
What is Supervised Learning | Machine Learning basics explained for beginners 4
What's AI by Louis-François Bouchard
5 What is Unsupervised Learning | Machine Learning basics explained for beginners 5
What is Unsupervised Learning | Machine Learning basics explained for beginners 5
What's AI by Louis-François Bouchard
6 What is Semi-Supervised Learning | Machine Learning basics explained for beginners 6
What is Semi-Supervised Learning | Machine Learning basics explained for beginners 6
What's AI by Louis-François Bouchard
7 What is Reinforcement Learning | Machine Learning basics explained for beginners 7
What is Reinforcement Learning | Machine Learning basics explained for beginners 7
What's AI by Louis-François Bouchard
8 What is Classification | Introduction to Machine Learning for beginners | The Most Used Terms 8
What is Classification | Introduction to Machine Learning for beginners | The Most Used Terms 8
What's AI by Louis-François Bouchard
9 What is Regression | Introduction to Machine Learning for beginners | The Most Used Terms 9
What is Regression | Introduction to Machine Learning for beginners | The Most Used Terms 9
What's AI by Louis-François Bouchard
10 What is Clustering | Introduction to Machine Learning for beginners | The Most Used Terms 10
What is Clustering | Introduction to Machine Learning for beginners | The Most Used Terms 10
What's AI by Louis-François Bouchard
11 What is Backpropagation | Artificial Intelligence & Machine Learning Basics for Beginners 11
What is Backpropagation | Artificial Intelligence & Machine Learning Basics for Beginners 11
What's AI by Louis-François Bouchard
12 What is NLP ? | Introduction to Natural Language Processing for Beginners | Machine Learning 12
What is NLP ? | Introduction to Natural Language Processing for Beginners | Machine Learning 12
What's AI by Louis-François Bouchard
13 Comparing AGI and Traditional AI: Now and Beyond
Comparing AGI and Traditional AI: Now and Beyond
What's AI by Louis-François Bouchard
14 Demystifying Neural Network: A Beginner's Guide to Machine Learning Fundamentals
Demystifying Neural Network: A Beginner's Guide to Machine Learning Fundamentals
What's AI by Louis-François Bouchard
15 Understanding Computer Vision: An Entry-Level Introduction to ML-Driven CV
Understanding Computer Vision: An Entry-Level Introduction to ML-Driven CV
What's AI by Louis-François Bouchard
16 Chatbots for Beginners: A Comprehensive Intro to Machine Learning Applications
Chatbots for Beginners: A Comprehensive Intro to Machine Learning Applications
What's AI by Louis-François Bouchard
17 What is Image Segmentation ? | Computer Vision & ML Techniques Explained for Beginners 17
What is Image Segmentation ? | Computer Vision & ML Techniques Explained for Beginners 17
What's AI by Louis-François Bouchard
18 Object Detection Clearly Explained for Everyone
Object Detection Clearly Explained for Everyone
What's AI by Louis-François Bouchard
19 What is a RNN ? | Introduction to Recurrent Neural Network FOR EVERYONE 19
What is a RNN ? | Introduction to Recurrent Neural Network FOR EVERYONE 19
What's AI by Louis-François Bouchard
20 What is Transfer Learning ? | Deep Learning Basics Explained for Beginners 20
What is Transfer Learning ? | Deep Learning Basics Explained for Beginners 20
What's AI by Louis-François Bouchard
21 Data Science Demystified - An Essential Introduction
Data Science Demystified - An Essential Introduction
What's AI by Louis-François Bouchard
22 Demystifying Data Mining - A Clear and Concise Explanation
Demystifying Data Mining - A Clear and Concise Explanation
What's AI by Louis-François Bouchard
23 Decoding Logistic Regression - A Simple and Comprehensive Explanation
Decoding Logistic Regression - A Simple and Comprehensive Explanation
What's AI by Louis-François Bouchard
24 What is the YOLO algorithm? | Introduction to You Only Look Once, Real Time Object Detection 24
What is the YOLO algorithm? | Introduction to You Only Look Once, Real Time Object Detection 24
What's AI by Louis-François Bouchard
25 AI or Human? What is the Turing Test
AI or Human? What is the Turing Test
What's AI by Louis-François Bouchard
26 Genetic Algorithms Demystified - How Algorithms Evolve
Genetic Algorithms Demystified - How Algorithms Evolve
What's AI by Louis-François Bouchard
27 What is Data Labeling ? | Prepare Your Data for ML and AI | Attaching meaning to digital data 27
What is Data Labeling ? | Prepare Your Data for ML and AI | Attaching meaning to digital data 27
What's AI by Louis-François Bouchard
28 Human Pose Estimation in Machine Learning Explained (2D & 3D)
Human Pose Estimation in Machine Learning Explained (2D & 3D)
What's AI by Louis-François Bouchard
29 What is Self-Supervised Learning ? | Will machines be able to learn like humans ? 29
What is Self-Supervised Learning ? | Will machines be able to learn like humans ? 29
What's AI by Louis-François Bouchard
30 What are GANs ? | Introduction to Generative Adversarial Networks | Face Generation & Editing - 30
What are GANs ? | Introduction to Generative Adversarial Networks | Face Generation & Editing - 30
What's AI by Louis-François Bouchard
31 Introduction to Energy-Based Learning | Yann LeCun Paper
Introduction to Energy-Based Learning | Yann LeCun Paper
What's AI by Louis-François Bouchard
32 The Science Behind Google Translate: Understanding Transformers
The Science Behind Google Translate: Understanding Transformers
What's AI by Louis-François Bouchard
33 Mastering CNNs in 5 Minutes | ConvNets Explained
Mastering CNNs in 5 Minutes | ConvNets Explained
What's AI by Louis-François Bouchard
34 Discover the Power of YOLOv4 - Real-Time Object Detection Simplified
Discover the Power of YOLOv4 - Real-Time Object Detection Simplified
What's AI by Louis-François Bouchard
Learn to Draw Real People using AI: Unveiling Future of Image-to-Image Translation
Learn to Draw Real People using AI: Unveiling Future of Image-to-Image Translation
What's AI by Louis-François Bouchard
36 AI Powers PAC-MAN - The Game Engine-Free Revolution
AI Powers PAC-MAN - The Game Engine-Free Revolution
What's AI by Louis-François Bouchard
37 This AI makes blurry faces look 60 times sharper! Introduction to PULSE: photo upsampling
This AI makes blurry faces look 60 times sharper! Introduction to PULSE: photo upsampling
What's AI by Louis-François Bouchard
38 Facebook's TransCoder: Converting Programming Languages with AI
Facebook's TransCoder: Converting Programming Languages with AI
What's AI by Louis-François Bouchard
39 Transforming Images to 3D Models with AI - Discover PIFuHD
Transforming Images to 3D Models with AI - Discover PIFuHD
What's AI by Louis-François Bouchard
40 Optimize Your ML Models - Avoid Underfitting and Overfitting
Optimize Your ML Models - Avoid Underfitting and Overfitting
What's AI by Louis-François Bouchard
41 Behind the Scenes - Disney's Secrets to High-Res Face Swaps
Behind the Scenes - Disney's Secrets to High-Res Face Swaps
What's AI by Louis-François Bouchard
42 Linear Regression in Machine Learning Explained in 5 Minutes
Linear Regression in Machine Learning Explained in 5 Minutes
What's AI by Louis-François Bouchard
43 Style Transfer Better Than GANs! Swapping Autoencoder Explained
Style Transfer Better Than GANs! Swapping Autoencoder Explained
What's AI by Louis-François Bouchard
44 Use AI to Remove Objects from Videos
Use AI to Remove Objects from Videos
What's AI by Louis-François Bouchard
45 OpenAI's Language Generator: GPT | The first AI Generating Text, Code, Websites...
OpenAI's Language Generator: GPT | The first AI Generating Text, Code, Websites...
What's AI by Louis-François Bouchard
46 Autocomplete Images With AI: image-GPT explained
Autocomplete Images With AI: image-GPT explained
What's AI by Louis-François Bouchard
47 Turning Reality into Art - AI That Cartoonizes Your Pictures and Videos
Turning Reality into Art - AI That Cartoonizes Your Pictures and Videos
What's AI by Louis-François Bouchard
48 From Portrait to Cartoon - Discover the Power of FreezeG
From Portrait to Cartoon - Discover the Power of FreezeG
What's AI by Louis-François Bouchard
49 Transfer clothes between photos using AI. From a single image!
Transfer clothes between photos using AI. From a single image!
What's AI by Louis-François Bouchard
50 Precise 3D Human Pose and Mesh Estimation from a Single RGB Image
Precise 3D Human Pose and Mesh Estimation from a Single RGB Image
What's AI by Louis-François Bouchard
51 Smart Navigation - How AI Robots Understand and Explore Environments
Smart Navigation - How AI Robots Understand and Explore Environments
What's AI by Louis-François Bouchard
52 Techfitlab Breaks Down Tesla Autopilot, AI, ML, and DL Complexities
Techfitlab Breaks Down Tesla Autopilot, AI, ML, and DL Complexities
What's AI by Louis-François Bouchard
53 ECCV 2020 Best Paper Award | RAFT: A New Deep Network Architecture For Optical Flow | WITH CODE
ECCV 2020 Best Paper Award | RAFT: A New Deep Network Architecture For Optical Flow | WITH CODE
What's AI by Louis-François Bouchard
54 Maximize Business Efficiency with AI / GPT Technology!
Maximize Business Efficiency with AI / GPT Technology!
What's AI by Louis-François Bouchard
55 AI Transforms Google Photos into Real-Life Scenes
AI Transforms Google Photos into Real-Life Scenes
What's AI by Louis-François Bouchard
56 Old Photo Restoration Using Deep Learning | 2020 Novel Approach Explained & Results
Old Photo Restoration Using Deep Learning | 2020 Novel Approach Explained & Results
What's AI by Louis-François Bouchard
57 This computer vision algorithm removes the water from underwater images !
This computer vision algorithm removes the water from underwater images !
What's AI by Louis-François Bouchard
58 DeepFakes in 5 minutes | Understand how deepfakes work and create your own!
DeepFakes in 5 minutes | Understand how deepfakes work and create your own!
What's AI by Louis-François Bouchard
59 A new brain-inspired intelligent system can drive a car using only 19 control neurons!
A new brain-inspired intelligent system can drive a car using only 19 control neurons!
What's AI by Louis-François Bouchard
60 Toonify: Turn Real Faces into Animated Disney Characters
Toonify: Turn Real Faces into Animated Disney Characters
What's AI by Louis-François Bouchard

The DeepFaceDrawing paper presents a technique for generating high-quality face images from rough or incomplete sketches using a deep learning framework. The technique uses a two-subnetwork architecture to learn a space of plausible face images and synthesize a face image from an input sketch.

Key Takeaways
  1. Read the DeepFaceDrawing paper
  2. Understand the component embedding module and feature mapping and synthesis module
  3. Apply the technique to generate face images from sketches
  4. Evaluate the results using qualitative and quantitative metrics
💡 The technique uses a two-subnetwork architecture to learn a space of plausible face images and synthesize a face image from an input sketch, allowing for high-quality face image generation from rough or incomplete sketches.

Related AI Lessons

I Spent Weeks Looking for a Research Gap Before I Realized I Was Searching the Wrong Way
Learn how to effectively find research gaps by changing your approach, a crucial skill for AI researchers and academics
Medium · AI
ICMI 2026 Reviews [D]
Learn how to interpret ICMI 2026 reviews and improve your paper's acceptance chances
Reddit r/MachineLearning
Workshop submission for main conference paper under review [D]
Learn how to navigate submitting a paper to a non-archival workshop before the final decision of a main conference like ECCV
Reddit r/MachineLearning
Kept context-switching between arxiv, OpenReview, GitHub, and HuggingFace for every paper, so I built this. Chrome extension + website with everything inline, plus citation graph + SPECTER2 neighbors. 3M papers, free, feedback welcome [P]
Streamline your research with a new Chrome extension and website that integrates 3M papers from arxiv, OpenReview, GitHub, and HuggingFace, including citation graphs and SPECTER2 neighbors, and provide feedback to improve it
Reddit r/MachineLearning

Chapters (3)

Hey! Tap the Thumbs Up button and Subscribe to help me. You'll learn a lot of co
0:41 Paper explained
5:40 Results
Up next
Beyond Big Vendors: ERP Systems Explained #shorts
Digital Transformation with Eric Kimberling
Watch →