GANs with Fewer Labels

Connor Shorten · Advanced ·🧬 Deep Learning ·6y ago

Key Takeaways

The video discusses a research paper from Google AI on making Generative Adversarial Networks (GANs) work with fewer labels, utilizing semi-supervised and self-supervised learning techniques to reduce the need for manual labeling.

Full Transcript

[Music] this video will present a paper from Google AI research on making generative adversarial networks work with fewer labels so for a quick recap Darren of adversarial networks taken random noise and then produce images such that they can generate novel data points this works by being trained against a another deep neural network a discriminator which classifies samples is real or fake and then this process iterates until eventually the generator can construct brand new images from the data set so the problem is that the state-of-the-art ganth require labels it's a supervised learning technique and this started with the conditional generic store networks where you took a one hot encoded class vector and embedded it with the generated image of the real image of the discriminator and then with the latent code on the generator so the key idea in the presentation in this video is that this technique is able to match the big Gann which is the state of the art on generating images on image net with just 10% of the labels and then it can outperform this technique using 20% of labels and this is super important because in the wild there is tons of data on things like Instagram Google Images and YouTube and it's really impossible to scale up labeling these data sets for the future of representation learning generation and computer vision so these this presents the overall results from the different techniques they test and this vertical bar indicates the state of the art using labels so we're gonna focus primarily on these s to gain and non techniques and discuss NS 3 in and discuss how this works so again these are some of the generated this is from the technique the S 3 again and this is from the began so again these are both 128 128 image net generate examples so the inspiration is self supervised gain and this is a paper that stabilized the training of generative adversarial networks by forcing the discriminating them 90 degrees 180 270 or not at all and then having a discriminative predict the rotation like how much was the image rotated and this technique is gaining enormous popularity it's known as self supervised learning and it's really like an unsupervised learning thing but it's sort of denoted this way because the task sort of lends itself to being modified in this way for a representation learning so in images it's done typically through rotations predicting the permutations of like a jigsaw puzzle by cropping out patches and then in language it's way more popular in language in language it's when you take like the some word in the sentence and then you predict the context so self supervised learning is really really popular in language models and it's becoming much more popular in image models as well so here's the idea you can get a combined self supervised learning with Ganz so one way of doing this might be to have a so you want to have your class label Y that you can embed in the discriminator and the generator and this will result in a much better again this has been shown that adding labels to Ganz really really improves them so one way of doing this might be to take the image and then train the self supervised model particular tations and then use these features to cluster the images into classes so use like k-means clustering to form like 50 auxilary classes based on the features learn from this self supervised learning task so another idea would be to even if you already have the Y labels is just to regularize the discriminators features with requiring the discriminator to also predict the rotation loss this is what is done in self supervised gans so here's another idea is semi-supervised learning so I'm sorry the terminology is so similar but semi-supervised learning isn't the same self supervised so much supervises this idea of you have some label data in your data set so let's say you have your 100% data samples you have ten percent are labeled so one way of using semi semi supervised learning with ganz is you have the discriminators features are used to predict the class label of the either real image or the image that comes from the generator so you would you you would train this classifier that is also using sharing these same features that classify the image is real or fake in the first place and use this to derive the late and then you would have a loss for the samples that are labeled so for the 10% of images that are labeled you would have some feedback for this part of the network so here's the idea of the of this paper you're going to combine the idea of predicting the class what the discriminator features and then you're also going to regularize the features by having it predict this rotation loss so the discriminator is regularized with the self supervision term it's still predicting real afaik and then it's going to predict the class label and then embed that class label into the conditioning layer so over all combining these three different loss function makes up this kind of thing this section here is the normal Gann part I mean yeah generally like you can see how it's a pretty complex thing but overall it it works out in this way so these are the results using different percentages of label data so it goes five percent temperature and the F ID met these are like two metrics of evaluating generated images on frequently used on imagenet and to compare Ganz in the literature so you see that it doesn't really look like it saturates too much and the F ID score it definitely saturates but the inception score seems to still have a bit of a big improvement it's almost five full points higher so again this is the bar chart that shows how the this technique improves as you add more labeled data to the semi-supervised learning component of it so remember they have the so supervised rotation prediction that is like normalizing the discriminator features and then you have this semi-supervised learning part which is used to predict the class conditional embedding in the discriminator model so these are some of the results of the model to showing the latent space interpolation works well and then this is when they scale it up for a 256 by 256 target output so some concluding thoughts is really interesting to see semi-supervised learning and ganz being merged together and it also would be really cool to see this technique that was presented her pretty recently as well where they normalize they have the data that's unlabeled and then they augment it using like rotations and whatnot then they try to make sure that these class predictions are similar to each other so it would be interesting to see you know add a fourth loss to this network as well so thanks for watching this video on making generative adversarial networks work with fewer late labels please describe to Henri AI allows for more deep learning videos

Original Description

State of the art GAN results using only 20% of the labels! This is a great technique utilizing semi-supervised learning and self-supervised learning to dramatically reduce the need for manual labeling in training Generative Adversarial Networks! Please Subscribe! Thanks for watching! Paper Link: https://arxiv.org/pdf/1903.02271.pdf "High-Fidelity Image Generation with Fewer Labels"
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Connor Shorten · Connor Shorten · 42 of 60

1 DenseNets
DenseNets
Connor Shorten
2 DeepWalk Explained
DeepWalk Explained
Connor Shorten
3 Inception Network Explained
Inception Network Explained
Connor Shorten
4 StackGAN
StackGAN
Connor Shorten
5 StyleGAN
StyleGAN
Connor Shorten
6 Progressive Growing of GANs Explained
Progressive Growing of GANs Explained
Connor Shorten
7 Improved Techniques for Training GANs
Improved Techniques for Training GANs
Connor Shorten
8 Word2Vec Explained
Word2Vec Explained
Connor Shorten
9 Must Read Papers on GANs
Must Read Papers on GANs
Connor Shorten
10 Unsupervised Feature Learning
Unsupervised Feature Learning
Connor Shorten
11 Self-Supervised GANs
Self-Supervised GANs
Connor Shorten
12 Embedding Graphs with Deep Learning
Embedding Graphs with Deep Learning
Connor Shorten
13 Transfer Learning in GANs
Transfer Learning in GANs
Connor Shorten
14 ReLU Activation Function
ReLU Activation Function
Connor Shorten
15 AC-GAN Explained
AC-GAN Explained
Connor Shorten
16 SimGAN Explained
SimGAN Explained
Connor Shorten
17 DC-GAN Explained!
DC-GAN Explained!
Connor Shorten
18 ResNet Explained!
ResNet Explained!
Connor Shorten
19 Graph Convolutional Networks
Graph Convolutional Networks
Connor Shorten
20 Neural Architecture Search
Neural Architecture Search
Connor Shorten
21 Henry AI Labs
Henry AI Labs
Connor Shorten
22 Video Classification with Deep Learning
Video Classification with Deep Learning
Connor Shorten
23 BigGANs in Data Augmentation
BigGANs in Data Augmentation
Connor Shorten
24 Introduction to Deep Learning
Introduction to Deep Learning
Connor Shorten
25 EfficientNet Explained!
EfficientNet Explained!
Connor Shorten
26 Self-Attention GAN
Self-Attention GAN
Connor Shorten
27 Curriculum Learning in Deep Neural Networks
Curriculum Learning in Deep Neural Networks
Connor Shorten
28 Deep Learning Podcast #1 | Edward Dixon | Stochastic Weight Averaging
Deep Learning Podcast #1 | Edward Dixon | Stochastic Weight Averaging
Connor Shorten
29 Deep Compression
Deep Compression
Connor Shorten
30 Skin Cancer Classification with Deep Learning
Skin Cancer Classification with Deep Learning
Connor Shorten
31 Deep Learning Podcast #2 | Edward Peake | Deep Learning in Medical Imaging
Deep Learning Podcast #2 | Edward Peake | Deep Learning in Medical Imaging
Connor Shorten
32 The Lottery Ticket Hypothesis Explained!
The Lottery Ticket Hypothesis Explained!
Connor Shorten
33 SqueezeNet
SqueezeNet
Connor Shorten
34 GauGAN Explained!
GauGAN Explained!
Connor Shorten
35 AutoML with Hyperband
AutoML with Hyperband
Connor Shorten
36 DL Podcast #3 | Yannic Kilcher | Population-Based Search
DL Podcast #3 | Yannic Kilcher | Population-Based Search
Connor Shorten
37 Weakly Supervised Pretraining
Weakly Supervised Pretraining
Connor Shorten
38 Image Data Augmentation for Deep Learning
Image Data Augmentation for Deep Learning
Connor Shorten
39 Unsupervised Data Augmentation
Unsupervised Data Augmentation
Connor Shorten
40 Wide ResNet Explained!
Wide ResNet Explained!
Connor Shorten
41 RevNet: Backpropagation without Storing Activations
RevNet: Backpropagation without Storing Activations
Connor Shorten
GANs with Fewer Labels
GANs with Fewer Labels
Connor Shorten
43 BigBiGAN Unsupervised Learning!
BigBiGAN Unsupervised Learning!
Connor Shorten
44 Self-Supervised Learning
Self-Supervised Learning
Connor Shorten
45 Multi-Task Self-Supervised Learning
Multi-Task Self-Supervised Learning
Connor Shorten
46 Self-Supervised GANs
Self-Supervised GANs
Connor Shorten
47 Population Based Training
Population Based Training
Connor Shorten
48 Show, Attend and Tell
Show, Attend and Tell
Connor Shorten
49 Siamese Neural Networks
Siamese Neural Networks
Connor Shorten
50 WaveGAN Explained!
WaveGAN Explained!
Connor Shorten
51 VAE-GAN Explained!
VAE-GAN Explained!
Connor Shorten
52 Evolution in Neural Architecture Search!
Evolution in Neural Architecture Search!
Connor Shorten
53 AI Research Weekly Update August 18th, 2019
AI Research Weekly Update August 18th, 2019
Connor Shorten
54 Weight Agnostic Neural Networks Explained!
Weight Agnostic Neural Networks Explained!
Connor Shorten
55 AI Research Weekly Update August 25th, 2019
AI Research Weekly Update August 25th, 2019
Connor Shorten
56 Neuroevolution of Augmenting Topologies (NEAT)
Neuroevolution of Augmenting Topologies (NEAT)
Connor Shorten
57 CoDeepNEAT
CoDeepNEAT
Connor Shorten
58 AI Research Weekly Update September 1st, 2019
AI Research Weekly Update September 1st, 2019
Connor Shorten
59 Randomly Wired Neural Networks
Randomly Wired Neural Networks
Connor Shorten
60 Genetic CNN
Genetic CNN
Connor Shorten

This video discusses a research paper on making GANs work with fewer labels using semi-supervised and self-supervised learning techniques, and demonstrates how to implement these techniques to achieve state-of-the-art results with limited labeled data.

Key Takeaways
  1. Implement semi-supervised learning with GANs
  2. Use self-supervised learning to regularize discriminator features
  3. Combine semi-supervised and self-supervised learning with GANs
  4. Evaluate GAN performance using metrics such as Inception score and FID score
💡 Combining semi-supervised and self-supervised learning techniques can significantly improve GAN performance with limited labeled data.

Related Reads

📰
Want to get started with deep learning
Get started with deep learning by leveraging resources like Andrew Karpathy's playlist and frameworks such as TensorFlow or PyTorch
Reddit r/deeplearning
📰
Building a Deepfake Detector From Scratch — What Nobody Tells You
Learn to build a deepfake detector from scratch and understand the challenges involved in detecting AI-generated fake media
Medium · Deep Learning
📰
Unfolding the Meandering Path: High-Dimensional Invariance and the Flat 2D Plane of Neural…
Learn about high-dimensional invariance and its relation to the flat 2D plane of neural networks, and how to apply these concepts to improve model performance
Medium · Deep Learning
📰
Implementing Neural Style Transfer from Scratch: The Project That Started It All
Learn to implement Neural Style Transfer from scratch and understand its significance in deep learning
Medium · Deep Learning
Up next
Image Classification with ml5.js
The Coding Train
Watch →