Self-Supervised GANs

Connor Shorten · Advanced ·📄 Research Papers Explained ·6y ago

Skills: Reading ML Papers90%Research Methods80%RAG Basics70%Vector Stores60%RAG Evaluation60%

Key Takeaways

The video discusses self-supervised GANs, which use rotation angle classification to stabilize GAN training and avoid cyclic collapse, with tools such as GANs, batch normalization, gradient penalty, spectral normalization, and FID metric.

Full Transcript

[Music] this video will explain a study on using self supervised learning with Ganz to stabilize training and prevent catastrophic forgetting and cyclic problems with training Ganz so the headline idea is that this paper utilizes the rotation self supervision task to rival supervised Ganz where you use the class labels on Ganz and obviously we want to get away from doing supervised learning with Ganz because scans are useful unsupervised learning technique for feature extraction and also envision in general we have access to enormous image datasets and we can't possibly label all the data so this technique is going to improve image generation and the representation quality of when you use this as a technique to Train classification models so rotation classification is is one of the most popular self supervised learning tasks in addition to things like relative location colonization and I'm like super resolution and these jigsaw puzzle kind of things but so the idea is basically you take an image and then you either don't rotate it at all or you rotate it 90 180 or 270 degrees and then you'd have a network predict how much the image has been rotated so they're going to integrate this part of the rotation task within the Gantt framework so Ganz already work where the generator produces a fake image and then a real image is sampled from the data set and then the discriminator is gonna tell real fake based on these two but now what it's going to do is the images they're going to go into this pipeline and they're going to be rotated so the discriminator is classifying them as real or fake but it's also predicting the rotation of the image so the original gann objective is this loss where you have the expectation that the sample is from the data set and that you predicted that correctly or that it's from the generator and you predicted that correctly so what we're gonna do to modify it with the self supervision task is in the generator loss you add the successive discriminator in predicting the rotation on the generated image the X from PG means X sampled from the generator and then you know the prediction of the rotation angle you know the rotation equals are given this image and then on the discriminator side you the losses the sample from the data set and then the loss on this so the discriminator isn't actually updated with how successfully it predicts the rotation on the generator so for this reason it's called collaborative adversarial learning because the generator has the discriminators rotation success in its moss turn so that it's in the best interest of the generator to produce images that the discriminator can successfully classify the rotation angle oh so this is an interesting idea that they generated and discriminator our adversarial with respect to the real fake tasks and then they also have this other thing that they're doing together in a way so the motivation for this is catastrophic forgetting and deep learning and this image here isn't from Ganz it's from a classifier so what happens is if they train the model on the task of classifying with one class from the CFR ten data set and then they switch the class and every time they do this and switch the classes you see this downward spike as the discriminate urban as the classifier no longer has the kinds of representations to classify these images so see here it is probably classifying like cats images of cats and then it goes to dogs or something so in Ganz we have cyclic training this is one of the most common failure modes in addition to like a mode collapse and the discriminator I like dominating the generator so imagine that the generator the discriminator sends a signal and the generator jumps from this kind of distribution to this one and then the discriminator fails at this one because it's completely forgotten about this so it jumps back to this and then it continues this cycle forever and that's because the discriminator as the generator starts producing like this B set the discriminator is completely forgotten about a and then it learns all about B so the generator goes to C and then some and then it goes back to a to fool the discriminator because the discriminator can't continually learn the changing distribution of the generated samples compared to the real data set so additional motivation is let's say that you've got the optimal discriminator at convergence where the generated data distribution is identical to the original data distribution the discriminator at this point just outputs hi-five regardless to the input because you know if you predict a generated sample it overly confident predictions will kill it so it'll just do 0.5 like I don't know I don't know every time so when this happens if you take these representations from the discriminator and use it for as features for a classification model they get really bad because the discriminator no longer has a sense of visual representation because it's just outputting 0.5 regardless of the input so these are the initial results of the self supervised gain and this modulation and the modification to batch normalization compared to conditional gans and then unconditional gains so we see already that the self supervised gain is outperforming the unconditional again which is just again without the rotation task and the conditional gannets the gain but with the class labels so now they're gonna scale it up they're going to use the big gain model which is an enormous general diversity of network model they're going to use an enormous amount of computational resources with 128 of the Google TP uv3 pods and then they're going to increase the batch size up to 2048 so using this they're also going to search for some hyper parameters on the gradient penalty and spectral normalization which control D like Lipschitz constant of the discriminator basically a function to make sure I'm in a limit like a normalization method to make sure the discriminator doesn't become too strong for the generator and then they also iterate on the hyper parameter of how many training steps should they discriminate or have per generator set because the discriminator is so heavily normalized with these gradient penalties and spectral mobilizations that sometimes you might want to run it for twice to catch up so these are the some of the samples found from their best model in the image net these are pretty convincing results given that there was no labels provided to the network and then this is the they use the F ID metric for you know presenting the quality of their generated samples and on this metric the lower score is a better score so this just shows like as it's training you know you sample some of it and then you calculate the score so you see on image net how the unconditional gain collapses and this is like a failure mode this would be a really high score and then this is the rotation quality showing that you know when you when you train the again and then you take the discriminator is visual representations and use that to train a classifier you see that when you use rotation only the self supervised task of this one you get about twenty twenty percent unconditional does horribly and then the conditional Gann doesn't perform as well as the self supervised gain for the task of representation quality which may really be the most in most important application of Ganz is whether you can use it too for unsupervised learning for classifiers object detectors and miscellaneous computer vision tests so this is the C for ten classification accuracy about 80% pretty similar to the others though on C part ten but on imagenet it's much different it's much higher than unconditional or rotation only and a little bit higher than conditional so this is the comparison of the self supervised game with other unsupervised learning techniques and this began model was recently improved with a big can model as well and does outperform this and it's just interesting to see all these different techniques that people are coming up with to make use of unlabeled data for representation learning so thanks for watching this video on self supervised gann begins please subscribe to Henry AI labs for more deep learning videos

Original Description

Self-Supervised learning describes a set of tasks that can learn useful representations without manual tuning. This paper explores using self-supervision through rotation angle classification to stabilize GAN training and avoid cyclic collapse through catastrophic forgetting in the discriminator. Thanks for watching! Please Subscribe! Paper Link: https://arxiv.org/pdf/1811.11212.pdf

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Connor Shorten · Connor Shorten · 46 of 60

← Previous Next →

DeepWalk Explained

DeepWalk Explained

Inception Network Explained

Inception Network Explained

Progressive Growing of GANs Explained

Progressive Growing of GANs Explained

Improved Techniques for Training GANs

Improved Techniques for Training GANs

Word2Vec Explained

Word2Vec Explained

Must Read Papers on GANs

Must Read Papers on GANs

Unsupervised Feature Learning

Unsupervised Feature Learning

Self-Supervised GANs

Self-Supervised GANs

Embedding Graphs with Deep Learning

Embedding Graphs with Deep Learning

Transfer Learning in GANs

Transfer Learning in GANs

ReLU Activation Function

ReLU Activation Function

AC-GAN Explained

AC-GAN Explained

SimGAN Explained

SimGAN Explained

DC-GAN Explained!

DC-GAN Explained!

ResNet Explained!

ResNet Explained!

Graph Convolutional Networks

Graph Convolutional Networks

Neural Architecture Search

Neural Architecture Search

Video Classification with Deep Learning

Video Classification with Deep Learning

BigGANs in Data Augmentation

BigGANs in Data Augmentation

Introduction to Deep Learning

Introduction to Deep Learning

EfficientNet Explained!

EfficientNet Explained!

Self-Attention GAN

Self-Attention GAN

Curriculum Learning in Deep Neural Networks

Curriculum Learning in Deep Neural Networks

Deep Learning Podcast #1 | Edward Dixon | Stochastic Weight Averaging

Deep Learning Podcast #1 | Edward Dixon | Stochastic Weight Averaging

Deep Compression

Deep Compression

Skin Cancer Classification with Deep Learning

Skin Cancer Classification with Deep Learning

Deep Learning Podcast #2 | Edward Peake | Deep Learning in Medical Imaging

Deep Learning Podcast #2 | Edward Peake | Deep Learning in Medical Imaging

The Lottery Ticket Hypothesis Explained!

The Lottery Ticket Hypothesis Explained!

GauGAN Explained!

GauGAN Explained!

AutoML with Hyperband

AutoML with Hyperband

DL Podcast #3 | Yannic Kilcher | Population-Based Search

DL Podcast #3 | Yannic Kilcher | Population-Based Search

Weakly Supervised Pretraining

Weakly Supervised Pretraining

Image Data Augmentation for Deep Learning

Image Data Augmentation for Deep Learning

Unsupervised Data Augmentation

Unsupervised Data Augmentation

Wide ResNet Explained!

Wide ResNet Explained!

RevNet: Backpropagation without Storing Activations

RevNet: Backpropagation without Storing Activations

GANs with Fewer Labels

GANs with Fewer Labels

BigBiGAN Unsupervised Learning!

BigBiGAN Unsupervised Learning!

Self-Supervised Learning

Self-Supervised Learning

Multi-Task Self-Supervised Learning

Multi-Task Self-Supervised Learning

Self-Supervised GANs

Self-Supervised GANs

Population Based Training

Population Based Training

Show, Attend and Tell

Show, Attend and Tell

Siamese Neural Networks

Siamese Neural Networks

WaveGAN Explained!

WaveGAN Explained!

VAE-GAN Explained!

VAE-GAN Explained!

Evolution in Neural Architecture Search!

Evolution in Neural Architecture Search!

AI Research Weekly Update August 18th, 2019

AI Research Weekly Update August 18th, 2019

Weight Agnostic Neural Networks Explained!

Weight Agnostic Neural Networks Explained!

AI Research Weekly Update August 25th, 2019

AI Research Weekly Update August 25th, 2019

Neuroevolution of Augmenting Topologies (NEAT)

Neuroevolution of Augmenting Topologies (NEAT)

AI Research Weekly Update September 1st, 2019

AI Research Weekly Update September 1st, 2019

Randomly Wired Neural Networks

Randomly Wired Neural Networks

This video teaches how self-supervised GANs can be used to stabilize GAN training and improve representation quality, with a focus on rotation angle classification and modulation mechanisms. The video also discusses the evaluation of self-supervised GANs using metrics such as FID and classification accuracy.

Key Takeaways

Implement self-supervised GANs using rotation angle classification
Use modulation mechanisms to prevent the discriminator from becoming too strong
Evaluate the performance of self-supervised GANs using metrics such as FID and classification accuracy
Compare the performance of self-supervised GANs with unconditional and conditional GANs
Use batch normalization, gradient penalty, and spectral normalization to improve the stability of GAN training

💡 Self-supervised GANs can outperform unconditional and conditional GANs on representation quality and achieve high classification accuracy on datasets such as ImageNet.

🔒 Pro feature: Ask AI to explain this lesson →

More on: Reading ML Papers

View skill →

Automatic Literature Review with GPT-3 - I embedded and indexed all of arXiv into a search engine!

Automatic Literature Review with GPT-3 - I embedded and indexed all of arXiv into a search engine!

Marcos Lopez Caniego - ESASky's JupyterLab widget| JupyterCon 2020

Marcos Lopez Caniego - ESASky's JupyterLab widget| JupyterCon 2020

Obsidian Zotero Integration Plugin | Streamline Your Research Paper Workflow 📝️

Obsidian Zotero Integration Plugin | Streamline Your Research Paper Workflow 📝️

This FULLY FREE Research Agent can BUILD Reports in Minutes!!!

This FULLY FREE Research Agent can BUILD Reports in Minutes!!!

Claude 3.7 Sonnet API | Build a Research Assistant

Claude 3.7 Sonnet API | Build a Research Assistant

I Built An Obsidian AI Research Assistant with Oz...

I Built An Obsidian AI Research Assistant with Oz...

Related AI Lessons

I Spent Weeks Looking for a Research Gap Before I Realized I Was Searching the Wrong Way

Learn how to effectively find research gaps by changing your approach, a crucial skill for AI researchers and academics

ICMI 2026 Reviews [D]

Learn how to interpret ICMI 2026 reviews and improve your paper's acceptance chances

Reddit r/MachineLearning

Workshop submission for main conference paper under review [D]

Learn how to navigate submitting a paper to a non-archival workshop before the final decision of a main conference like ECCV

Reddit r/MachineLearning

Kept context-switching between arxiv, OpenReview, GitHub, and HuggingFace for every paper, so I built this. Chrome extension + website with everything inline, plus citation graph + SPECTER2 neighbors. 3M papers, free, feedback welcome [P]

Streamline your research with a new Chrome extension and website that integrates 3M papers from arxiv, OpenReview, GitHub, and HuggingFace, including citation graphs and SPECTER2 neighbors, and provide feedback to improve it

Reddit r/MachineLearning

Beyond Big Vendors: ERP Systems Explained #shorts

Digital Transformation with Eric Kimberling