Transfer Learning in GANs

Connor Shorten · Beginner ·🧬 Deep Learning ·7y ago

Key Takeaways

The video discusses the application of transfer learning to Generative Adversarial Networks (GANs), specifically exploring its potential to save computation and enable training on small datasets. It highlights the concept of transferring a pre-trained model from a source domain to a target domain, allowing for the generation of new data that resembles the target domain.

Full Transcript

this video is going to introduce the idea of applying transfer learning to generative adversarial networks the transfer learning is one of the most popular ideas in classification tasks such as classifying like a dog or a cat and it hasn't really been applied to generative adversarial networks if this is developed and successful it'll be able to save people tons of computation because you could take a pre train model from something like open AI or deepmind and just plug and play this pre train model into your data set in addition to saving tons of computation this also helped you to train Ganz on small data sets which is really useful in application domains such as medical image analysis so from a high-level overview this is the idea you have the source domain which is the original data set such as image net the Train on and then you transfer this model into some new target so in the context of ganz this means it starts out generating image net images and then it'll transfer in start generating it'll start generating the image net again but then the discriminator will have the new custom data set and it'll start to mold it into the to generating this new data set so this is just a picture of showing how transfer learning is used in classification tasks this is a really popular paper where they repurposed classification layers for the task of object localization so here are some of the interesting things that the researchers and transfer and Gantt study they want to know the impact of the target domain size you know like if you only have a thousand images in your target domain will this still work then they want to know the distance between the source and the target so let's say you go from image net which is pictures of like cats dogs and airplanes stuff like that and let's say you want to translate transition into like liver lesion images this is like totally different so you'd want to see if the distance between these two data sets is going to impact the quality of transfer learning with generative networks and then again you want to look at conditional Ganz so conditional Ganz of this idea that you can inject a class label to control degan output but it's similar to transfer learning with regular classification tasks the label distribution is going to be totally different so you want to see if there's some way that you can labels when you transfer so this quickly just shows how the DC can that does Elsa bedrooms which is 64 by 64 requires 36 million parameters 36 million parameters in a deep model means you need a lot of data to avoid overfitting so evaluating ganz is one of the most difficult things with gain research so the researchers in this paper shoes use the fresh a inception distance which is given here and then independent Wasserstein critic so their experiments they use the improved Wasserstein Gann model which has ResNet layers in both the generator and the discriminator they target 64 by 64 images and they train with a batch of 128 images for 50k iterations so this table right here shows you know the test if training the generator only from scratch or just pre training the generator and then they do the inverse stage test you know pre training only on the discriminator versus training from scratch and then surprisingly they find that pre training works if you're going to only do one of the other it works better than this committer but in all cases it actually does work best to transfer with both the Jenner & discriminator so this picture here kind of gives you an idea of how transferring and Gans looks you see from scratch compared to the pre trained how it's kind of like a more coherent transition now this image shows the transfer process from different source data sets so they're all transferring into kitchens but the one on the top is from scratch I think the one below is from image net and you know this is the different data sets that are describing the paper and this one is a more complete thing that shows the different data sets they test is the source data sets and then the results when they transfer this shows more of that so in conclusion transfer learning and ganz is a really interesting idea it can save computation and data they find that transferring does work well and additionally they find that datasets with dense classes so like maybe a data set that has a bunch of dog breeds would perform better than a dataset in the source dataset that has like a ton of different things so thanks for watching this video please subscribe for more deep learning videos [Music] you

Original Description

This video explains how Transfer Learning can be applied to Generative Modeling! Thanks for watching, please subscribe!
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Connor Shorten · Connor Shorten · 13 of 60

1 DenseNets
DenseNets
Connor Shorten
2 DeepWalk Explained
DeepWalk Explained
Connor Shorten
3 Inception Network Explained
Inception Network Explained
Connor Shorten
4 StackGAN
StackGAN
Connor Shorten
5 StyleGAN
StyleGAN
Connor Shorten
6 Progressive Growing of GANs Explained
Progressive Growing of GANs Explained
Connor Shorten
7 Improved Techniques for Training GANs
Improved Techniques for Training GANs
Connor Shorten
8 Word2Vec Explained
Word2Vec Explained
Connor Shorten
9 Must Read Papers on GANs
Must Read Papers on GANs
Connor Shorten
10 Unsupervised Feature Learning
Unsupervised Feature Learning
Connor Shorten
11 Self-Supervised GANs
Self-Supervised GANs
Connor Shorten
12 Embedding Graphs with Deep Learning
Embedding Graphs with Deep Learning
Connor Shorten
Transfer Learning in GANs
Transfer Learning in GANs
Connor Shorten
14 ReLU Activation Function
ReLU Activation Function
Connor Shorten
15 AC-GAN Explained
AC-GAN Explained
Connor Shorten
16 SimGAN Explained
SimGAN Explained
Connor Shorten
17 DC-GAN Explained!
DC-GAN Explained!
Connor Shorten
18 ResNet Explained!
ResNet Explained!
Connor Shorten
19 Graph Convolutional Networks
Graph Convolutional Networks
Connor Shorten
20 Neural Architecture Search
Neural Architecture Search
Connor Shorten
21 Henry AI Labs
Henry AI Labs
Connor Shorten
22 Video Classification with Deep Learning
Video Classification with Deep Learning
Connor Shorten
23 BigGANs in Data Augmentation
BigGANs in Data Augmentation
Connor Shorten
24 Introduction to Deep Learning
Introduction to Deep Learning
Connor Shorten
25 EfficientNet Explained!
EfficientNet Explained!
Connor Shorten
26 Self-Attention GAN
Self-Attention GAN
Connor Shorten
27 Curriculum Learning in Deep Neural Networks
Curriculum Learning in Deep Neural Networks
Connor Shorten
28 Deep Learning Podcast #1 | Edward Dixon | Stochastic Weight Averaging
Deep Learning Podcast #1 | Edward Dixon | Stochastic Weight Averaging
Connor Shorten
29 Deep Compression
Deep Compression
Connor Shorten
30 Skin Cancer Classification with Deep Learning
Skin Cancer Classification with Deep Learning
Connor Shorten
31 Deep Learning Podcast #2 | Edward Peake | Deep Learning in Medical Imaging
Deep Learning Podcast #2 | Edward Peake | Deep Learning in Medical Imaging
Connor Shorten
32 The Lottery Ticket Hypothesis Explained!
The Lottery Ticket Hypothesis Explained!
Connor Shorten
33 SqueezeNet
SqueezeNet
Connor Shorten
34 GauGAN Explained!
GauGAN Explained!
Connor Shorten
35 AutoML with Hyperband
AutoML with Hyperband
Connor Shorten
36 DL Podcast #3 | Yannic Kilcher | Population-Based Search
DL Podcast #3 | Yannic Kilcher | Population-Based Search
Connor Shorten
37 Weakly Supervised Pretraining
Weakly Supervised Pretraining
Connor Shorten
38 Image Data Augmentation for Deep Learning
Image Data Augmentation for Deep Learning
Connor Shorten
39 Unsupervised Data Augmentation
Unsupervised Data Augmentation
Connor Shorten
40 Wide ResNet Explained!
Wide ResNet Explained!
Connor Shorten
41 RevNet: Backpropagation without Storing Activations
RevNet: Backpropagation without Storing Activations
Connor Shorten
42 GANs with Fewer Labels
GANs with Fewer Labels
Connor Shorten
43 BigBiGAN Unsupervised Learning!
BigBiGAN Unsupervised Learning!
Connor Shorten
44 Self-Supervised Learning
Self-Supervised Learning
Connor Shorten
45 Multi-Task Self-Supervised Learning
Multi-Task Self-Supervised Learning
Connor Shorten
46 Self-Supervised GANs
Self-Supervised GANs
Connor Shorten
47 Population Based Training
Population Based Training
Connor Shorten
48 Show, Attend and Tell
Show, Attend and Tell
Connor Shorten
49 Siamese Neural Networks
Siamese Neural Networks
Connor Shorten
50 WaveGAN Explained!
WaveGAN Explained!
Connor Shorten
51 VAE-GAN Explained!
VAE-GAN Explained!
Connor Shorten
52 Evolution in Neural Architecture Search!
Evolution in Neural Architecture Search!
Connor Shorten
53 AI Research Weekly Update August 18th, 2019
AI Research Weekly Update August 18th, 2019
Connor Shorten
54 Weight Agnostic Neural Networks Explained!
Weight Agnostic Neural Networks Explained!
Connor Shorten
55 AI Research Weekly Update August 25th, 2019
AI Research Weekly Update August 25th, 2019
Connor Shorten
56 Neuroevolution of Augmenting Topologies (NEAT)
Neuroevolution of Augmenting Topologies (NEAT)
Connor Shorten
57 CoDeepNEAT
CoDeepNEAT
Connor Shorten
58 AI Research Weekly Update September 1st, 2019
AI Research Weekly Update September 1st, 2019
Connor Shorten
59 Randomly Wired Neural Networks
Randomly Wired Neural Networks
Connor Shorten
60 Genetic CNN
Genetic CNN
Connor Shorten

This video introduces the concept of transfer learning in GANs, exploring its potential to save computation and enable training on small datasets. It discusses the application of transfer learning to GANs, including the use of pre-trained models and the evaluation of transfer learning using metrics such as inception distance and Wasserstein critic.

Key Takeaways
  1. Understand the concept of transfer learning and its application to GANs
  2. Learn how to use pre-trained models for transfer learning in GANs
  3. Evaluate the effectiveness of transfer learning using metrics such as inception distance and Wasserstein critic
  4. Apply transfer learning to GANs for specific use cases, such as medical image analysis
💡 Transfer learning can be effectively applied to GANs, enabling the generation of new data that resembles the target domain, and can save computation and enable training on small datasets.

Related AI Lessons

Want to get started with deep learning
Get started with deep learning by leveraging resources like Andrew Karpathy's playlist and frameworks such as TensorFlow or PyTorch
Reddit r/deeplearning
Building a Deepfake Detector From Scratch — What Nobody Tells You
Learn to build a deepfake detector from scratch and understand the challenges involved in detecting AI-generated fake media
Medium · Deep Learning
Unfolding the Meandering Path: High-Dimensional Invariance and the Flat 2D Plane of Neural…
Learn about high-dimensional invariance and its relation to the flat 2D plane of neural networks, and how to apply these concepts to improve model performance
Medium · Deep Learning
Implementing Neural Style Transfer from Scratch: The Project That Started It All
Learn to implement Neural Style Transfer from scratch and understand its significance in deep learning
Medium · Deep Learning
Up next
Image Classification with ml5.js
The Coding Train
Watch →