Transfer Learning in GANs
Key Takeaways
The video discusses the application of transfer learning to Generative Adversarial Networks (GANs), specifically exploring its potential to save computation and enable training on small datasets. It highlights the concept of transferring a pre-trained model from a source domain to a target domain, allowing for the generation of new data that resembles the target domain.
Full Transcript
this video is going to introduce the idea of applying transfer learning to generative adversarial networks the transfer learning is one of the most popular ideas in classification tasks such as classifying like a dog or a cat and it hasn't really been applied to generative adversarial networks if this is developed and successful it'll be able to save people tons of computation because you could take a pre train model from something like open AI or deepmind and just plug and play this pre train model into your data set in addition to saving tons of computation this also helped you to train Ganz on small data sets which is really useful in application domains such as medical image analysis so from a high-level overview this is the idea you have the source domain which is the original data set such as image net the Train on and then you transfer this model into some new target so in the context of ganz this means it starts out generating image net images and then it'll transfer in start generating it'll start generating the image net again but then the discriminator will have the new custom data set and it'll start to mold it into the to generating this new data set so this is just a picture of showing how transfer learning is used in classification tasks this is a really popular paper where they repurposed classification layers for the task of object localization so here are some of the interesting things that the researchers and transfer and Gantt study they want to know the impact of the target domain size you know like if you only have a thousand images in your target domain will this still work then they want to know the distance between the source and the target so let's say you go from image net which is pictures of like cats dogs and airplanes stuff like that and let's say you want to translate transition into like liver lesion images this is like totally different so you'd want to see if the distance between these two data sets is going to impact the quality of transfer learning with generative networks and then again you want to look at conditional Ganz so conditional Ganz of this idea that you can inject a class label to control degan output but it's similar to transfer learning with regular classification tasks the label distribution is going to be totally different so you want to see if there's some way that you can labels when you transfer so this quickly just shows how the DC can that does Elsa bedrooms which is 64 by 64 requires 36 million parameters 36 million parameters in a deep model means you need a lot of data to avoid overfitting so evaluating ganz is one of the most difficult things with gain research so the researchers in this paper shoes use the fresh a inception distance which is given here and then independent Wasserstein critic so their experiments they use the improved Wasserstein Gann model which has ResNet layers in both the generator and the discriminator they target 64 by 64 images and they train with a batch of 128 images for 50k iterations so this table right here shows you know the test if training the generator only from scratch or just pre training the generator and then they do the inverse stage test you know pre training only on the discriminator versus training from scratch and then surprisingly they find that pre training works if you're going to only do one of the other it works better than this committer but in all cases it actually does work best to transfer with both the Jenner & discriminator so this picture here kind of gives you an idea of how transferring and Gans looks you see from scratch compared to the pre trained how it's kind of like a more coherent transition now this image shows the transfer process from different source data sets so they're all transferring into kitchens but the one on the top is from scratch I think the one below is from image net and you know this is the different data sets that are describing the paper and this one is a more complete thing that shows the different data sets they test is the source data sets and then the results when they transfer this shows more of that so in conclusion transfer learning and ganz is a really interesting idea it can save computation and data they find that transferring does work well and additionally they find that datasets with dense classes so like maybe a data set that has a bunch of dog breeds would perform better than a dataset in the source dataset that has like a ton of different things so thanks for watching this video please subscribe for more deep learning videos [Music] you
Original Description
This video explains how Transfer Learning can be applied to Generative Modeling! Thanks for watching, please subscribe!
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
Playlist
Uploads from Connor Shorten · Connor Shorten · 13 of 60
1
2
3
4
5
6
7
8
9
10
11
12
▶
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
DenseNets
Connor Shorten
DeepWalk Explained
Connor Shorten
Inception Network Explained
Connor Shorten
StackGAN
Connor Shorten
StyleGAN
Connor Shorten
Progressive Growing of GANs Explained
Connor Shorten
Improved Techniques for Training GANs
Connor Shorten
Word2Vec Explained
Connor Shorten
Must Read Papers on GANs
Connor Shorten
Unsupervised Feature Learning
Connor Shorten
Self-Supervised GANs
Connor Shorten
Embedding Graphs with Deep Learning
Connor Shorten
Transfer Learning in GANs
Connor Shorten
ReLU Activation Function
Connor Shorten
AC-GAN Explained
Connor Shorten
SimGAN Explained
Connor Shorten
DC-GAN Explained!
Connor Shorten
ResNet Explained!
Connor Shorten
Graph Convolutional Networks
Connor Shorten
Neural Architecture Search
Connor Shorten
Henry AI Labs
Connor Shorten
Video Classification with Deep Learning
Connor Shorten
BigGANs in Data Augmentation
Connor Shorten
Introduction to Deep Learning
Connor Shorten
EfficientNet Explained!
Connor Shorten
Self-Attention GAN
Connor Shorten
Curriculum Learning in Deep Neural Networks
Connor Shorten
Deep Learning Podcast #1 | Edward Dixon | Stochastic Weight Averaging
Connor Shorten
Deep Compression
Connor Shorten
Skin Cancer Classification with Deep Learning
Connor Shorten
Deep Learning Podcast #2 | Edward Peake | Deep Learning in Medical Imaging
Connor Shorten
The Lottery Ticket Hypothesis Explained!
Connor Shorten
SqueezeNet
Connor Shorten
GauGAN Explained!
Connor Shorten
AutoML with Hyperband
Connor Shorten
DL Podcast #3 | Yannic Kilcher | Population-Based Search
Connor Shorten
Weakly Supervised Pretraining
Connor Shorten
Image Data Augmentation for Deep Learning
Connor Shorten
Unsupervised Data Augmentation
Connor Shorten
Wide ResNet Explained!
Connor Shorten
RevNet: Backpropagation without Storing Activations
Connor Shorten
GANs with Fewer Labels
Connor Shorten
BigBiGAN Unsupervised Learning!
Connor Shorten
Self-Supervised Learning
Connor Shorten
Multi-Task Self-Supervised Learning
Connor Shorten
Self-Supervised GANs
Connor Shorten
Population Based Training
Connor Shorten
Show, Attend and Tell
Connor Shorten
Siamese Neural Networks
Connor Shorten
WaveGAN Explained!
Connor Shorten
VAE-GAN Explained!
Connor Shorten
Evolution in Neural Architecture Search!
Connor Shorten
AI Research Weekly Update August 18th, 2019
Connor Shorten
Weight Agnostic Neural Networks Explained!
Connor Shorten
AI Research Weekly Update August 25th, 2019
Connor Shorten
Neuroevolution of Augmenting Topologies (NEAT)
Connor Shorten
CoDeepNEAT
Connor Shorten
AI Research Weekly Update September 1st, 2019
Connor Shorten
Randomly Wired Neural Networks
Connor Shorten
Genetic CNN
Connor Shorten
More on: LLM Foundations
View skill →Related AI Lessons
⚡
⚡
⚡
⚡
Want to get started with deep learning
Reddit r/deeplearning
Building a Deepfake Detector From Scratch — What Nobody Tells You
Medium · Deep Learning
Unfolding the Meandering Path: High-Dimensional Invariance and the Flat 2D Plane of Neural…
Medium · Deep Learning
Implementing Neural Style Transfer from Scratch: The Project That Started It All
Medium · Deep Learning
🎓
Tutor Explanation
DeepCamp AI