How Does Deep Learning Work? | Two Minute Papers #24
Key Takeaways
The video explains how deep learning works through artificial neural networks, demonstrating their ability to solve complex classification problems by transforming and warping data into a more separable representation. It highlights the importance of multiple hidden layers in deep learning, allowing for more effective inner representations of the data.
Full Transcript
dear fellow Scholars this is 2minute papers with car a neural network is a very loose model of the human brain that we can program in a computer or it's perhaps more appropriate to say that it is inspired by our knowledge of the inner workings of a human brain now let's note that artificial neural networks have been studied for decades by experts and the goal here is not to show all aspects but one intuitive graphical aspect that is really cool and easy to understand take a look at these curves on a plane these curves are a collection of points and these points you can imagine as images sounds or any kind of input data that we try to learn the red and blue curves represent two different classes the red can mean images of trains and the blue for instance images of bunnies now after we have trained the network from this limited data which is basically a bunch of images of trains and bunnies we will get new points on this plane new images and we would like to know whether this new image looks like a train or a bunny this is what the algorithm has to find out and this we call a classification problem to which a simple and bad solution would be simply cutting the plane in half with a line images belonging to the red regions will be classified as the red class and the blue regions as the blue class now as you can see the red region cuts into the blue curve which means that some trains would be misclassified as bunnies it seems that if we look at the problem from this angle we cannot really separate the two classes perfectly with a straight line however if we use a simple neural network it will give us this result hey but that's cheating we were talking about straight lines right this is anything but a straight line a key concept of neural networks is that they create an inner representation of the data model and try to solve the problem in that space what this intuitively means is that the algorithm will start transforming and warping these curves where their shapes start changing and it finds that if we do well with this warping step we can actually draw a line to separate these two classes after we undo this warping and transform the line back to the original problem it will look like a curve really cool isn't it so these are actually lines only in a different representation of the problem who said that the original representation is the best way to solve a problem take a look at this example with the entangled spirals can we separate these with a line not a chance but the answer is not a chance with this representation but if one starts warping them correctly there will be states where they can easily be separated however there are rules in this game for instance one cannot just rip out one of the Spyros here and put it somewhere else these Transformations have to be homeomorphisms which is a term that mathematicians like to use it intuitively means that the warpings are not too crazy meaning that we don't tear apart important structures and as they remain intact the Warped solution is still meaningful with respect to the original problem problem now comes the Deep learning part deep learning means that the neural network has multiple of these hidden layers and can therefore create much more effective inner representations of the data from an earlier episode we've seen in an image recognition test that as we go further and further into the layers first we'll see an edge detector and as a combination of edges object Parts emerge and in the later layers a combination of object Parts create object models let's take take a look at this example we have a bullseye here if you will and you can see that the network is trying to warp this to separate it with a line but in vain however if we have a deep neural network we have more degrees of freedom more directions and possibilities to warp this data and if you think intuitively if this were a piece of paper you could put your finger behind the Red Zone and push it in making it possible to separate the two regions with a line let's take a look at one-dimensional example to better see what's going on this line is the 1D equivalent of the original problem and you can see that the problem becomes quite trivial if we have the freedom to do this kind of transformation we can easily encounter cases where the data is very severely Tangled and we don't know how good the best solution can be there is a very heavily academic subfield of mathematics called not Theory which is the study of tangling and untangling objects it is is subject to a lot of snarky comments for not being well too exciting or useful what is really mind-blowing is that not Theory can actually help us study these kinds of problems and it may ultimately end up being useful for recognizing traffic signs and designing self-driving cars now it's time to get our hands dirty let's run a neural network on this data set and see what happens if we use a low number of neurons and one layer you can see that it is trying ferociously but we know that it is going to be a fruitless Endeavor upon increasing the number of neurons magic happens and we know exactly why yeah thanks so much for watching and for your generous support I feel really privileged to have supporters like you fellow Scholars thank you and I'll see you next time
Original Description
Artificial neural networks provide us incredibly powerful tools in machine learning that are useful for a variety of tasks ranging from image classification to voice translation. So what is all the deep learning rage about? The media seems to be all over the newest neural network research of the DeepMind company that was recently acquired by Google. They used neural networks to create algorithms that are able to play Atari games, learn them like a human would, eventually achieving superhuman performance.
Deep learning means that we use artificial neural network with multiple layers, making it even more powerful for more difficult tasks. These machine learning techniques proved to be useful for many tasks beyond image recognition: they also excel at weather predictions, breast cancer cell mitosis detection, brain image segmentation and toxicity prediction among many others.
In this episode, an intuitive explanation is given to show the inner workings of deep learning algorithms.
________________________
Original blog post by Christopher Olah (source of many images):
http://colah.github.io/posts/2014-03-NN-Manifolds-Topology/
You can train your own deep neural networks on Andrej Karpathy's website:
http://cs.stanford.edu/people/karpathy/convnetjs/demo/classify2d.html
Images used in this video:
Bunny by Tomi Tapio K (CC BY 2.0) - https://flic.kr/p/8EbcEk
Train by B4bees (CC BY 2.0) - https://flic.kr/p/6RzHe4
Train with bunny by Alyssa L. Miller (CC BY 2.0) - https://flic.kr/p/5WPeRN
The knot theory blackboard image was created by Clayton Shonkwiler (CC BY 2.0) https://flic.kr/p/64FYv
The tangled knot image was created by Mikael Hvidtfeldt Christensen (CC BY 2.0) https://flic.kr/p/beYG9D
The thumbnail image is a work of Duncan Hull (CC BY 2.0) - https://flic.kr/p/98qtJB
Subscribe if you would like to see more of these! - http://www.youtube.com/subscription_center?add_user=keeroyz
Splash screen/thumbnail design: Felícia Fehér - http://felicia.hu
Károly Zsolnai-
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
Playlist
Uploads from Two Minute Papers · Two Minute Papers · 27 of 60
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
▶
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
Fluid Simulations with Blender and Wavelet Turbulence | Two Minute Papers #1
Two Minute Papers
Capturing Waves of Light With Femto-photography | Two Minute Papers #2
Two Minute Papers
Artificial Neural Networks and Deep Learning | Two Minute Papers #3
Two Minute Papers
Blender Rendering - Top 7 LuxRender Features
Two Minute Papers
Simulating Breaking Glass | Two Minute Papers #4
Two Minute Papers
Time Lapse Videos From Community Photos | Two Minute Papers #5
Two Minute Papers
AI Learns Van Gogh's Art
Two Minute Papers
Hydrographic Printing | Two Minute Papers #7
Two Minute Papers
Announcing LuxRender 1.5
Two Minute Papers
Digital Creatures Learn To Walk | Two Minute Papers #8
Two Minute Papers
Manipulating Photorealistic Renderings | Two Minute Papers #9
Two Minute Papers
Adaptive Fluid Simulations | Two Minute Papers #10
Two Minute Papers
Building Bridges With Flying Machines | Two Minute Papers #11
Two Minute Papers
Reconstructing Sound From Vibrations | Two Minute Papers #12
Two Minute Papers
Creating Photographs Using Deep Learning | Two Minute Papers #13
Two Minute Papers
Adaptive Cloth Simulations | Two Minute Papers #14
Two Minute Papers
Synthesizing Sound From Collisions | Two Minute Papers #15
Two Minute Papers
Metropolis Light Transport | Two Minute Papers #16
Two Minute Papers
3D Printing a Glockenspiel | Two Minute Papers #17
Two Minute Papers
Modeling Colliding and Merging Fluids | Two Minute Papers #18
Two Minute Papers
Recurrent Neural Network Writes Music and Shakespeare Novels | Two Minute Papers #19
Two Minute Papers
Gradients, Poisson's Equation and Light Transport | Two Minute Papers #20
Two Minute Papers
Real-Time Facial Expression Transfer | Two Minute Papers #21
Two Minute Papers
Automatic Lecture Notes From Videos | Two Minute Papers #22
Two Minute Papers
Be a Part of Two Minute Papers on Patreon!
Two Minute Papers
Recurrent Neural Network Writes Sentences About Images | Two Minute Papers #23
Two Minute Papers
How Does Deep Learning Work? | Two Minute Papers #24
Two Minute Papers
Cryptography, Perfect Secrecy and One Time Pads | Two Minute Papers #25
Two Minute Papers
Terrain Traversal with Reinforcement Learning | Two Minute Papers #26
Two Minute Papers
Multiple-Scattering Microfacet BSDFs with the Smith Model
Two Minute Papers
Google DeepMind's Deep Q-Learning & Superhuman Atari Gameplays | Two Minute Papers #27
Two Minute Papers
Are We Living In a Computer Simulation? | Two Minute Papers #28
Two Minute Papers
Artificial Superintelligence [Audio only] | Two Minute Papers #29
Two Minute Papers
Automatic Parameter Control for Metropolis Light Transport | Two Minute Papers #30
Two Minute Papers
Randomness and Bell's Inequality [Audio only] | Two Minute Papers #31
Two Minute Papers
OpenAI - Non-profit AI company by Elon Musk and Sam Altman
Two Minute Papers
How Do Genetic Algorithms Work? | Two Minute Papers #32
Two Minute Papers
Painting with Fluid Simulations | Two Minute Papers #33
Two Minute Papers
Peer Review #1 [Audio only] | Two Minute Papers
Two Minute Papers
Neural Programmer-Interpreters Learn To Write Programs | Two Minute Papers #34
Two Minute Papers
9 Cool Deep Learning Applications | Two Minute Papers #35
Two Minute Papers
Designing Cities and Furnitures With Machine Learning | Two Minute Papers #36
Two Minute Papers
Designing 3D Printable Robotic Creatures | Two Minute Papers #37
Two Minute Papers
3D Printing Objects With Caustics | Two Minute Papers #38
Two Minute Papers
Interactive Editing of Subsurface Scattering | Two Minute Papers #39
Two Minute Papers
Simulating Viscosity and Melting Fluids | Two Minute Papers #40
Two Minute Papers
What Do Virtual Objects Sound Like? | Two Minute Papers #41
Two Minute Papers
How DeepMind Conquered Go With Deep Learning (AlphaGo) | Two Minute Papers #42
Two Minute Papers
Breaking Deep Learning Systems With Adversarial Examples | Two Minute Papers #43
Two Minute Papers
Extrapolations and Crowdfunded Research (Experiment) | Two Minute Papers #44
Two Minute Papers
Biophysical Skin Aging Simulations | Two Minute Papers #45
Two Minute Papers
What is Impostor Syndrome? | Two Minute Papers #46
Two Minute Papers
Should You Take the Stairs at Work? (For Weight Loss) | Two Minute Papers #47
Two Minute Papers
Artistic Manipulation of Caustics | Two Minute Papers #48
Two Minute Papers
Deep Learning Program Learns to Paint | Two Minute Papers #49
Two Minute Papers
Interactive Photo Recoloring | Two Minute Papers #50
Two Minute Papers
How To Get Started With Machine Learning? | Two Minute Papers #51
Two Minute Papers
Awesome Research For Everyone! - Two Minute Papers Channel Trailer
Two Minute Papers
10 More Cool Deep Learning Applications | Two Minute Papers #52
Two Minute Papers
How DeepMind's AlphaGo Defeated Lee Sedol | Two Minute Papers #53
Two Minute Papers
More on: Neural Network Basics
View skill →Related Reads
📰
📰
📰
📰
What Is MLIR and Why Does It Exist?
Dev.to · Fedor Nikolaev
Why Choosing the Right Machine Learning Development Company Matters More Than the AI Model
Medium · Machine Learning
Data privacy in AI training: federated learning, differential privacy, and synthetic data
Dev.to AI
Data Preprocessing: Encoding and Feature Scaling in Machine Learning
Medium · Machine Learning
🎓
Tutor Explanation
DeepCamp AI