Neural Network Overview (C1W3L01)
Key Takeaways
The video provides an overview of neural networks, explaining how they are formed by stacking sigmoid units and introducing new notation for referring to quantities associated with different layers of the network. It also touches on the computation of loss and backward calculations in a neural network, using logistic regression as a reference point.
Full Transcript
welcome back in this week's you learn to implement a neural network before diving into the technical details I wanted in this video to give you a quick overview of what you'll be seeing in this week's videos so if you don't follow all the details in this video don't worry about it we'll delve in the technical details in the next few videos but for now let's give a quick overview of how you implement in your network last week we had talked about logistic regression and we saw how this model corresponds to the following computation graph where you didn't put the features X and parameters W MB does allows you to compute Z which is then used to compute a and we were using a interchangeably with this output Y hat and then you can compute the loss function l a new network looks like this and as I already previously alluded you can form a neural network by stacking together a lot of little sigmoid units whereas previously this node corresponds to two steps of calculations the first three compute the Z value second is it computes this a value in this doing network this stack of notes will correspond to a Z like calculation like this as well as an a like calculation like that and then that node will correspond to another Z and another 8 like calculation so the notation which we should use later will look like this first what inputs the features X together with some parameters W and B and this will allow you to compute z1 so new notation that one should use is that we'll use superscript square bracket 1 to refer to quantities associated with this stack of node is called a lair and then later we'll use superscript square bracket 2 to refer to quantities associated with Daniel really that's called another layer of the network and the superscript square brackets like we have here are not to be confused with the superscript round brackets which we used to refer to individual training examples so whereas X superscript on racket I refer to the I've trained example superscript square bracket 1 and 2 refer to these different um layers layer 1 and layer 2 in this network but they're going on after computing z1 similar to logistic regression there'll be a computation to compute a 1 and that's just some sigmoid of z1 and then you compute Z 2 using another linear equation and then compute a 2 and a 2 is the final output of the neural network and will also be used interchangeably with Y hat so I know that was a lot of details but the key intuition to take away is that whereas for logistic regression we had this Z followed by a calculation and this new network here we just do it multiple times as Z followed by a calculation and a Z qualifying a calculation and then you finally compute the loss and yet and you remember that for logistic regression we had in some backward calculation in order to compute derivatives are so confusing da easy and so on so in the same way a new network will end up doing a backward calculation that looks like this it will jump you end up computing da 2 DZ 2 that allows you to compute DW 2 DB 2 and so on in this sort of a right to left backward calculation that is denoting with the red arrows so thank you quick overview of what a neural network website space you take the logistic regression and repeating it twice I know there's a lot of new notation love new details don't worry about they didn't follow everything we'll go into the details most slowly in the next few videos so let's go on to the next video we'll stop to talk about the neural network representation
Original Description
Take the Deep Learning Specialization: http://bit.ly/3aeHnku
Check out all our courses: https://www.deeplearning.ai
Subscribe to The Batch, our weekly newsletter: https://www.deeplearning.ai/thebatch
Follow us:
Twitter: https://twitter.com/deeplearningai_
Facebook: https://www.facebook.com/deeplearningHQ/
Linkedin: https://www.linkedin.com/company/deeplearningai
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
Playlist
Uploads from DeepLearningAI · DeepLearningAI · 29 of 60
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
▶
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
Forward and Backward Propagation (C1W4L06)
DeepLearningAI
deeplearning.ai's Heroes of Deep Learning: Yuanqing Lin
DeepLearningAI
deeplearning.ai's Heroes of Deep Learning: Ruslan Salakhutdinov
DeepLearningAI
deeplearning.ai's Heroes of Deep Learning: Yoshua Bengio
DeepLearningAI
deeplearning.ai's Heroes of Deep Learning: Pieter Abbeel
DeepLearningAI
deeplearning.ai's Heroes of Deep Learning: Ian Goodfellow
DeepLearningAI
deeplearning.ai's Heroes of Deep Learning: Andrej Karpathy
DeepLearningAI
Using an Appropriate Scale (C2W3L02)
DeepLearningAI
Gradient Checking (C2W1L13)
DeepLearningAI
Gradient Checking Implementation Notes (C2W1L14)
DeepLearningAI
Learning Rate Decay (C2W2L09)
DeepLearningAI
Understanding Mini-Batch Gradient Dexcent (C2W2L02)
DeepLearningAI
Mini Batch Gradient Descent (C2W2L01)
DeepLearningAI
The Problem of Local Optima (C2W3L10)
DeepLearningAI
Exponentially Weighted Averages (C2W2L03)
DeepLearningAI
Tuning Process (C2W3L01)
DeepLearningAI
Understanding Exponentially Weighted Averages (C2W2L04)
DeepLearningAI
Bias Correction of Exponentially Weighted Averages (C2W2L05)
DeepLearningAI
Gradient Descent With Momentum (C2W2L06)
DeepLearningAI
Normalizing Activations in a Network (C2W3L04)
DeepLearningAI
Hyperparameter Tuning in Practice (C2W3L03)
DeepLearningAI
Adam Optimization Algorithm (C2W2L08)
DeepLearningAI
RMSProp (C2W2L07)
DeepLearningAI
Fitting Batch Norm Into Neural Networks (C2W3L05)
DeepLearningAI
Why Does Batch Norm Work? (C2W3L06)
DeepLearningAI
Batch Norm At Test Time (C2W3L07)
DeepLearningAI
Softmax Regression (C2W3L08)
DeepLearningAI
Deep Learning Frameworks (C2W3L10)
DeepLearningAI
Neural Network Overview (C1W3L01)
DeepLearningAI
Training Softmax Classifier (C2W3L09)
DeepLearningAI
Why Deep Representations? (C1W4L04)
DeepLearningAI
Gradient Descent For Neural Networks (C1W3L09)
DeepLearningAI
Neural Network Representations (C1W3L02)
DeepLearningAI
TensorFlow (C2W3L11)
DeepLearningAI
Activation Functions (C1W3L06)
DeepLearningAI
Explanation For Vectorized Implementation (C1W3L05)
DeepLearningAI
Getting Matrix Dimensions Right (C1W4L03)
DeepLearningAI
Understanding Dropout (C2W1L07)
DeepLearningAI
Building Blocks of a Deep Neural Network (C1W4L05)
DeepLearningAI
Why Non-linear Activation Functions (C1W3L07)
DeepLearningAI
Computing Neural Network Output (C1W3L03)
DeepLearningAI
Backpropagation Intuition (C1W3L10)
DeepLearningAI
Train/Dev/Test Sets (C2W1L01)
DeepLearningAI
Deep L-Layer Neural Network (C1W4L01)
DeepLearningAI
Random Initialization (C1W3L11)
DeepLearningAI
Other Regularization Methods (C2W1L08)
DeepLearningAI
Normalizing Inputs (C2W1L09)
DeepLearningAI
Derivatives Of Activation Functions (C1W3L08)
DeepLearningAI
Parameters vs Hyperparameters (C1W4L07)
DeepLearningAI
Vectorizing Across Multiple Examples (C1W3L04)
DeepLearningAI
What does this have to do with the brain? (C1W4L08)
DeepLearningAI
Dropout Regularization (C2W1L06)
DeepLearningAI
Vanishing/Exploding Gradients (C2W1L10)
DeepLearningAI
Basic Recipe for Machine Learning (C2W1L03)
DeepLearningAI
Bias/Variance (C2W1L02)
DeepLearningAI
Forward Propagation in a Deep Network (C1W4L02)
DeepLearningAI
Weight Initialization in a Deep Network (C2W1L11)
DeepLearningAI
Numerical Approximations of Gradients (C2W1L12)
DeepLearningAI
Regularization (C2W1L04)
DeepLearningAI
Why Regularization Reduces Overfitting (C2W1L05)
DeepLearningAI
More on: Neural Network Basics
View skill →Related AI Lessons
⚡
⚡
⚡
⚡
Mastering TypeScript — Understanding the TypeScript Compiler (tsc) from Scratch — Lesson 2
Medium · JavaScript
Stop Overfitting With Basically One Line of Code
Medium · AI
Stop Overfitting With Basically One Line of Code
Medium · Machine Learning
Stop Overfitting With Basically One Line of Code
Medium · Data Science
🎓
Tutor Explanation
DeepCamp AI