1D convolution for neural networks, part 3: Sliding dot product equations longhand

Brandon Rohrer · Intermediate ·📐 ML Fundamentals ·6y ago

Skills: ML Maths Basics90%Supervised Learning60%

Key Takeaways

The video discusses the mathematical implementation of 1D convolution for neural networks, specifically focusing on the sliding dot product equation and how to handle cases where the kernel does not fully overlap with the signal. It uses explicit pairwise multiplication and summation to calculate the convolution result.

Full Transcript

when we go to write out the values for our convolution result we can do this pairwise multiplication and summation explicitly we can write it out the long way so let's start by jumping kind of into the middle we'll look at Y of P so our result at the position P this means that our kernel fully overlaps the signal X so it goes from minus P 2 plus P and each one of those aligns with one element of the signal and in addition we know that the very tail end after we've flipped it the very left-hand side of that flipped kernel is W sub P we know that lines up exactly with the first value of our signal X sub 0 and we know that because we're calculating Y sub P the convolution at position P so by counting down from P to 0 we're counting our weights from 0 up to P and so we know that our first element is X sub 0 Y sub P and then we just step counting up on our X's and down on our w's because our W has been reversed our kernel has been reversed so we have X sub 1 times W of P minus 1 and we keep going until we get to the center of our kernel which is X sub P times what W sub 0 and then we keep going until we get to the far end of our kernel which is X sub 2 times P W sub minus P you can see there's a little typo here where I repeat the X sub P W sub 0 that shouldn't be there I'll change that but this shows going all the way from counting from WP to W minus B and from X sub 0 to X sub 2 P so this is a full fully overlapped a fully valid convolution as we step backward from here toward Y sub P minus 1 Y sub P minus 2 all the way back to Y sub 2 y sub 1 Y sub Z we get to a condition where our kernel does not completely overlap our signal some of its dangling off to the side the way we handle that mathematically is we pretend that our signal extends out further but that all of those values are zero but what that means is that we can safely ignore them in our equations because zero times each of the weights in our kernel is going to be zero and so we can always start with for y Sub Zero we can just start right in the middle of our kernel and say it's X sub zero times W sub zero and we can count up on our X index down on our W index because there are kernels been flipped X sub one times W minus one X sub 2 times W sub minus two all the way out to X sub P times W sub minus P then when we go to our next position Y sub one our next output value we just shift all of our weights by one but now we can include the W sub one time X sub zero and then move to X sub one times W sub zero and count all the way up to X sub P plus 1 and W sub minus P and we step through until we're completely overlapped at Y sub P and then we keep going we just increment our X index by one every time we increment our Y index by one and run that sliding dot product step by step all the way across and when we get to the far end we do the same thing than we did at the beginning any portion of the kernel that extends past the end of the signal just gets treated like it's lined up with zeros so each of those kernel elements get multiplied by zero so we can ignore them and then we keep going until that Center element on the kernel is lined up with the last element in our Y in our output array Y sub M and then that's our last value

Original Description

Part of an 9-part series on 1D convolution for neural networks. Catch the rest at https://e2eml.school/321

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Brandon Rohrer · Brandon Rohrer · 49 of 60

← Previous Next →

Robot Learning with a Biologically-Inspired Brain (BECCA)

Robot Learning with a Biologically-Inspired Brain (BECCA)

BECCA talk at AGI 2011

BECCA talk at AGI 2011

Robot Learning with a Biologically-Inspired Brain (BECCA), The Sequel

Robot Learning with a Biologically-Inspired Brain (BECCA), The Sequel

BECCA listens to The Hobbit

BECCA listens to The Hobbit

Learning the building blocks of speech: BECCA extracts a hierarchy of audio features

Learning the building blocks of speech: BECCA extracts a hierarchy of audio features

BECCA listens for sound effects in The Hobbit

BECCA listens for sound effects in The Hobbit

BECCA finds movie trailers while watching the Big Bang Theory

BECCA finds movie trailers while watching the Big Bang Theory

Listening for unexpected sounds: BECCA detects anomalies in audio data

Listening for unexpected sounds: BECCA detects anomalies in audio data

Learning the building blocks of vision: BECCA extracts a spatio-temporal hierarchy of features

Learning the building blocks of vision: BECCA extracts a spatio-temporal hierarchy of features

Watching for the unexpected: BECCA detects anomalies in video data

Watching for the unexpected: BECCA detects anomalies in video data

BECCA finds a stationary target

BECCA finds a stationary target

BECCA finds a stationary target at 3X speed

BECCA finds a stationary target at 3X speed

BECCA watches the X-men and Bruce Lee

BECCA watches the X-men and Bruce Lee

BECCA plays Quidditch

BECCA plays Quidditch

BECCA chases a ball

BECCA chases a ball

BECCA chases a ball, part 2

BECCA chases a ball, part 2

Becca chases a ball, part 3

Becca chases a ball, part 3

BECCA creates features from MNIST

BECCA creates features from MNIST

How reinforcement learning works in Becca 7

How reinforcement learning works in Becca 7

Deep Learning Demystified

Deep Learning Demystified

How Data Science Works

How Data Science Works

How Convolutional Neural Networks work

How Convolutional Neural Networks work

How Bayes Theorem works

How Bayes Theorem works

How Deep Neural Networks Work

How Deep Neural Networks Work

Recurrent Neural Networks (RNN) and Long Short-Term Memory (LSTM)

Recurrent Neural Networks (RNN) and Long Short-Term Memory (LSTM)

How Support Vector Machines work / How to open a black box

How Support Vector Machines work / How to open a black box

How autocorrelation works

How autocorrelation works

Getting closer to human intelligence through robotics

Getting closer to human intelligence through robotics

A minimalist's guide to slicing and indexing pandas DataFrames

A minimalist's guide to slicing and indexing pandas DataFrames

How decision trees work

How decision trees work

Data scientist archetypes

Data scientist archetypes

How to use python's datetime package

How to use python's datetime package

How optimization for machine learning works, part 1

How optimization for machine learning works, part 1

How optimization for machine learning works, part 2

How optimization for machine learning works, part 2

How optimization for machine learning works, part 3

How optimization for machine learning works, part 3

How optimization for machine learning works, part 4

How optimization for machine learning works, part 4

How convolutional neural networks work, in depth

How convolutional neural networks work, in depth

How to pick a machine learning model 4: Splitting the data

How to pick a machine learning model 4: Splitting the data

How to pick a machine learning model 3: Choosing a loss function

How to pick a machine learning model 3: Choosing a loss function

How to pick a machine learning model 2: Separating signal from noise

How to pick a machine learning model 2: Separating signal from noise

How to pick a machine learning model 1: Choosing between models

How to pick a machine learning model 1: Choosing between models

How to pick a machine learning model 5: Navigating assumptions

How to pick a machine learning model 5: Navigating assumptions

What do neural networks learn?

What do neural networks learn?

Interview with iRobot's Director of Data Science Angela Bassa

Interview with iRobot's Director of Data Science Angela Bassa

How Backpropagation Works

How Backpropagation Works

Evolutionary Powell's method: A discrete optimizer for hyperparameter optimization

Evolutionary Powell's method: A discrete optimizer for hyperparameter optimization

1D convolution for neural networks, part 1: Sliding dot product

1D convolution for neural networks, part 1: Sliding dot product

1D convolution for neural networks, part 2: Convolution copies the kernel

1D convolution for neural networks, part 2: Convolution copies the kernel

1D convolution for neural networks, part 3: Sliding dot product equations longhand

1D convolution for neural networks, part 3: Sliding dot product equations longhand

1D convolution for neural networks, part 4: Convolution equation

1D convolution for neural networks, part 4: Convolution equation

1D convolution for neural networks, part 5: Backpropagation

1D convolution for neural networks, part 5: Backpropagation

1D convolution for neural networks, part 6: Input gradient

1D convolution for neural networks, part 6: Input gradient

1D convolution for neural networks, part 7: Weight gradient

1D convolution for neural networks, part 7: Weight gradient

1D convolution for neural networks, part 8: Padding

1D convolution for neural networks, part 8: Padding

1D convolution for neural networks, part 9: Stride

1D convolution for neural networks, part 9: Stride

The Four Grand Challenges of Robots in the Home

The Four Grand Challenges of Robots in the Home

How Convolution Works

How Convolution Works

The Softmax neural network layer

The Softmax neural network layer

Batch normalization

Batch normalization

Getting ready to learn Python, Mac edition #1: Files and directories

Getting ready to learn Python, Mac edition #1: Files and directories

This video teaches how to implement 1D convolution for neural networks using the sliding dot product equation, handling cases where the kernel does not fully overlap with the signal. It provides a step-by-step mathematical derivation of the convolution result.

Key Takeaways

Write out the convolution result using pairwise multiplication and summation
Handle cases where the kernel does not fully overlap with the signal by pretending the signal extends with zero values
Calculate the convolution result for each position of the output array

💡 The sliding dot product equation can be used to calculate the convolution result, and cases where the kernel does not fully overlap with the signal can be handled by ignoring the zero-valued elements.

🔒 Pro feature: Ask AI to explain this lesson →

More on: ML Maths Basics

View skill →

Important Steps I Have Followed To Improve My Data Science Skills- Sharing My Experience

Important Steps I Have Followed To Improve My Data Science Skills- Sharing My Experience

Learn Python FAST for Beginners 🚀#coding #conditionals #loops #functions

Learn Python FAST for Beginners 🚀#coding #conditionals #loops #functions

ChethanAIChronicles

“Hello, world” from scratch on a 6502 — Part 1

“Hello, world” from scratch on a 6502 — Part 1

PCA (Principal Component Analysis) in Python - Machine Learning From Scratch 11 - Python Tutorial

PCA (Principal Component Analysis) in Python - Machine Learning From Scratch 11 - Python Tutorial

ROC and AUC in R

ROC and AUC in R

StatQuest with Josh Starmer

Data Science Fundamentals: Data Cleaning in Python

Data Science Fundamentals: Data Cleaning in Python

Related Reads

Put Your First ML App on the Internet for Free

Learn to deploy your first ML app on the internet for free and share it with anyone

Medium · Machine Learning

Put Your First ML App on the Internet for Free

Learn to deploy your first ML app online for free and share it with anyone

Medium · Data Science

Put Your First ML App on the Internet for Free

Deploy your first ML app online for free and share it with anyone

Medium · Python

Seq2Seq and Encoder-Decoder: the one-vector bottleneck that led to attention

Learn how Seq2Seq and Encoder-Decoder models led to the development of attention mechanisms in AI

1. Overview of Artificial Intelligence | What is AI? Fundamental Concepts & Complete History of AI

Professor Rahul Jain