Build an AI Artist - Machine Learning for Hackers #5

Siraj Raval · Beginner ·🧬 Deep Learning ·10y ago

Skills: CV Basics70%ML Maths Basics60%

Key Takeaways

This video demonstrates how to build an AI artist using the Keras deep learning library, specifically by implementing the neural style transfer algorithm to transform images into artistic paintings in the style of famous artists.

Full Transcript

[Music] [Applause] only humans can make something this incredible hello world welcome to serology in today's episode we're going to talk about building an AI artist art is the embodiment of The Human Experience it's a synthesis of all of our emotions and experiences in life you might be asking yourself is it really possible to build something that can do this the answer is yes it's incredible we can train a neural net to learn an artist style until it to modify a picture into a painting in that style it all started when the Google research team released a blog post called Inception ISM they trained a deep convolutional neural network on a huge data set of images so that it could start to detect everyday things like dogs and buildings once it was able to do that they gave it a novel image and asked it to detect an object it had learned in the image if it saw something in the image that looked even slightly similar to what I had already learned say a cloud that kind of looked like a dog it would then modify the image to look more like a dog this resulted in some pretty crazy pictures then another group wrote a similar paper called neural algorithm for artistic style but they repainted an image in the style of another using famous paintings as the base image so when they trained their neural net on Star KN then gave it a novel image it would modify the image to look more like Star KN by artify all of its features since we've gone over convolutional neural Nets in a previous episode at a high level we're going to really deep dive into the code for this one and try to understand exactly how this process works we're going to go through the necessary code to recreate the results from the neural style paper in Python using the Deep learning library Kos let's get started we'll start off by defining our arguments when we run our script we Define the base image the style image and the output image we'll create variables for them then reference a pre computed weights file called vgg16 these are just a bunch of pre-computed synapse weights trained to recognize everyday images which will later add to our neural net as a starting point we'll also want to initialize booleans that define whether or not to rescale our image or maintain their aspect ratio based on user input then we initialize our variables for style and content weight so what do we mean by style and content weight well in the neural style paper they found that when they trained a neural net to recognize the painting style of it artist the learn features in the initial layers were style based like you know how when you train a neural net to recognize a dog initial layers detect edges and the next ones detect shapes and more complex shapes and a whole dog well those same layers of abstraction apply to paintings but what they found were the initial layers the edges and curvature and other low-l features equated with style they also found that the highest few layers were more based on content so in the starting night photo the higher levels would be that dope Sun thing and perhaps a collection of stars the lower levels would be the curvature of the night sky and the color scheme in this way CNN helps separate content from style and mirror the capabilities of our biological Vision we then need to Define how much we want to weigh one or the other because we can optimize for one of them or both depending on how much we weigh each we'll get a different output we'll set our image dimensions then create a tensor representation of our base image style image and output image then we'll combine all three into a single input tensor a tensor is a multi-dimensional array an example would be like colors textures each of those would be arrays as well so colors would be its own array containing the seven main colors then each of those colors will be an array of sub colors we convert our image to a single tensor because it's an easily parsable data structure for our neural network tensors help reduce the high dimensionality of our images and that in turn reduces the computational complexity now we're going to want to build our model which is going to be a convolutional neural network we'll add in our input tensor as the first layer then we'll start defining the other layers we're going to add in 31 layers to our neural net there are three types the convolution 2D layer means it has a set of learnable filters which have a small receptive field the receptive field is a subset of filters that are used to connect the neurons to a local region in the next layer instead of every single other neuron the zero padding layer helps us control the size of the output volume by padding zeros across the border then there's the average pooling layer pooling is a concept in CNN where we take the input image and split it into a set or pool of rectangles pooling helps us avoid overfitting and reduce the amount of parameters in computation by only using a subset of the image as a representation rather than all of it the idea is that once a feature has been found the exact location isn't as important as its rough relative location to other features we'll take the average value from the pool the activation function is called relu or rectified linear unit the reu function is faster than sigmoid without a huge difference in generalization accuracy so we'll use that the numbers here are the number of output filters and the length and width of the input image we'll name each of our layers as well for reference now that we have our model We'll add in the preu weights we called earlier then we're going to Define our loss function for the style content and total variation aka the uniformity of the image loss functions help us calculate the difference between the expected output and the actual output we're going to take these loss functions and combine them into a single scaler or number then we'll get the gradients of the generated image using the loss which helps us map the color scheme our last step is to train our model by minimizing the loss using back propagation the back propagation algorithm we'll use in this case is called limited memory bfgs lbfgs helps minimize our loss function and is space efficient in that it only stores a few key vectors instead of all of them minimiz ing the loss function means modifying the output image iteratively so it looks more and more similar to this artistic style we want scipi gives us a technique as a built-in function and after training is over we can rescale and save the output image that's about it this is what happened when I tried out the algorithm you could also apply this algorithm frame by frame to video content and with more data and computing power it's only going to get better I hope that in the next few years we'll be able to generate not just paintings but other forms of art like sculptures and interior design and facial hairstyles for more information check out the links down below and please subscribe for more ml videos for now I've got to go fix a compile time error so thanks for watching

Original Description

This video will get you up and running with your first AI Artist using the deep learning library Keras! The code for this video is here: https://github.com/llSourcell/AI_Artist I created a Slack channel for us, sign up here: https://wizards.herokuapp.com/ Here's the initial Google DeepDream blog post: http://googleresearch.blogspot.com/2015/06/inceptionism-going-deeper-into-neural.html A Deepdream web app: https://dreamscopeapp.com/ The Neural Style Paper: http://arxiv.org/pdf/1508.06576v2.pdf Some great info on convolutional neural networks: http://colah.github.io/posts/2014-07-Conv-Nets-Modular/ You should train this baby in the cloud using AWS. See ML for Hackers #4 for a tutorial on how to use AWS: https://www.youtube.com/watch?v=eKmIVU8EUbw This person went ahead and made a web app so you don't even have to compile the code to try this out: https://deepart.io/ I love you guys! Thanks for watching my videos, I do it for you. I left my awesome job at Twilio and I'm doing this full time now. I recently created a Patreon page. If you like my videos, feel free to help support my effort here!: https://www.patreon.com/user?ty=h&u=3191693 Much more to come so please subscribe, like, and comment. Follow me: Twitter: https://twitter.com/sirajraval Facebook: https://www.facebook.com/sirajology Instagram: https://www.instagram.com/sirajraval/ Instagram: https://www.instagram.com/sirajraval/ Signup for my newsletter for exciting updates in the field of AI: https://goo.gl/FZzJ5w Hit the Join button above to sign up to become a member of my channel for access to exclusive content! Join my AI community: http://chatgptschool.io/ Sign up for my AI Sports betting Bot, WagerGPT! (500 spots available): https://www.wagergpt.xyz

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Siraj Raval · Siraj Raval · 19 of 60

← Previous Next →

What is Bitcoin?

What is Bitcoin?

5 Ways to Use Bitcoin

5 Ways to Use Bitcoin

BTC Fever - Siraj [Music Video]

BTC Fever - Siraj [Music Video]

5 Reasons to Build Decentralized Apps

5 Reasons to Build Decentralized Apps

The Interplanetary File System

The Interplanetary File System

How to Build a Dapp in 3 min

How to Build a Dapp in 3 min

Life Before Smartphones

Life Before Smartphones

4 Ways to Use Smart Contracts

4 Ways to Use Smart Contracts

3 Dapps You HAVE to See

3 Dapps You HAVE to See

Char's Life as a BitTorrent Engineer

Char's Life as a BitTorrent Engineer

4 Reasons AlphaGo is a Huge Deal

4 Reasons AlphaGo is a Huge Deal

Build a Neural Net in 4 Minutes

Build a Neural Net in 4 Minutes

Sentiment Analysis in 4 Minutes

Sentiment Analysis in 4 Minutes

The Hackathon Life

The Hackathon Life

Your First ML App - Machine Learning for Hackers #1

Your First ML App - Machine Learning for Hackers #1

Build an AI Composer - Machine Learning for Hackers #2

Build an AI Composer - Machine Learning for Hackers #2

Build a Game AI - Machine Learning for Hackers #3

Build a Game AI - Machine Learning for Hackers #3

Build a Movie Recommender - Machine Learning for Hackers #4

Build a Movie Recommender - Machine Learning for Hackers #4

Build an AI Artist - Machine Learning for Hackers #5

Build an AI Artist - Machine Learning for Hackers #5

Build a Chatbot - ML for Hackers #6

Build a Chatbot - ML for Hackers #6

Build an AI Reader - Machine Learning for Hackers #7

Build an AI Reader - Machine Learning for Hackers #7

Build an AI Writer - Machine Learning for Hackers #8

Build an AI Writer - Machine Learning for Hackers #8

Build a Chatbot w/ an API - ML for Hackers #9

Build a Chatbot w/ an API - ML for Hackers #9

One-Shot Learning - Fresh Machine Learning #1

One-Shot Learning - Fresh Machine Learning #1

Generative Adversarial Nets - Fresh Machine Learning #2

Generative Adversarial Nets - Fresh Machine Learning #2

Tone Analysis - Fresh Machine Learning #3

Tone Analysis - Fresh Machine Learning #3

Generate Rap Lyrics - Fresh Machine Learning #4

Generate Rap Lyrics - Fresh Machine Learning #4

Build an Autoencoder in 5 Min - Fresh Machine Learning #5

Build an Autoencoder in 5 Min - Fresh Machine Learning #5

Build a Self Driving Car in 5 Min - Fresh Machine Learning #6

Build a Self Driving Car in 5 Min - Fresh Machine Learning #6

Build an Antivirus in 5 Min - Fresh Machine Learning #7

Build an Antivirus in 5 Min - Fresh Machine Learning #7

TensorFlow in 5 Minutes (tutorial)

TensorFlow in 5 Minutes (tutorial)

Build a Recurrent Neural Net in 5 Min

Build a Recurrent Neural Net in 5 Min

Build a Simulation in 5 Min

Build a Simulation in 5 Min

Build a TensorFlow Image Classifier in 5 Min

Build a TensorFlow Image Classifier in 5 Min

Tensorboard Explained in 5 Min

Tensorboard Explained in 5 Min

Generate Music in TensorFlow

Generate Music in TensorFlow

Build a Game Bot (LIVE)

Build a Game Bot (LIVE)

Deep Learning Frameworks Compared

Deep Learning Frameworks Compared

Introduction - Learn Python for Data Science #1

Introduction - Learn Python for Data Science #1

Build a Neural Network (LIVE)

Build a Neural Network (LIVE)

Twitter Sentiment Analysis - Learn Python for Data Science #2

Twitter Sentiment Analysis - Learn Python for Data Science #2

Recommendation Systems - Learn Python for Data Science #3

Recommendation Systems - Learn Python for Data Science #3

Predicting Stock Prices - Learn Python for Data Science #4

Predicting Stock Prices - Learn Python for Data Science #4

Pong Neural Network (LIVE)

Pong Neural Network (LIVE)

Deep Dream in TensorFlow - Learn Python for Data Science #5

Deep Dream in TensorFlow - Learn Python for Data Science #5

Visualizing Data with D3.js (LIVE)

Visualizing Data with D3.js (LIVE)

Genetic Algorithms - Learn Python for Data Science #6

Genetic Algorithms - Learn Python for Data Science #6

Enter Siraj [Music Video]

Enter Siraj [Music Video]

Build a Web Scraper (LIVE)

Build a Web Scraper (LIVE)

Why is P vs NP Important?

Why is P vs NP Important?

How to Make a Neural Network (LIVE)

How to Make a Neural Network (LIVE)

How to Make an Amazing Tensorflow Chatbot Easily

How to Make an Amazing Tensorflow Chatbot Easily

How to Make an Amazing Video Game Bot Easily

How to Make an Amazing Video Game Bot Easily

How to Make a Tensorflow Neural Network (LIVE)

How to Make a Tensorflow Neural Network (LIVE)

How to Make a Simple Tensorflow Speech Recognizer

How to Make a Simple Tensorflow Speech Recognizer

Joel Shor - Really Quick Questions with an Awesome Google Engineer

Joel Shor - Really Quick Questions with an Awesome Google Engineer

How to Make a Path Planning Algorithm Easily (LIVE)

How to Make a Path Planning Algorithm Easily (LIVE)

The Best Way to Prepare a Dataset Easily

The Best Way to Prepare a Dataset Easily

Catherine Olsson - Really Quick Questions with an OpenAI Engineer

Catherine Olsson - Really Quick Questions with an OpenAI Engineer

How to Make a Tic Tac Toe Neural Network Easily (LIVE)

How to Make a Tic Tac Toe Neural Network Easily (LIVE)

This video teaches you how to build an AI artist using Keras and the neural style transfer algorithm, allowing you to transform images into artistic paintings in the style of famous artists. You'll learn about convolutional neural networks, tensors, and back propagation, and how to apply these concepts to a real-world problem.

Key Takeaways

Define the base image, style image, and output image
Load pre-computed weights and initialize variables
Build a convolutional neural network model
Add layers to the model, including convolutional, pooling, and activation layers
Define the loss function and combine it with the style and content loss
Train the model using back propagation and limited memory BFGS
Rescale and save the output image

💡 The neural style transfer algorithm can be used to transform images into artistic paintings in the style of famous artists by separating the content and style of an image and recombining them in a new way.

🔒 Pro feature: Ask AI to explain this lesson →

More on: CV Basics

View skill →

Identify Horses or Humans with TensorFlow and Vertex AI

Building a Dog Breed Identifier App from scratch - DogNet

Building a Dog Breed Identifier App from scratch - DogNet

Aladdin Persson

Apply OpenGL Texturing and Camera Systems

Apply OpenGL Texturing and Camera Systems

Aerial Image Segmentation with PyTorch

Aerial Image Segmentation with PyTorch

How to Install Stable Diffusion - automatic1111

How to Install Stable Diffusion - automatic1111

Sebastian Kamph

NVIDIA RTXGI Unreal Engine 4 Plugin: Introduction and Setup

NVIDIA RTXGI Unreal Engine 4 Plugin: Introduction and Setup

NVIDIA Developer

Related Reads

I Found the Neural Network I Built in Class 9 — Here’s What Happened When I Tried to Run It Again

Revisiting a 4-year-old neural network project for handwritten digit recognition using a convolutional neural network and analyzing its performance

Medium · Deep Learning

Introduction to Deep Learning and Neural Networks: From Human Brain to Artificial Intelligence

Learn how biological neurons inspired artificial neural networks and deep learning, transforming the AI landscape

Medium · Deep Learning

Want to get started with deep learning

Get started with deep learning by leveraging resources like Andrew Karpathy's playlist and frameworks such as TensorFlow or PyTorch

Reddit r/deeplearning

Building a Deepfake Detector From Scratch — What Nobody Tells You

Learn to build a deepfake detector from scratch and understand the challenges involved in detecting AI-generated fake media

Medium · Deep Learning

Image Classification with ml5.js

The Coding Train