AI That Could Soon Replace Vector Artists [DALL-E]

bycloud · Advanced ·📰 AI News & Updates ·5y ago

Skills: LLM Foundations70%

Key Takeaways

DALL-E, a neural network model, can create images by associating concepts within given text descriptions, leveraging the success of GPT-3 language model, and demonstrating capabilities in style transfer, texture, and color generation, as well as object recognition and illustration.

Full Transcript

who knew pikachu can be such a chat playing a grand piano or even riding a motorbike this is not until a couple days ago open ai decided to once again blow our mind away with its newest ai models just five days into 2021. we are off to a great start and in the ai part of course its name is inspired by the artist salvador dali and pixar's wall-e or wally is a trained neural network that can create images by associating concepts within the given text descriptions this has only become possible because of the success of gpt3 language model which can complete any text by literally learning nearly everything the internet has to offer which is absolutely mental so how insane is this dolly ai model well let's look at the results together right away openai did not release their trained model so we would be looking at the official demo instead as of why they did not make the model available i will elaborate on that later on at the very top we can see the text prompt that is used to generate the images below these are already being generated previously by open ai so we are just loading images here but surprisingly there are a ton of demos and you can go play around yourself on their blog but as amazing as it looks it is still cherry picked by another ai that they made called clip which is also released at the same time and it ranks the similarity between the given text and the image so what we are seeing now is already being ranked and the more samples that are being taken the better quality of the images become and it also shows how well dolly and clip work together as a dual especially as a sorting function but generally as we can see dolly is spectacular at generating images that show traits of style transfer and just basically anything that is changing the surface of something this also ranges from art styles textures and colors it is also good at generating a typical object when mentioned in the text prompt things that often appear in the data set show high consistency at generating the overall shape or structure of it like the stop sign however when it is about object symmetry like faces it starts to struggle as faces can be symmetrical and asymmetrical at times to further nick pick the false to make me look smarter you can tell that for some image generation it is bad at generating the exact precise description if the text prompt gives ideas that are abstract or unseen before the ai seems like it would fail to process every single detail and only succeed in just a few ones when the number starts to get too big it struggles at generating the right amount of it or the shape starts to deform and you can clearly tell that it is not a pentagon and if a word could have multiple meanings we can see different objects being brought forward and shown to us interestingly they also provided demos for objects that would contain concepts like reflections to test out how well the ai would understand it and as expected it only occasionally gets it right same goes to a lot of concepts like playing the piano would require you to sit outside not inside pikachu novel view generation is hard to be consistent but also can be proven successful in some cases and you can also tell it to generate illustrations too but it is hard to stay consistent generating symmetrical faces with asymmetrical features there is image completion alongside the text prompt too so generating images based on a specific part of the image can hugely affect the outcome like the mannequin here so they all look really consistent or to use the image to let the ai to refer to and it really is sad that i can't play with the trained model myself it just looks so much fun to test it but if it's actually released there are all kinds of let's say potential issues that could be a little problematic for instance just by looking at the quality of these illustrations i feel like it might be a big threat to a lot of small businesses like icon or cartoon illustrations distributing companies let alone photoshop artists or just artists in general who might lose some of their smaller deals there's just so much money that could be involved in this and could definitely slam pretty hard on the freelancer markets on the bright side this does give other people easier access to other creative aspects such as generating infinite amounts of the same texture just with slight variations maybe reusing assets in games would not be a thing in the future anymore because it is just too convenient to get something slightly different and just training gpt 3 can cost up to 5 million usd according to the internet so no one would probably invest this much money into something that is not entirely perfect yet also there are similar concerns whereby people were able to extract the training data from the gpd2 model there is a paper about this problem which showed how gpt2 was capable of extracting personal information about someone when a correct prompt is given note that dolly has the same concern but it still has the possibility of being able to extract personal ids or credit cards are unintentionally being included in the data set too there's just too many malicious things that you can do with this who knows if this dolly can photoshop or even deep fake just a text prompt away not even mentioning those not safe for work stuff and how well it can possibly do well no more with the speculation on what dolly can do but if it's me i just generate some dumb like a bunch of pout faces with a variation of my mako and they did release the code for clip so if you do want to try it out i'll link its blog and the collab down in the description the least block from openai would also be down there too and this video is sponsored by infinite red infinite consulting handles your mobile web and ai needs if you are looking for someone to bear your app visit with the link down in the description thank you guys for watching hope you enjoyed it openai is just amazing at blowing our mind away and this is just a wild start for the year 2021. shout out to mark schwinn and many other patreons that support my work through patreon and i'll see you guys in the next one

Original Description

DALL-E's paper is not yet released, but we can expect it soon in the future. For now, we can only enjoy the blog they have provided us with. Even though it may be the cherry picked results, but even a few papers earlier, nothing can come close to this level of quality, this is just insane. DALL-E [Blog] https://openai.com/blog/dall-e/ CLIP [Blog] https://openai.com/blog/clip/ [GitHub] https://github.com/openai/CLIP [Paper] https://cdn.openai.com/papers/Learning_Transferable_Visual_Models_From_Natural_Language_Supervision.pdf [Colab] https://colab.research.google.com/github/openai/clip/blob/master/Interacting_with_CLIP.ipynb Today's Sponsor is Infinite Red Infinite Red consulting handles your mobile, web, and AI needs Check it out here: https://bit.ly/2UwddmM This video is supported by the kind Patrons: 🙏Marc Schwyn, Mazen Alotaibi, Sascha Henrichs, Jake Disco, Peter Davidowicz Support me on Patreon if you hope to see more: https://www.patreon.com/bycloud [Discord] https://discord.gg/NhJZGtH [Twitter] https://twitter.com/bycloudai [Patreon] https://www.patreon.com/bycloud [Music] Steaminwaffles - The Walk Home [Profile Art] https://twitter.com/bynicalcynical

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from bycloud · bycloud · 34 of 60

← Previous Next →

Can Deepfake work on Anime?

Can Deepfake work on Anime?

AI that Can Copy Voices

AI that Can Copy Voices

Live Action Is Terrible So AI Turned It Back Into Anime

Live Action Is Terrible So AI Turned It Back Into Anime

2 AIs Enhance Anime to 4K 240FPS, but is it good?

2 AIs Enhance Anime to 4K 240FPS, but is it good?

IRL to Anime With Cartoonization AI

IRL to Anime With Cartoonization AI

How Does AI Generated Songs Sound Like? [OpenAI Jukebox]

How Does AI Generated Songs Sound Like? [OpenAI Jukebox]

AI Makes Any Images Cinematic [3D Photo Inpainting]

AI Makes Any Images Cinematic [3D Photo Inpainting]

AI Generates Anime Faces, And It's Getting Even Better [StyleGAN2]

AI Generates Anime Faces, And It's Getting Even Better [StyleGAN2]

Tech Behind The Meme: Dame Da Ne AI - Single Image Deepfake

Tech Behind The Meme: Dame Da Ne AI - Single Image Deepfake

AI Generates New Light Source for Images [PaintingLight]

AI Generates New Light Source for Images [PaintingLight]

Depixelizing Doom Guy? Mona Lisa in Real Life? The "Upscaling" AI: PULSE

Depixelizing Doom Guy? Mona Lisa in Real Life? The "Upscaling" AI: PULSE

Image Completion AI - Predict Pixels Just Like Text Predictions [Image-GPT]

Image Completion AI - Predict Pixels Just Like Text Predictions [Image-GPT]

AI Generates 3D Human Model from 2D Image [PIFuHD - FacebookAI]

AI Generates 3D Human Model from 2D Image [PIFuHD - FacebookAI]

AI Assisted Masking - Save Your Precious Time Right Now [AE Rotobrush 2]

AI Assisted Masking - Save Your Precious Time Right Now [AE Rotobrush 2]

This AI Reconstruct Real Life Objects From Just Images [NeRF]

This AI Reconstruct Real Life Objects From Just Images [NeRF]

Image Restoration AI - Upscale and Restore Faces with DFDNet

Image Restoration AI - Upscale and Restore Faces with DFDNet

Best Image Colorization AI 2020

Best Image Colorization AI 2020

Image Decomposition AI - Edit Highlights and Textures Easily [Appearance Eraser]

Image Decomposition AI - Edit Highlights and Textures Easily [Appearance Eraser]

Deepfake With Audio Only [Wav2Lip]

Deepfake With Audio Only [Wav2Lip]

Copy IRL, Paste on your PC [AR Cut & Paste]

Copy IRL, Paste on your PC [AR Cut & Paste]

This AI Transform Faces into Hyper-Realistic Cartoon Characters [Toonify]

This AI Transform Faces into Hyper-Realistic Cartoon Characters [Toonify]

This AI Restores Old Photos with Damages Automatically!

This AI Restores Old Photos with Damages Automatically!

Anime Filter with AI - Snapchat vs. TikTok

Anime Filter with AI - Snapchat vs. TikTok

AI Reduces Bandwidth Problems for Video Calls [NVIDIA Maxine]

AI Reduces Bandwidth Problems for Video Calls [NVIDIA Maxine]

AI Motion Capture - Track Your Hands & Body WITHOUT Bodysuit [FrankMocap]

AI Motion Capture - Track Your Hands & Body WITHOUT Bodysuit [FrankMocap]

AI Converts Cartoon Characters To Real Life [Pixel2Style2Pixel]

AI Converts Cartoon Characters To Real Life [Pixel2Style2Pixel]

AI Sky Replacement with SkyAR

AI Sky Replacement with SkyAR

Better Than DAIN? NEW BEST Tool for Boosting Video's FPS with AI [RIFE/Flowframes]

Better Than DAIN? NEW BEST Tool for Boosting Video's FPS with AI [RIFE/Flowframes]

AI That Paints Anything Stroke By Stroke

AI That Paints Anything Stroke By Stroke

What Happens When AI Robots Design Themselves

What Happens When AI Robots Design Themselves

Deepfake Movements with 1 image ONLY [Liquid Warping GAN]

Deepfake Movements with 1 image ONLY [Liquid Warping GAN]

ANYTHING can be a "Green Screen" Now [Real-Time High-Resolution Background Matting]

ANYTHING can be a "Green Screen" Now [Real-Time High-Resolution Background Matting]

AI Transform any Image into Sketch or Line Art [ArtLine]

AI Transform any Image into Sketch or Line Art [ArtLine]

AI That Could Soon Replace Vector Artists [DALL-E]

AI That Could Soon Replace Vector Artists [DALL-E]

Photoshop Detector AI Is Useless

Photoshop Detector AI Is Useless

The Future Of Online Shopping

The Future Of Online Shopping

How The Future of Image Search Would Look Like

How The Future of Image Search Would Look Like

Everyone Can Make 3D Animations Easily Now! [Monster Mash]

Everyone Can Make 3D Animations Easily Now! [Monster Mash]

3D Video Stabilization with AI [NSFF]

3D Video Stabilization with AI [NSFF]

OpenAI’s Sarcastic Chat Bot [GPT-3 API Beta]

OpenAI’s Sarcastic Chat Bot [GPT-3 API Beta]

You Describe & AI Photoshops Faces For You [StyleCLIP]

You Describe & AI Photoshops Faces For You [StyleCLIP]

You Only Need Audio To Deepfake Now! Might look slightly cursed tho [PCAVS]

You Only Need Audio To Deepfake Now! Might look slightly cursed tho [PCAVS]

This AI Transfers Anime Back Into Sketch [Anime2Sketch]

This AI Transfers Anime Back Into Sketch [Anime2Sketch]

AI Learns To Play CS:GO By Watching Humans Play!

AI Learns To Play CS:GO By Watching Humans Play!

How AI Fixes The Horrendous CR7 Statue

How AI Fixes The Horrendous CR7 Statue

Best Vocal Isolation & Instrumental Extraction 2021 [lalal.ai vs Spleeter]

Best Vocal Isolation & Instrumental Extraction 2021 [lalal.ai vs Spleeter]

Face Enhance AI Restores Extremely Blurry Faces [GPEN]

Face Enhance AI Restores Extremely Blurry Faces [GPEN]

AI That Only Needs 1 Image To Deepfake [SimSwap]

AI That Only Needs 1 Image To Deepfake [SimSwap]

The Amazing AI Behind the TikTok JoJo Pose Challenge [BoostMonocularDepth + 3DP]

The Amazing AI Behind the TikTok JoJo Pose Challenge [BoostMonocularDepth + 3DP]

StyleGAN3!? - What AI Actually Sees When Generating Faces [Alias-Free GAN]

StyleGAN3!? - What AI Actually Sees When Generating Faces [Alias-Free GAN]

AI generated art goes brrrrr [VQGAN+CLIP]

AI generated art goes brrrrr [VQGAN+CLIP]

AI That Doodles Any Given Description

AI That Doodles Any Given Description

Best AI Motion Capture 2021 - OpenPose vs DeepMotion

Best AI Motion Capture 2021 - OpenPose vs DeepMotion

Anime Image Enhance AI Has Gone To The Next Level [Real-ESRGAN]

Anime Image Enhance AI Has Gone To The Next Level [Real-ESRGAN]

This Video's Voice Is Entirely Made From Audio Deepfake

This Video's Voice Is Entirely Made From Audio Deepfake

I Can’t Sing So I Cloned My Voice w/ AI To Cover Goodbye Sengen (English Cover)

I Can’t Sing So I Cloned My Voice w/ AI To Cover Goodbye Sengen (English Cover)

Best Background Removal - AIs Removes BG Without Green Screen And It's Amazing. [RVM]

Best Background Removal - AIs Removes BG Without Green Screen And It's Amazing. [RVM]

How I Deepfaked VTuber Gawr Gura with AI

How I Deepfaked VTuber Gawr Gura with AI

AI Magic Removal - Removes ANYTHING & Inpaints For You [LaMa]

AI Magic Removal - Removes ANYTHING & Inpaints For You [LaMa]

I Did NOT Expect AI Anime Filter To Be This Good [AnimeGANv2]

I Did NOT Expect AI Anime Filter To Be This Good [AnimeGANv2]

DALL-E is a revolutionary AI model that can generate images from text descriptions, leveraging GPT-3's capabilities, and demonstrating impressive results in style transfer, object recognition, and illustration. However, it also raises concerns about potential misuse and impact on creative industries.

Key Takeaways

Explore DALL-E's capabilities through OpenAI's official demo
Analyze the model's strengths and weaknesses in image generation
Consider the potential impact of DALL-E on creative industries
Investigate the CLIP model and its role in ranking image similarity
Evaluate the potential risks and concerns associated with DALL-E

💡 DALL-E's capabilities in image generation and style transfer have significant potential, but also raise concerns about misuse and impact on creative industries, highlighting the need for responsible AI development and deployment.

🔒 Pro feature: Ask AI to explain this lesson →

More on: LLM Foundations

View skill →

Getting Started with Vertex AI Gemini 1.5 Flash

I TRAINED AN AI TO SOLVE 2+2 (w/ Live Coding)

I TRAINED AN AI TO SOLVE 2+2 (w/ Live Coding)

How to use the ChatGPT API with Python!!

How to use the ChatGPT API with Python!!

Nicholas Renotte

Gemini 2.5: Create an interactive plot of economic data

Gemini 2.5: Create an interactive plot of economic data

Google DeepMind

LangChain Chatbots: Building a Personalized AI Assistant

LangChain Chatbots: Building a Personalized AI Assistant

Analytics Vidhya

Auto-generating meeting notes with Python

Auto-generating meeting notes with Python

Related AI Lessons

[PoV] When Everyone Is Smart, No One Is

In a world where AI makes everyone smart, the value of intelligence decreases, and new challenges arise

The Honeymoon Is Over: AI Music Has Entered Its Institutional Era

AI music has transitioned from proving its functionality to proving its value and deservingness of existence

Critical thinking in the AI Era

Develop critical thinking skills to navigate the AI era effectively and make informed decisions

Medium · Data Science

Anthropic Just Passed OpenAI Among Business Users. Here’s What That Means for Your Stack.

Anthropic surpasses OpenAI in business user adoption, impacting the AI stack for enterprises

Tasty Weird! Book 16 by Anh Do · Audiobook preview

Google Play Books