AI That Could Soon Replace Vector Artists [DALL-E]

bycloud · Advanced ·📰 AI News & Updates ·5y ago

Key Takeaways

DALL-E, a neural network model, can create images by associating concepts within given text descriptions, leveraging the success of GPT-3 language model, and demonstrating capabilities in style transfer, texture, and color generation, as well as object recognition and illustration.

Full Transcript

who knew pikachu can be such a chat playing a grand piano or even riding a motorbike this is not until a couple days ago open ai decided to once again blow our mind away with its newest ai models just five days into 2021. we are off to a great start and in the ai part of course its name is inspired by the artist salvador dali and pixar's wall-e or wally is a trained neural network that can create images by associating concepts within the given text descriptions this has only become possible because of the success of gpt3 language model which can complete any text by literally learning nearly everything the internet has to offer which is absolutely mental so how insane is this dolly ai model well let's look at the results together right away openai did not release their trained model so we would be looking at the official demo instead as of why they did not make the model available i will elaborate on that later on at the very top we can see the text prompt that is used to generate the images below these are already being generated previously by open ai so we are just loading images here but surprisingly there are a ton of demos and you can go play around yourself on their blog but as amazing as it looks it is still cherry picked by another ai that they made called clip which is also released at the same time and it ranks the similarity between the given text and the image so what we are seeing now is already being ranked and the more samples that are being taken the better quality of the images become and it also shows how well dolly and clip work together as a dual especially as a sorting function but generally as we can see dolly is spectacular at generating images that show traits of style transfer and just basically anything that is changing the surface of something this also ranges from art styles textures and colors it is also good at generating a typical object when mentioned in the text prompt things that often appear in the data set show high consistency at generating the overall shape or structure of it like the stop sign however when it is about object symmetry like faces it starts to struggle as faces can be symmetrical and asymmetrical at times to further nick pick the false to make me look smarter you can tell that for some image generation it is bad at generating the exact precise description if the text prompt gives ideas that are abstract or unseen before the ai seems like it would fail to process every single detail and only succeed in just a few ones when the number starts to get too big it struggles at generating the right amount of it or the shape starts to deform and you can clearly tell that it is not a pentagon and if a word could have multiple meanings we can see different objects being brought forward and shown to us interestingly they also provided demos for objects that would contain concepts like reflections to test out how well the ai would understand it and as expected it only occasionally gets it right same goes to a lot of concepts like playing the piano would require you to sit outside not inside pikachu novel view generation is hard to be consistent but also can be proven successful in some cases and you can also tell it to generate illustrations too but it is hard to stay consistent generating symmetrical faces with asymmetrical features there is image completion alongside the text prompt too so generating images based on a specific part of the image can hugely affect the outcome like the mannequin here so they all look really consistent or to use the image to let the ai to refer to and it really is sad that i can't play with the trained model myself it just looks so much fun to test it but if it's actually released there are all kinds of let's say potential issues that could be a little problematic for instance just by looking at the quality of these illustrations i feel like it might be a big threat to a lot of small businesses like icon or cartoon illustrations distributing companies let alone photoshop artists or just artists in general who might lose some of their smaller deals there's just so much money that could be involved in this and could definitely slam pretty hard on the freelancer markets on the bright side this does give other people easier access to other creative aspects such as generating infinite amounts of the same texture just with slight variations maybe reusing assets in games would not be a thing in the future anymore because it is just too convenient to get something slightly different and just training gpt 3 can cost up to 5 million usd according to the internet so no one would probably invest this much money into something that is not entirely perfect yet also there are similar concerns whereby people were able to extract the training data from the gpd2 model there is a paper about this problem which showed how gpt2 was capable of extracting personal information about someone when a correct prompt is given note that dolly has the same concern but it still has the possibility of being able to extract personal ids or credit cards are unintentionally being included in the data set too there's just too many malicious things that you can do with this who knows if this dolly can photoshop or even deep fake just a text prompt away not even mentioning those not safe for work stuff and how well it can possibly do well no more with the speculation on what dolly can do but if it's me i just generate some dumb like a bunch of pout faces with a variation of my mako and they did release the code for clip so if you do want to try it out i'll link its blog and the collab down in the description the least block from openai would also be down there too and this video is sponsored by infinite red infinite consulting handles your mobile web and ai needs if you are looking for someone to bear your app visit with the link down in the description thank you guys for watching hope you enjoyed it openai is just amazing at blowing our mind away and this is just a wild start for the year 2021. shout out to mark schwinn and many other patreons that support my work through patreon and i'll see you guys in the next one

Original Description

DALL-E's paper is not yet released, but we can expect it soon in the future. For now, we can only enjoy the blog they have provided us with. Even though it may be the cherry picked results, but even a few papers earlier, nothing can come close to this level of quality, this is just insane. DALL-E [Blog] https://openai.com/blog/dall-e/ CLIP [Blog] https://openai.com/blog/clip/ [GitHub] https://github.com/openai/CLIP [Paper] https://cdn.openai.com/papers/Learning_Transferable_Visual_Models_From_Natural_Language_Supervision.pdf [Colab] https://colab.research.google.com/github/openai/clip/blob/master/Interacting_with_CLIP.ipynb Today's Sponsor is Infinite Red Infinite Red consulting handles your mobile, web, and AI needs Check it out here: https://bit.ly/2UwddmM This video is supported by the kind Patrons: 🙏Marc Schwyn, Mazen Alotaibi, Sascha Henrichs, Jake Disco, Peter Davidowicz Support me on Patreon if you hope to see more: https://www.patreon.com/bycloud [Discord] https://discord.gg/NhJZGtH [Twitter] https://twitter.com/bycloudai [Patreon] https://www.patreon.com/bycloud [Music] Steaminwaffles - The Walk Home [Profile Art] https://twitter.com/bynicalcynical
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from bycloud · bycloud · 34 of 60

1 Can Deepfake work on Anime?
Can Deepfake work on Anime?
bycloud
2 AI that Can Copy Voices
AI that Can Copy Voices
bycloud
3 Live Action Is Terrible So AI Turned It Back Into Anime
Live Action Is Terrible So AI Turned It Back Into Anime
bycloud
4 2 AIs Enhance Anime to 4K 240FPS, but is it good?
2 AIs Enhance Anime to 4K 240FPS, but is it good?
bycloud
5 IRL to Anime With Cartoonization AI
IRL to Anime With Cartoonization AI
bycloud
6 How Does AI Generated Songs Sound Like? [OpenAI Jukebox]
How Does AI Generated Songs Sound Like? [OpenAI Jukebox]
bycloud
7 AI Makes Any Images Cinematic [3D Photo Inpainting]
AI Makes Any Images Cinematic [3D Photo Inpainting]
bycloud
8 AI Generates Anime Faces, And It's Getting Even Better [StyleGAN2]
AI Generates Anime Faces, And It's Getting Even Better [StyleGAN2]
bycloud
9 Tech Behind The Meme: Dame Da Ne AI - Single Image Deepfake
Tech Behind The Meme: Dame Da Ne AI - Single Image Deepfake
bycloud
10 AI Generates New Light Source for Images [PaintingLight]
AI Generates New Light Source for Images [PaintingLight]
bycloud
11 Depixelizing Doom Guy? Mona Lisa in Real Life? The "Upscaling" AI: PULSE
Depixelizing Doom Guy? Mona Lisa in Real Life? The "Upscaling" AI: PULSE
bycloud
12 Image Completion AI - Predict Pixels Just Like Text Predictions [Image-GPT]
Image Completion AI - Predict Pixels Just Like Text Predictions [Image-GPT]
bycloud
13 AI Generates 3D Human Model from 2D Image [PIFuHD - FacebookAI]
AI Generates 3D Human Model from 2D Image [PIFuHD - FacebookAI]
bycloud
14 AI Assisted Masking - Save Your Precious Time Right Now [AE Rotobrush 2]
AI Assisted Masking - Save Your Precious Time Right Now [AE Rotobrush 2]
bycloud
15 This AI Reconstruct Real Life Objects From Just Images [NeRF]
This AI Reconstruct Real Life Objects From Just Images [NeRF]
bycloud
16 Image Restoration AI - Upscale and Restore Faces with DFDNet
Image Restoration AI - Upscale and Restore Faces with DFDNet
bycloud
17 Best Image Colorization AI 2020
Best Image Colorization AI 2020
bycloud
18 Image Decomposition AI - Edit Highlights and Textures Easily [Appearance Eraser]
Image Decomposition AI - Edit Highlights and Textures Easily [Appearance Eraser]
bycloud
19 Deepfake With Audio Only [Wav2Lip]
Deepfake With Audio Only [Wav2Lip]
bycloud
20 Copy IRL, Paste on your PC [AR Cut & Paste]
Copy IRL, Paste on your PC [AR Cut & Paste]
bycloud
21 This AI Transform Faces into Hyper-Realistic Cartoon Characters [Toonify]
This AI Transform Faces into Hyper-Realistic Cartoon Characters [Toonify]
bycloud
22 This AI Restores Old Photos with Damages Automatically!
This AI Restores Old Photos with Damages Automatically!
bycloud
23 Anime Filter with AI - Snapchat vs. TikTok
Anime Filter with AI - Snapchat vs. TikTok
bycloud
24 AI Reduces Bandwidth Problems for Video Calls [NVIDIA Maxine]
AI Reduces Bandwidth Problems for Video Calls [NVIDIA Maxine]
bycloud
25 AI Motion Capture - Track Your Hands & Body WITHOUT Bodysuit [FrankMocap]
AI Motion Capture - Track Your Hands & Body WITHOUT Bodysuit [FrankMocap]
bycloud
26 AI Converts Cartoon Characters To Real Life [Pixel2Style2Pixel]
AI Converts Cartoon Characters To Real Life [Pixel2Style2Pixel]
bycloud
27 AI Sky Replacement with SkyAR
AI Sky Replacement with SkyAR
bycloud
28 Better Than DAIN? NEW BEST Tool for Boosting Video's FPS with AI [RIFE/Flowframes]
Better Than DAIN? NEW BEST Tool for Boosting Video's FPS with AI [RIFE/Flowframes]
bycloud
29 AI That Paints Anything Stroke By Stroke
AI That Paints Anything Stroke By Stroke
bycloud
30 What Happens When AI Robots Design Themselves
What Happens When AI Robots Design Themselves
bycloud
31 Deepfake Movements with 1 image ONLY [Liquid Warping GAN]
Deepfake Movements with 1 image ONLY [Liquid Warping GAN]
bycloud
32 ANYTHING can be a "Green Screen" Now [Real-Time High-Resolution Background Matting]
ANYTHING can be a "Green Screen" Now [Real-Time High-Resolution Background Matting]
bycloud
33 AI Transform any Image into Sketch or Line Art [ArtLine]
AI Transform any Image into Sketch or Line Art [ArtLine]
bycloud
AI That Could Soon Replace Vector Artists [DALL-E]
AI That Could Soon Replace Vector Artists [DALL-E]
bycloud
35 Photoshop Detector AI Is Useless
Photoshop Detector AI Is Useless
bycloud
36 The Future Of Online Shopping
The Future Of Online Shopping
bycloud
37 How The Future of Image Search Would Look Like
How The Future of Image Search Would Look Like
bycloud
38 Everyone Can Make 3D Animations Easily Now! [Monster Mash]
Everyone Can Make 3D Animations Easily Now! [Monster Mash]
bycloud
39 3D Video Stabilization with AI [NSFF]
3D Video Stabilization with AI [NSFF]
bycloud
40 OpenAI’s Sarcastic Chat Bot [GPT-3 API Beta]
OpenAI’s Sarcastic Chat Bot [GPT-3 API Beta]
bycloud
41 You Describe & AI Photoshops Faces For You [StyleCLIP]
You Describe & AI Photoshops Faces For You [StyleCLIP]
bycloud
42 You Only Need Audio To Deepfake Now! Might look slightly cursed tho [PCAVS]
You Only Need Audio To Deepfake Now! Might look slightly cursed tho [PCAVS]
bycloud
43 This AI Transfers Anime Back Into Sketch [Anime2Sketch]
This AI Transfers Anime Back Into Sketch [Anime2Sketch]
bycloud
44 AI Learns To Play CS:GO By Watching Humans Play!
AI Learns To Play CS:GO By Watching Humans Play!
bycloud
45 How AI Fixes The Horrendous CR7 Statue
How AI Fixes The Horrendous CR7 Statue
bycloud
46 Best Vocal Isolation & Instrumental Extraction 2021 [lalal.ai vs Spleeter]
Best Vocal Isolation & Instrumental Extraction 2021 [lalal.ai vs Spleeter]
bycloud
47 Face Enhance AI Restores Extremely Blurry Faces [GPEN]
Face Enhance AI Restores Extremely Blurry Faces [GPEN]
bycloud
48 AI That Only Needs 1 Image To Deepfake [SimSwap]
AI That Only Needs 1 Image To Deepfake [SimSwap]
bycloud
49 The Amazing AI Behind the TikTok JoJo Pose Challenge [BoostMonocularDepth + 3DP]
The Amazing AI Behind the TikTok JoJo Pose Challenge [BoostMonocularDepth + 3DP]
bycloud
50 StyleGAN3!? - What AI Actually Sees When Generating Faces [Alias-Free GAN]
StyleGAN3!? - What AI Actually Sees When Generating Faces [Alias-Free GAN]
bycloud
51 AI generated art goes brrrrr [VQGAN+CLIP]
AI generated art goes brrrrr [VQGAN+CLIP]
bycloud
52 AI That Doodles Any Given Description
AI That Doodles Any Given Description
bycloud
53 Best AI Motion Capture 2021 - OpenPose vs DeepMotion
Best AI Motion Capture 2021 - OpenPose vs DeepMotion
bycloud
54 Anime Image Enhance AI Has Gone To The Next Level [Real-ESRGAN]
Anime Image Enhance AI Has Gone To The Next Level [Real-ESRGAN]
bycloud
55 This Video's Voice Is Entirely Made From Audio Deepfake
This Video's Voice Is Entirely Made From Audio Deepfake
bycloud
56 I Can’t Sing So I Cloned My Voice w/ AI To Cover Goodbye Sengen (English Cover)
I Can’t Sing So I Cloned My Voice w/ AI To Cover Goodbye Sengen (English Cover)
bycloud
57 Best Background Removal - AIs Removes BG Without Green Screen And It's Amazing. [RVM]
Best Background Removal - AIs Removes BG Without Green Screen And It's Amazing. [RVM]
bycloud
58 How I Deepfaked VTuber Gawr Gura with AI
How I Deepfaked VTuber Gawr Gura with AI
bycloud
59 AI Magic Removal - Removes ANYTHING & Inpaints For You [LaMa]
AI Magic Removal - Removes ANYTHING & Inpaints For You [LaMa]
bycloud
60 I Did NOT Expect AI Anime Filter To Be This Good [AnimeGANv2]
I Did NOT Expect AI Anime Filter To Be This Good [AnimeGANv2]
bycloud

DALL-E is a revolutionary AI model that can generate images from text descriptions, leveraging GPT-3's capabilities, and demonstrating impressive results in style transfer, object recognition, and illustration. However, it also raises concerns about potential misuse and impact on creative industries.

Key Takeaways
  1. Explore DALL-E's capabilities through OpenAI's official demo
  2. Analyze the model's strengths and weaknesses in image generation
  3. Consider the potential impact of DALL-E on creative industries
  4. Investigate the CLIP model and its role in ranking image similarity
  5. Evaluate the potential risks and concerns associated with DALL-E
💡 DALL-E's capabilities in image generation and style transfer have significant potential, but also raise concerns about misuse and impact on creative industries, highlighting the need for responsible AI development and deployment.

Related AI Lessons

Up next
Tasty Weird! Book 16 by Anh Do · Audiobook preview
Google Play Books
Watch →