AI Transforms Google Photos into Real-Life Scenes
Key Takeaways
Researchers at Cornell University introduced a new method to reconstruct photorealistic scenes from public photos on the internet using a novel multiplane image representation called Deep MPI, allowing for real-time synthesis of novel views with continuous lighting and spatial consistency.
Full Transcript
using tourist public photos from the internet they were able to reconstruct multiple viewpoints of a scene conserving the realistic shadows and lightings this is a huge advancement of the state-of-the-art techniques for photorealistic scene rendering and their results are simply amazing let's see how they achieve that and some more examples [Music] this is what's ai and i share artificial intelligence news every week if you are new to the channel and want to stay up to date please consider subscribing to not miss any further news researchers at cornell university introduced a new way to use online public photos taken by tourists to construct a continuous set of light fields and synthesize novel views capturing all times of day scene appearance the complexity behind this task is that all the pictures are taken at different times of the day different seasons and different orientations in order to answer this problem they introduced deep mpi which is a new multiplane image representation that does exactly what they needed their method is completely unsupervised needing zero information other than the photo itself from the internet and allows real-time synthesis of photorealistic views that are continuous in both space and lighting you can see how much better the results are compared with the previous state-of-the-art models now that we've covered what they've done and why it's so impressive let's see how they have achieved that and some more results in short they synthesize arbitrary views of a scene with continuous viewing condition such as lighting by using pictures from the internet of multiple lighting and angle sources it takes unstructured internet pictures of a specific place and learns how to reconstruct a representation of the light field that respects the real world shadow physics as you just saw previous works like fields are inconsistent through the scene which is the greatest contribution of the paper this is done with a two-stage models architecture at first they use their new deep mpi representation they start by reprojecting every image to the reference viewpoint and averaging all these reprojected images at each depth plane thus creating a mean rgb plane sweep volume psv which is a set of views wrapped with disparities in a given range since this mean rgb psv cannot accurately model a scene content that is obstructed in a reference view they introduce the second phase of their network the second part optimizes the latin features in their deep mpi representation using an encoder and a rendering network it is able to capture and re-render time-varying appearance the encoder's role is to produce an appearance vector from an exemplar image and an auxiliary deep buffer containing semantic and depth information of the scene the deep buffer allows the encoder to learn complex appearance by aligning the illumination information in the exemplar image using the scene intrinsic properties encoded in the deep mpi representation without this alignment the results will be as inconsistent as the previous work we've seen this align deep buffer is the main reason for the realistic shadows and lightings in the rendered scenes then the rendering network represented by g in this model's architecture takes both the deep mpi projected to a specific target viewpoint and its appearance vector produced from the encoder and predicts the corresponding rgb color layers this rendering network is a variant of a u-net architecture with an encoder decoder architecture called aiden used for style transfer applications this model produces natural scene appearance while stabilizing the training preserving the color and style of the exemplar images i linked the aiden's architecture paper in the description for more information in short given a specific exemplar photo they were able to synthesize novel views close to the reference point while preserving the exemplars appearance it is mind-blowingly accurate just take a minute to see these results with multiple lightings the link of the project website is in the description with the code and data set coming soon as per the authors do of course this was just a simple overview of this new paper i strongly recommend to read the paper linked in the description for more information please leave a like if you went this far in the video and since there are over 90 percent of you guys watching that are not subscribed yet consider subscribing to the channel to not miss any further news clearly explained if you would like to start or improve with machine learning i've linked all the best online courses in a reporter in the description thank you for watching [Music] you
Original Description
Read the article: https://medium.com/towards-artificial-intelligence/reconstruct-photorealistic-scenes-from-tourists-public-photos-on-the-internet-bb9ad39c96f3
This week my interest was directed towards a new paper where they are using tourists' public photos from the internet, they were able to reconstruct multiple viewpoints of a scene conserving the realistic shadows and lightings! Ask any questions or remarks you have in the comments, I will gladly answer everything!
Subscribe to not miss any AI news and terms clearly vulgarized! Share this to someone who needs to learn more about Artificial Intelligence! Spread knowledge, not germs!
Project page (paper & code coming soon): https://research.cs.cornell.edu/crowdplenoptic/
AdaIN architecture: https://arxiv.org/abs/1703.06868
Follow me for more AI content:
Instagram: https://www.instagram.com/whats_ai/
LinkedIn: https://www.linkedin.com/in/whats-ai/
Twitter: https://twitter.com/Whats_AI
Facebook: https://www.facebook.com/whats.artificial.intelligence/
Medium: https://medium.com/@whats_ai
The best courses to start and progress in AI:
https://www.omologapps.com/whats-ai
Join Our Discord channel, Learn AI Together:
https://discord.gg/SVse4Sr
Support me on patreon:
https://www.patreon.com/whatsai
Chapters:
0:00 Hey! Tap the Thumbs Up button and Subscribe to help me. You'll learn a lot of cool stuff, I promise.
0:38 Paper explanation
4:26 Examples
6:18 Conclusion
Song credit: https://soundcloud.com/mattis-rodrigue/sans-titre
#deeplearning #artificialintelligence #machinelearning
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
Playlist
Uploads from What's AI by Louis-François Bouchard · What's AI by Louis-François Bouchard · 55 of 60
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
▶
56
57
58
59
60
What is Artificial intelligence? | Artificial Intelligence terms explained for everyone 1
What's AI by Louis-François Bouchard
What is Machine Learning? | Introduction to ML for beginners in a minute 2
What's AI by Louis-François Bouchard
What is Deep Learning | Introduction to DL for beginners in a minute 3
What's AI by Louis-François Bouchard
What is Supervised Learning | Machine Learning basics explained for beginners 4
What's AI by Louis-François Bouchard
What is Unsupervised Learning | Machine Learning basics explained for beginners 5
What's AI by Louis-François Bouchard
What is Semi-Supervised Learning | Machine Learning basics explained for beginners 6
What's AI by Louis-François Bouchard
What is Reinforcement Learning | Machine Learning basics explained for beginners 7
What's AI by Louis-François Bouchard
What is Classification | Introduction to Machine Learning for beginners | The Most Used Terms 8
What's AI by Louis-François Bouchard
What is Regression | Introduction to Machine Learning for beginners | The Most Used Terms 9
What's AI by Louis-François Bouchard
What is Clustering | Introduction to Machine Learning for beginners | The Most Used Terms 10
What's AI by Louis-François Bouchard
What is Backpropagation | Artificial Intelligence & Machine Learning Basics for Beginners 11
What's AI by Louis-François Bouchard
What is NLP ? | Introduction to Natural Language Processing for Beginners | Machine Learning 12
What's AI by Louis-François Bouchard
Comparing AGI and Traditional AI: Now and Beyond
What's AI by Louis-François Bouchard
Demystifying Neural Network: A Beginner's Guide to Machine Learning Fundamentals
What's AI by Louis-François Bouchard
Understanding Computer Vision: An Entry-Level Introduction to ML-Driven CV
What's AI by Louis-François Bouchard
Chatbots for Beginners: A Comprehensive Intro to Machine Learning Applications
What's AI by Louis-François Bouchard
What is Image Segmentation ? | Computer Vision & ML Techniques Explained for Beginners 17
What's AI by Louis-François Bouchard
Object Detection Clearly Explained for Everyone
What's AI by Louis-François Bouchard
What is a RNN ? | Introduction to Recurrent Neural Network FOR EVERYONE 19
What's AI by Louis-François Bouchard
What is Transfer Learning ? | Deep Learning Basics Explained for Beginners 20
What's AI by Louis-François Bouchard
Data Science Demystified - An Essential Introduction
What's AI by Louis-François Bouchard
Demystifying Data Mining - A Clear and Concise Explanation
What's AI by Louis-François Bouchard
Decoding Logistic Regression - A Simple and Comprehensive Explanation
What's AI by Louis-François Bouchard
What is the YOLO algorithm? | Introduction to You Only Look Once, Real Time Object Detection 24
What's AI by Louis-François Bouchard
AI or Human? What is the Turing Test
What's AI by Louis-François Bouchard
Genetic Algorithms Demystified - How Algorithms Evolve
What's AI by Louis-François Bouchard
What is Data Labeling ? | Prepare Your Data for ML and AI | Attaching meaning to digital data 27
What's AI by Louis-François Bouchard
Human Pose Estimation in Machine Learning Explained (2D & 3D)
What's AI by Louis-François Bouchard
What is Self-Supervised Learning ? | Will machines be able to learn like humans ? 29
What's AI by Louis-François Bouchard
What are GANs ? | Introduction to Generative Adversarial Networks | Face Generation & Editing - 30
What's AI by Louis-François Bouchard
Introduction to Energy-Based Learning | Yann LeCun Paper
What's AI by Louis-François Bouchard
The Science Behind Google Translate: Understanding Transformers
What's AI by Louis-François Bouchard
Mastering CNNs in 5 Minutes | ConvNets Explained
What's AI by Louis-François Bouchard
Discover the Power of YOLOv4 - Real-Time Object Detection Simplified
What's AI by Louis-François Bouchard
Learn to Draw Real People using AI: Unveiling Future of Image-to-Image Translation
What's AI by Louis-François Bouchard
AI Powers PAC-MAN - The Game Engine-Free Revolution
What's AI by Louis-François Bouchard
This AI makes blurry faces look 60 times sharper! Introduction to PULSE: photo upsampling
What's AI by Louis-François Bouchard
Facebook's TransCoder: Converting Programming Languages with AI
What's AI by Louis-François Bouchard
Transforming Images to 3D Models with AI - Discover PIFuHD
What's AI by Louis-François Bouchard
Optimize Your ML Models - Avoid Underfitting and Overfitting
What's AI by Louis-François Bouchard
Behind the Scenes - Disney's Secrets to High-Res Face Swaps
What's AI by Louis-François Bouchard
Linear Regression in Machine Learning Explained in 5 Minutes
What's AI by Louis-François Bouchard
Style Transfer Better Than GANs! Swapping Autoencoder Explained
What's AI by Louis-François Bouchard
Use AI to Remove Objects from Videos
What's AI by Louis-François Bouchard
OpenAI's Language Generator: GPT | The first AI Generating Text, Code, Websites...
What's AI by Louis-François Bouchard
Autocomplete Images With AI: image-GPT explained
What's AI by Louis-François Bouchard
Turning Reality into Art - AI That Cartoonizes Your Pictures and Videos
What's AI by Louis-François Bouchard
From Portrait to Cartoon - Discover the Power of FreezeG
What's AI by Louis-François Bouchard
Transfer clothes between photos using AI. From a single image!
What's AI by Louis-François Bouchard
Precise 3D Human Pose and Mesh Estimation from a Single RGB Image
What's AI by Louis-François Bouchard
Smart Navigation - How AI Robots Understand and Explore Environments
What's AI by Louis-François Bouchard
Techfitlab Breaks Down Tesla Autopilot, AI, ML, and DL Complexities
What's AI by Louis-François Bouchard
ECCV 2020 Best Paper Award | RAFT: A New Deep Network Architecture For Optical Flow | WITH CODE
What's AI by Louis-François Bouchard
Maximize Business Efficiency with AI / GPT Technology!
What's AI by Louis-François Bouchard
AI Transforms Google Photos into Real-Life Scenes
What's AI by Louis-François Bouchard
Old Photo Restoration Using Deep Learning | 2020 Novel Approach Explained & Results
What's AI by Louis-François Bouchard
This computer vision algorithm removes the water from underwater images !
What's AI by Louis-François Bouchard
DeepFakes in 5 minutes | Understand how deepfakes work and create your own!
What's AI by Louis-François Bouchard
A new brain-inspired intelligent system can drive a car using only 19 control neurons!
What's AI by Louis-François Bouchard
Toonify: Turn Real Faces into Animated Disney Characters
What's AI by Louis-François Bouchard
More on: Reading ML Papers
View skill →Related AI Lessons
⚡
⚡
⚡
⚡
I Spent Weeks Looking for a Research Gap Before I Realized I Was Searching the Wrong Way
Medium · AI
ICMI 2026 Reviews [D]
Reddit r/MachineLearning
Workshop submission for main conference paper under review [D]
Reddit r/MachineLearning
Kept context-switching between arxiv, OpenReview, GitHub, and HuggingFace for every paper, so I built this. Chrome extension + website with everything inline, plus citation graph + SPECTER2 neighbors. 3M papers, free, feedback welcome [P]
Reddit r/MachineLearning
Chapters (4)
Hey! Tap the Thumbs Up button and Subscribe to help me. You'll learn a lot of co
0:38
Paper explanation
4:26
Examples
6:18
Conclusion
🎓
Tutor Explanation
DeepCamp AI