PyTorch Front-End Features: Named Tensors and Type Promotion - Gregory Chanan

PyTorch · Beginner ·🧬 Deep Learning ·6y ago

Skills: ML Maths Basics70%

Key Takeaways

The video discusses PyTorch's new front-end features, specifically named tensors and type promotion, which aim to simplify the user experience and improve productivity in deep learning tasks. Named tensors allow users to name dimensions, making code more readable and maintainable, while type promotion enables automatic handling of data type mismatches in tensor operations.

Full Transcript

[Music] hi everyone my name is Greg Chanin and I am one of the tech leads on PI torch and I'm going to be talking about the new front-end features available in pi torch 1.3 or how name tensors and tie promotion can simplify your PI torch life and the PI torch front-end were really guided by having the best user experience by focusing on expressivity and productivity you see this from the beginning of high torch where we focused on building a framework around writing programs not manually building graphs and if we apply this thinking to pi torch today to think you know how can we make pi torch even more expressive and make using pi torch even more productive where can we make improvements I think one area is around retaining semantic information so today users start with images with text with video but as soon as you start doing PI torch operations we kind of force you to throw away that information and use an abstract mathematical object called the tensor so the idea behind the in tensor is super simple it's to name dimensions so if we take our image we turn it into the typical 3d tensor format instead of erasing what the dimensions mean we're gonna name them explicitly so height width channel and HW see in this example this idea was proposed in a blog called tensor considered harmful by Professor Sascha rush who's now at Cornell Tech and we worked closely with professor rush on developing this named tensor feature okay so how does this look like what does this look like in code today you'll often see tensor dimensions being named by comments so here we set up a tensor and then we name it n CHW and then when we access the dimensions we access them by positions so here we are summing over the channels and I know that because I can you know refer back to the comment and this is trivial in a tusla to line slide example but in a real program with complex shape manipulation operation this can be very onerous to track so at its core name tensors is a simple extension to the to the tensor API which is just passing names explicitly so here we pass the names of the dimensions as n CHW and then I can access dimensions by name to the by position so here we're summing over the channels just like we did on the left hand side and this is more readable because it's closer to our intent right I want to sum over the channels and I don't want to sum over the first dimension and it's also more maintainable because if my shape manipulation code changes the names are automatically propagated and so if my channels end up in a different position this code is resilient to those changes okay so that's a very high-level view of the API let's I'm just going to go through a quick case study on image normalization here's a function from torch vision which is normalizing the channels over a batch of images and if you look at the implementation in torch vision you'll see something like this which is using none indexing and if you're familiar with not indexing essentially it's shifting the dimension positions by one this is pretty hard to read it's pretty unintuitive because like you can't reason from first principles what indexing a dimension by none means and it also has this other issue where there are many different formats and if you if you manipulate dimensions by position and you have different formats like NC h WN h WC etc you have to have multiple normalization functions and then you also have to be very careful because we type a centrally type a trace with the dimensions mean not to call the wrong normalization function for your format so how can we improve this situation we just use this nifty new function called align as so same set up but now we're going to say that all the tensors are named and we're going to again subtract the mean divide off the standard deviation but instead of using none indexing we're gonna call aligned as so essentially what this does is shifts over the channel dimension of the mean to match wherever the channels are in the batch of images so this is more readable than the non indexing and it also works for all the different formats so no matter where the channels dimensions are positionally in the image the channels will be just be shifted by name and so I only need one normalization function and I no longer need to worry about calling the right normalization function for my format okay that's a super brief introduction to name tensors it's considered experimental in version 1.3 but we would love if you tried it out and gave us feedback the core functionality is in an eager mode this basically means that the top-level Torche operations are supported have name propagation rules you can also mix named and unnamed tensors this is a useful property so that you can add name tensors incrementally to your program you don't have to do it all in one go but this is interesting to you there's a more in-depth tutorial online about supporting name tensors in multi-headed attention that just goes into a lot more detail in the future we'll be expanding coverage to more apply torch so this means most of the NN package will be supported we will propagate Auto grad names today you can you can run Auto grad on name tensors but the gradients that come out are unnamed and we'll do similar things for serialization multi-processing distributed and JIT okay that was named tensors I'm going to talk briefly about type promotion which is just a nice quality of life improvement that we've added in version 1.3 around MIT's d-type operations so you may have seen or written PI torch code like this in the past this is just adding a Python number to a tensor and this just works even though the number is an integer and the tensor is floating point where pi charges handles this automatically for you but in previous versions of high torch if we tried to generalize this to tensors and replace the integer number with a integer tensor you would get an error that complained about D type mismatch so essentially we've just generalized this this feature to tensors now this will just work in 1.3 and it also works for all D types so that means that you know I showed an example of floating point tensors and integer tensors but you can mix float32 and float64 tensors and essentially the type promotion system will just pick the minimal D type that retains the fidelity of the data so type promotion is available in 1.3 arithmetic operations comparison operations and a number of other operations are supported there's full documentation on the website about all the rules if for you numpy fans out there the rules are very similar to numpy we just made some slight tweaks to support our use case and in the future we'll be expanding coverage to the long tail of operators that was a brief intro to name tensors and type promotion we'd love for you to try them give us feedback so we can tailor it for your use case and our hope here is really that incorporating these features into your program will make hi towards programs more readable more maintainable less air prone and ultimately that that makes writing PI torch even more enjoyable than it is today thank you [Applause] [Music]

Original Description

Cornell University’s Sasha Rush has argued that, despite its ubiquity in deep learning, the traditional implementation of tensors has significant shortcomings, such as exposing private dimensions, broadcasting based on absolute position, and keeping type information in documentation. He proposed named tensors as an alternative approach. PyTorch now supports the ability to name tensors, allowing for clearer code with less need for inline comments.

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from PyTorch · PyTorch · 22 of 60

← Previous Next →

What is PyTorch?

What is PyTorch?

PyTorch Tutorial: A Quick Preview

PyTorch Tutorial: A Quick Preview

PyTorch Summer Hackathon 2019

PyTorch Summer Hackathon 2019

Tips and Tricks on Hacking with PyTorch: A Quick Tutorial by Brad Heintz

Tips and Tricks on Hacking with PyTorch: A Quick Tutorial by Brad Heintz

PyTorch 1.2 and PyTorch Hub: A Quick Introduction by Soumith Chintala and Ailing Zhang

PyTorch 1.2 and PyTorch Hub: A Quick Introduction by Soumith Chintala and Ailing Zhang

Torchtext 0.4 with Supervised Learning Datasets: A Quick Introduction by George Zhang

Torchtext 0.4 with Supervised Learning Datasets: A Quick Introduction by George Zhang

Torchaudio 0.3 with Kaldi Compatibility, New Transforms: A Quick Introduction by Jason Lian

Torchaudio 0.3 with Kaldi Compatibility, New Transforms: A Quick Introduction by Jason Lian

Torchvision 0.4 with Support for Video: A Quick Introduction by Francisco Massa

Torchvision 0.4 with Support for Video: A Quick Introduction by Francisco Massa

Introduction to Machine Learning for Developers at F8 2019

Introduction to Machine Learning for Developers at F8 2019

Powered by PyTorch at F8 2019

Powered by PyTorch at F8 2019

Developing and Scaling AI Experiences at Facebook with PyTorch at F8 2019

Developing and Scaling AI Experiences at Facebook with PyTorch at F8 2019

New Approaches to Image and Video Reconstruction Using Deep Learning at Facebook at F8 2019

New Approaches to Image and Video Reconstruction Using Deep Learning at Facebook at F8 2019

PyTorch Developer Conference 2018: Recap

PyTorch Developer Conference 2018: Recap

PyTorch Developer Conference 2018: Keynote & Deep Dive

PyTorch Developer Conference 2018: Keynote & Deep Dive

PyTorch Developer Conference 2018: Production & Research Sessions

PyTorch Developer Conference 2018: Production & Research Sessions

PyTorch Developer Conference 2018: Cloud & Academia Sessions

PyTorch Developer Conference 2018: Cloud & Academia Sessions

PyTorch Developer Conference 2018: Enterprise, Education, & Future of AI Panel

PyTorch Developer Conference 2018: Enterprise, Education, & Future of AI Panel

PyTorch Developer Conference 2019 | Full Livestream

PyTorch Developer Conference 2019 | Full Livestream

PyTorch Developer Conference 2019: Recap

PyTorch Developer Conference 2019: Recap

PyTorch Developer Conference Keynote - Mike Schroepfer

PyTorch Developer Conference Keynote - Mike Schroepfer

What’s new in PyTorch 1.3 - Lin Qiao

What’s new in PyTorch 1.3 - Lin Qiao

PyTorch Front-End Features: Named Tensors and Type Promotion - Gregory Chanan

PyTorch Front-End Features: Named Tensors and Type Promotion - Gregory Chanan

Research to Production: PyTorch JIT/TorchScript Updates - Michael Suo

Research to Production: PyTorch JIT/TorchScript Updates - Michael Suo

Quantization - Dmytro Dzhulgakov

Quantization - Dmytro Dzhulgakov

PyTorch ONNX Export Support - Lara Haidar, Microsoft

PyTorch ONNX Export Support - Lara Haidar, Microsoft

Apex - Michael Carilli, NVIDIA

Apex - Michael Carilli, NVIDIA

Dataloader Design for PyTorch - Tongzhou Wang, MIT

Dataloader Design for PyTorch - Tongzhou Wang, MIT

Linear Algebra in PyTorch - Vishwak Srinivasan, CMU

Linear Algebra in PyTorch - Vishwak Srinivasan, CMU

PyTorch Mobile - David Reiss

PyTorch Mobile - David Reiss

Model Interpretability with Captum - Narine Kokhilkyan

Model Interpretability with Captum - Narine Kokhilkyan

Detectron2 - Next Gen Object Detection Library - Yuxin Wu

Detectron2 - Next Gen Object Detection Library - Yuxin Wu

Speech Extensions to Fairseq - Dmytro Okhonko

Speech Extensions to Fairseq - Dmytro Okhonko

PyTorch on Google Cloud TPUs - Google, Salesforce, Facebook

PyTorch on Google Cloud TPUs - Google, Salesforce, Facebook

PyTorch Summer Hackathon Winners - Joe Spisak, Sebastien Arnold, Tristan Deleu

PyTorch Summer Hackathon Winners - Joe Spisak, Sebastien Arnold, Tristan Deleu

PyTorch in Robotics - Yisong Yue, Caltech

PyTorch in Robotics - Yisong Yue, Caltech

StanfordNLP - Yuhao Zhang, Stanford

StanfordNLP - Yuhao Zhang, Stanford

Sotabench for Reproducible Research - Robert Stojnic, Papers with Code

Sotabench for Reproducible Research - Robert Stojnic, Papers with Code

Collaborative Natural Language Inference - Sasha Rush, Cornell

Collaborative Natural Language Inference - Sasha Rush, Cornell

Privacy Preserving AI - Andrew Trask, OpenMined

Privacy Preserving AI - Andrew Trask, OpenMined

CrypTen - Laurens van der Maaten

CrypTen - Laurens van der Maaten

PyTorch at Uber - Sidney Zhang, Uber

PyTorch at Uber - Sidney Zhang, Uber

PyTorch at Tesla - Andrej Karpathy, Tesla

PyTorch at Tesla - Andrej Karpathy, Tesla

PyTorch at Microsoft - Saurabh Tiwary, Microsoft

PyTorch at Microsoft - Saurabh Tiwary, Microsoft

PyTorch at Dolby Labs - Vivek Kumar, Dolby Labs

PyTorch at Dolby Labs - Vivek Kumar, Dolby Labs

PyTorch Developer Conference 2019 - Panel Discussion

PyTorch Developer Conference 2019 - Panel Discussion

Using deep learning and PyTorch to power next gen aircraft at Caltech

Using deep learning and PyTorch to power next gen aircraft at Caltech

Named Tensors, Model Quantization, and the Latest PyTorch Features - Part 1

Named Tensors, Model Quantization, and the Latest PyTorch Features - Part 1

TorchScript and PyTorch JIT | Deep Dive

TorchScript and PyTorch JIT | Deep Dive

Announcing the PyTorch Global Summer Hackathon 2020

Announcing the PyTorch Global Summer Hackathon 2020

Opening Up the Black Box: Model Understanding with Captum and PyTorch

Opening Up the Black Box: Model Understanding with Captum and PyTorch

PyTorch Mobile Runtime for Android

PyTorch Mobile Runtime for Android

Torchvision in 5 minutes

Torchvision in 5 minutes

3D Deep Learning with PyTorch3D

3D Deep Learning with PyTorch3D

What is Torchtext?

What is Torchtext?

TorchAudio: A Quick Intro

TorchAudio: A Quick Intro

PyTorch Mobile Runtime for iOS

PyTorch Mobile Runtime for iOS

PySlowFast: Deep learning with Video

PySlowFast: Deep learning with Video

PyTorch Pruning | How it's Made by Michela Paganini

PyTorch Pruning | How it's Made by Michela Paganini

Measuring Fairness in Machine Learning Systems

Measuring Fairness in Machine Learning Systems

PyTorch for Hackathons

PyTorch for Hackathons

The video introduces named tensors and type promotion in PyTorch, which can simplify deep learning tasks and improve productivity. Named tensors allow users to name dimensions, making code more readable and maintainable, while type promotion enables automatic handling of data type mismatches in tensor operations.

Key Takeaways

Understand the concept of named tensors and their benefits
Apply named tensors to simplify code
Use type promotion to handle data type mismatches in tensor operations
Experiment with named tensors and type promotion in PyTorch

💡 Named tensors and type promotion can significantly improve the readability and maintainability of deep learning code, reducing errors and improving productivity.

🔒 Pro feature: Ask AI to explain this lesson →

More on: ML Maths Basics

View skill →

Important Steps I Have Followed To Improve My Data Science Skills- Sharing My Experience

Important Steps I Have Followed To Improve My Data Science Skills- Sharing My Experience

Learn Python FAST for Beginners 🚀#coding #conditionals #loops #functions

Learn Python FAST for Beginners 🚀#coding #conditionals #loops #functions

ChethanAIChronicles

“Hello, world” from scratch on a 6502 — Part 1

“Hello, world” from scratch on a 6502 — Part 1

PCA (Principal Component Analysis) in Python - Machine Learning From Scratch 11 - Python Tutorial

PCA (Principal Component Analysis) in Python - Machine Learning From Scratch 11 - Python Tutorial

ROC and AUC in R

ROC and AUC in R

StatQuest with Josh Starmer

Data Science Fundamentals: Data Cleaning in Python

Data Science Fundamentals: Data Cleaning in Python

Related AI Lessons

Want to get started with deep learning

Get started with deep learning by leveraging resources like Andrew Karpathy's playlist and frameworks such as TensorFlow or PyTorch

Reddit r/deeplearning

Building a Deepfake Detector From Scratch — What Nobody Tells You

Learn to build a deepfake detector from scratch and understand the challenges involved in detecting AI-generated fake media

Medium · Deep Learning

Unfolding the Meandering Path: High-Dimensional Invariance and the Flat 2D Plane of Neural…

Learn about high-dimensional invariance and its relation to the flat 2D plane of neural networks, and how to apply these concepts to improve model performance

Medium · Deep Learning

Implementing Neural Style Transfer from Scratch: The Project That Started It All

Learn to implement Neural Style Transfer from scratch and understand its significance in deep learning

Medium · Deep Learning

Image Classification with ml5.js

The Coding Train