What is Torchtext?
Key Takeaways
Torchtext is a PyTorch library that provides fundamental components for working with text data, including datasets and preprocessing pipelines, to accelerate NLP research and ML development. It offers easy access to commonly used datasets, text processing pipelines, and ARP-related modules, with a focus on transferring research to production and engaging with the community to discover novel technologies.
Full Transcript
Oh everyone welcome to the PI torch stammer hexam my name is George I'm a software engineer at Facebook I work for the text domain in high-touch team today I'm going to talk about poached eggs which you my used for some text problem for your summer hexam project so why not how about touch tanks in addition to pi torch first we want to accelerate NLP research and provide some reusable Domino in a crack building block for the cutting edge research based on our knowledge in the text or main and research community second we want to provide a solution to transfer from research to production so we integrate those pipeline in the building block with a wide range of height wash ability such as particles on transition distributed data panel and mobile technology we won't have a better support of fully researched production transition for a lot of into end of the application certainly we also engage with the community and discover novel technology the track stolen team in tight watch want to develop a good technology understanding in the end of the area and abuse new research collaboration with this goal in mind we provide those easy access to some commonly used data set text processing pipeline and some ARP related module here I gave an example show how we engage closely with any researcher in the open source community since the release of transformer and motivic agent last year when you save a lot of feedback on github especially many researcher would like to have more flexibility with the multi heritage container so this half we develop develop a new module called multi chicken tender we will release it by the end of July here I want to give a few highlights for the new feature in our multi-headed engine first is for the drop-in replacements with this only a few lines user will have the full flexibility to try different custom component with the motivation concept in addition to the drop-in replacement the pneumatic annotation container will support our suite and based on the feedback from user we add incremental decoding in the broadcast support with our container we also put together some example to apply the motivation to dinner with some novel research idea so please give us P beta tried once we released in in July at the same time we would like to store easy transfer to the production here I gave an overview for the end-to-end pipeline with hydrogen in Tashkent so the roll text we are really innocent to a field transform like that to the miser and the McAlary currently we are working on rewrite source of data processing transport as a few rows or no building block with G support after this pre-processing the data are sent to data loader in this Emperor where we generates the data back in after the stamp you data already for the model we can also rewrite a few existing players in Taj tents and will release them in point 7 the new dataset show here are fully compatible with data loader in tight watch user will also have the flexibility to build a data processing type 1 based on our standard 2 neither McHenry block so here is a list of the new data sets once it is released please give up kids about rights and it gave us feedback here going to show you a case how to load this data set with a single line and all the defaults they have processing pipeline with another line you will get the material so yeah it's very simple to have those big assets our website we have stereo text related tutorial including the one to show how to use the new data set to text classification nurses we also put together an example in point 7 release and show how to build a pipeline to train the per bottle from scratch so you please yeah so if you have any question I'm trying to talk also feel free to reach out to us on github for the firm hand sound there are many other aoki library like pharisee hacking phase transformer so if you plan to work on some alt problem very likely you don't need to build or staff at scratch thank you so much and enjoy the exome
Original Description
Torchtext is a domain library for PyTorch that provides the fundamental components for working with text data, such as commonly used datasets and basic preprocessing pipelines, designed to accelerate natural language processing (NLP) research and machine learning (ML) development. George Zhang, a PyTorch Software Engineer, provides an overview of Torchtext and walks through the latest updates.
Haven't signed up yet? Get involved, and learn how you could build with the community and also have a chance to win up to $25,000: https://bit.ly/2ZwLYKX
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
Playlist
Uploads from PyTorch · PyTorch · 54 of 60
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
▶
55
56
57
58
59
60
What is PyTorch?
PyTorch
PyTorch Tutorial: A Quick Preview
PyTorch
PyTorch Summer Hackathon 2019
PyTorch
Tips and Tricks on Hacking with PyTorch: A Quick Tutorial by Brad Heintz
PyTorch
PyTorch 1.2 and PyTorch Hub: A Quick Introduction by Soumith Chintala and Ailing Zhang
PyTorch
Torchtext 0.4 with Supervised Learning Datasets: A Quick Introduction by George Zhang
PyTorch
Torchaudio 0.3 with Kaldi Compatibility, New Transforms: A Quick Introduction by Jason Lian
PyTorch
Torchvision 0.4 with Support for Video: A Quick Introduction by Francisco Massa
PyTorch
Introduction to Machine Learning for Developers at F8 2019
PyTorch
Powered by PyTorch at F8 2019
PyTorch
Developing and Scaling AI Experiences at Facebook with PyTorch at F8 2019
PyTorch
New Approaches to Image and Video Reconstruction Using Deep Learning at Facebook at F8 2019
PyTorch
PyTorch Developer Conference 2018: Recap
PyTorch
PyTorch Developer Conference 2018: Keynote & Deep Dive
PyTorch
PyTorch Developer Conference 2018: Production & Research Sessions
PyTorch
PyTorch Developer Conference 2018: Cloud & Academia Sessions
PyTorch
PyTorch Developer Conference 2018: Enterprise, Education, & Future of AI Panel
PyTorch
PyTorch Developer Conference 2019 | Full Livestream
PyTorch
PyTorch Developer Conference 2019: Recap
PyTorch
PyTorch Developer Conference Keynote - Mike Schroepfer
PyTorch
What’s new in PyTorch 1.3 - Lin Qiao
PyTorch
PyTorch Front-End Features: Named Tensors and Type Promotion - Gregory Chanan
PyTorch
Research to Production: PyTorch JIT/TorchScript Updates - Michael Suo
PyTorch
Quantization - Dmytro Dzhulgakov
PyTorch
PyTorch ONNX Export Support - Lara Haidar, Microsoft
PyTorch
Apex - Michael Carilli, NVIDIA
PyTorch
Dataloader Design for PyTorch - Tongzhou Wang, MIT
PyTorch
Linear Algebra in PyTorch - Vishwak Srinivasan, CMU
PyTorch
PyTorch Mobile - David Reiss
PyTorch
Model Interpretability with Captum - Narine Kokhilkyan
PyTorch
Detectron2 - Next Gen Object Detection Library - Yuxin Wu
PyTorch
Speech Extensions to Fairseq - Dmytro Okhonko
PyTorch
PyTorch on Google Cloud TPUs - Google, Salesforce, Facebook
PyTorch
PyTorch Summer Hackathon Winners - Joe Spisak, Sebastien Arnold, Tristan Deleu
PyTorch
PyTorch in Robotics - Yisong Yue, Caltech
PyTorch
StanfordNLP - Yuhao Zhang, Stanford
PyTorch
Sotabench for Reproducible Research - Robert Stojnic, Papers with Code
PyTorch
Collaborative Natural Language Inference - Sasha Rush, Cornell
PyTorch
Privacy Preserving AI - Andrew Trask, OpenMined
PyTorch
CrypTen - Laurens van der Maaten
PyTorch
PyTorch at Uber - Sidney Zhang, Uber
PyTorch
PyTorch at Tesla - Andrej Karpathy, Tesla
PyTorch
PyTorch at Microsoft - Saurabh Tiwary, Microsoft
PyTorch
PyTorch at Dolby Labs - Vivek Kumar, Dolby Labs
PyTorch
PyTorch Developer Conference 2019 - Panel Discussion
PyTorch
Using deep learning and PyTorch to power next gen aircraft at Caltech
PyTorch
Named Tensors, Model Quantization, and the Latest PyTorch Features - Part 1
PyTorch
TorchScript and PyTorch JIT | Deep Dive
PyTorch
Announcing the PyTorch Global Summer Hackathon 2020
PyTorch
Opening Up the Black Box: Model Understanding with Captum and PyTorch
PyTorch
PyTorch Mobile Runtime for Android
PyTorch
Torchvision in 5 minutes
PyTorch
3D Deep Learning with PyTorch3D
PyTorch
What is Torchtext?
PyTorch
TorchAudio: A Quick Intro
PyTorch
PyTorch Mobile Runtime for iOS
PyTorch
PySlowFast: Deep learning with Video
PyTorch
PyTorch Pruning | How it's Made by Michela Paganini
PyTorch
Measuring Fairness in Machine Learning Systems
PyTorch
PyTorch for Hackathons
PyTorch
More on: ML Pipelines
View skill →Related AI Lessons
⚡
⚡
⚡
⚡
10 Python Concepts You Must Know Before Calling Yourself Advanced
Medium · AI
10 Python Concepts You Must Know Before Calling Yourself Advanced
Medium · Data Science
10 Python Concepts You Must Know Before Calling Yourself Advanced
Medium · Programming
10 Python Concepts You Must Know Before Calling Yourself Advanced
Medium · Python
🎓
Tutor Explanation
DeepCamp AI