Using Semantic Trees In Place of Sentences | Munashe Shumba | OpenAI Scholars Demo Day 2018

OpenAI · Intermediate ·📰 AI News & Updates ·7y ago

Key Takeaways

Munashe Shumba discusses using semantic trees in place of sentences for natural language processing, demonstrating the effectiveness of this approach on a semantic relatedness task using dependency trees and LSTMs.

Full Transcript

hi everyone thank you for taking time to come and see what we've been working on last a couple of months my name is Manoj Jaya and my project was on investigating the effectiveness of using somatic trees in place of regular sentences when it comes to natural language processing our problems so to get started let's take a look at some quiz questions okay I have pairs of sentences that either mean the same thing or doughnut hole or somewhere in the middle I want you to take a look at them think how closely related in their meaning they are and come up with a score from 1 to 5 if that's too hard for you you could come up with a score from low medium to high alright I'm gonna give you couple seconds to think all right now I'm gonna tell you this course the first two sentences they are pretty closely related in the meaning talking about dogs fighting wrestling sort of the same thing the second one not really related except that it's somebody doing something the last one somewhere in the middle in one sentence there are kids with a football in the other there kids with that if the ball so like I say my goal is to investigate the effectiveness of using semantic trees in natural language processing instead of using a regular sentences alright so for the tests that I just gave you the quiz if one is to build a model for it you're probably lsdm s because we're talking about sentences it's sequences and it looks something like this data would flow through the Ella stem cells from the first element of the input to the end all right and then at the end of the Elysium layers you have some combination that gives you a prediction so now let's take a look at the inputs it's a sentence and represented as a sequence but is that really how we really think about sentences is that really how we portray the meaning of a sentence if I were to summarize this sentence let's say maybe it's about I mean it's really about our fighting ok I'll put that down fighting but who's fighting some dogs dogs and when is it happening it's happening now or which dogs two dots okay you can see from the top you have the most important aspect of the sentence basically the essence are the meaning and then as you go down you get more details about how this is happening the further down you get you get even more details about the children are far the top-level aspect same thing goes with the other sentence which was in the quiz two dogs are wrestling and hugging it's about wrestling and they's hugging happening with wrestling okay let's move on the task that I chose for this investigation was semantic relatedness so you get two sentences just like in the quiz and you have to come up with a school that saves how similar again in meaning the sentence is armed and I used it is it cold sick no idea come up with that name this dataset has 10,000 pairs of our sentences 10,000 scores to go into the the sentences and I also used a glove which is an embedding for words okay in order to convert my sentences from this data from this sick database this is sick dealer said I used synthetic net which is very well known as parsing on phosphates and the output looks something like this make some mistakes sometimes so you have to manually correct it so from there I built a model again based on SDO I'm sorry LS CMS and it looks like that same as from before just with additional details so the question now is how do I feed my trees into an LST M because remember elysium sequences they work with sentences but trees are you know not quite sequences so what I did is was to express my trees in sequential form pretty easy I did depth-first search so the sentence on the left ribs end about that tree which is truth of the finding Simpson is from before that becomes fighting the main main idea forward by the children are fighting children are dogs and are ok but dogs is a child of its own so the child appears in its onset Bart gets set of brackets right in front of dogs so I trained my models in fact I've trained two models one I used depends the trees oh by the way this fries are called dependency trees so one model I use that this depends the trees and in the other I just use regular sentences and again when I say dependency trees here are I'm talking about the sequential presentation alpha the tip it is dependent the trees from the precursor slide over here what I observed was that the model trained on dependency trees had significantly lower min squid error a surprise there 0.35 and the one that was trying to radiances had 1.3 so that's about three point seven times the amount of the air if you look at the scale on the horizontal axis you can see that the trees model only trained for very few number a very small number of cycles so it got to the perfect level at about 150 steps but the other model it took about one point eight so 106 million steps so significant significantly more they had about the same amount of for training loss it's just one arrived at the same loss after significantly more steps the next step from here is to instead of using a LCM cells I'm planning to use three LST ohms which is this STM that is a specifically for our trees so instead of for working on sipping seeds it works on trees I'm also going to try to use that this same idea on question answering are using the squad database sorry the squad dataset what I've noticed is that with that depends the trees it's actually really easy to manipulate them so if for instance you wanted to up meant the dealer could maybe change the order of five children in the trees and then come up with a new set of trees which you can just pile onto your data set you have more data it works in most cases there's some cases where you just can't do that I'd like to thank my mentor I was not able to come to this event my name is that yes mean she's been a great help and also I'd like to thank the open e aí team for supporting us throughout this program all right I'm gonna open out for questions so feel free [Applause] yeah good that's a good question this question was how were the scores for the sentence pears are calculated wasn't actually an empirical score it was decided by people there was significant consensus but it's really not it's really not an empirical so you can really quite measure it I'm sure you could try to come up with ways of creating a criteria for you know what's a 1 and what's a 5 and wasn't middle but uh yeah it was a lot of people who curated the data said this cause and there was a significant consensus on now on the schools and you can go and take a look at this cause and I'll see how you feel about it when I was looking at the date of myself just you know taking browsing through it I sort of agreed with this course it makes sense but yeah it's not empirical yes great question the question was how did I fit in the words for the tree and what I do with the parentheses I used our glove for embedding the words and then for the parentheses because you know parentheses actually do up here in language you could have sentences that do have parentheses so I I couldn't really use parentheses so what I did is I created my own symbols and I specifically made him so that they were far away from the rest of the words I give them you know really high values for their vectors and the two the two symbols for the opening parentheses in the closing parentheses closer together but again they were really far away from the rest of the day just to make it create the model that this is these are not you know this is not the same same kind of data all right I think that's all I have all right sorry I ran out of time but thank you so much [Applause]

Original Description

Munashe Shumba talks about Using Semantic Trees In Place of Sentences on OpenAI Scholars Demo Day on September 20, 2018. Learn more: https://openai.com/blog/openai-scholars-2018-final-projects#munashe
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from OpenAI · OpenAI · 19 of 60

1 Robots that Learn
Robots that Learn
OpenAI
2 Emergence of Grounded Compositional Language in Multi-Agent Populations
Emergence of Grounded Compositional Language in Multi-Agent Populations
OpenAI
3 OpenAI + Dota 2
OpenAI + Dota 2
OpenAI
4 Dendi vs. OpenAI at The International 2017
Dendi vs. OpenAI at The International 2017
OpenAI
5 Competitive Self-Play
Competitive Self-Play
OpenAI
6 Learning a Hierarchy
Learning a Hierarchy
OpenAI
7 Physical Spam Detection
Physical Spam Detection
OpenAI
8 Ingredients for Robotics Research
Ingredients for Robotics Research
OpenAI
9 OpenAI Five
OpenAI Five
OpenAI
10 OpenAI Five: Dota Gameplay
OpenAI Five: Dota Gameplay
OpenAI
11 Learning Dexterity
Learning Dexterity
OpenAI
12 Learning Dexterity: Uncut
Learning Dexterity: Uncut
OpenAI
13 OpenAI Five Benchmark: Post-Game Analysis
OpenAI Five Benchmark: Post-Game Analysis
OpenAI
14 Investigating Model Based RL for Continuous Control | Alex Botev | 2018 Summer Intern Open House
Investigating Model Based RL for Continuous Control | Alex Botev | 2018 Summer Intern Open House
OpenAI
15 Generative Modelling | Sadhika Malladi | 2018 Summer Intern Open House
Generative Modelling | Sadhika Malladi | 2018 Summer Intern Open House
OpenAI
16 A pathway to more efficient generative models | Will Grathwohl | 2018 Summer Intern Open House
A pathway to more efficient generative models | Will Grathwohl | 2018 Summer Intern Open House
OpenAI
17 Learning Dexterity | Alex Ray | 2018 Summer Intern Open House
Learning Dexterity | Alex Ray | 2018 Summer Intern Open House
OpenAI
18 Robust Vision-Based State Estimation | Hsiao-Yu 'Fish' Tung | 2018 Summer Intern Open House
Robust Vision-Based State Estimation | Hsiao-Yu 'Fish' Tung | 2018 Summer Intern Open House
OpenAI
Using Semantic Trees In Place of Sentences | Munashe Shumba | OpenAI Scholars Demo Day 2018
Using Semantic Trees In Place of Sentences | Munashe Shumba | OpenAI Scholars Demo Day 2018
OpenAI
20 Reinforcement Learning with Prediction-Based Rewards
Reinforcement Learning with Prediction-Based Rewards
OpenAI
21 OpenAI Spinning Up in Deep RL Workshop
OpenAI Spinning Up in Deep RL Workshop
OpenAI
22 Arena Announcement and Closing | OpenAI Five Finals (6/6)
Arena Announcement and Closing | OpenAI Five Finals (6/6)
OpenAI
23 Co-Op Match | OpenAI Five Finals (5/6)
Co-Op Match | OpenAI Five Finals (5/6)
OpenAI
24 OpenAI Five vs. OG, Game 2 | OpenAI Five Finals (4/6)
OpenAI Five vs. OG, Game 2 | OpenAI Five Finals (4/6)
OpenAI
25 OpenAI Five vs. OG, Game 1 | OpenAI Five Finals (3/6)
OpenAI Five vs. OG, Game 1 | OpenAI Five Finals (3/6)
OpenAI
26 Pre-Match Panel Discussion | OpenAI Five Finals (2/6)
Pre-Match Panel Discussion | OpenAI Five Finals (2/6)
OpenAI
27 Opening Keynote | OpenAI Five Finals (1/6)
Opening Keynote | OpenAI Five Finals (1/6)
OpenAI
28 OpenAI Robotics Symposium 2019
OpenAI Robotics Symposium 2019
OpenAI
29 OpenAI Scholars Demo Day 2019
OpenAI Scholars Demo Day 2019
OpenAI
30 Multi-Agent Hide and Seek
Multi-Agent Hide and Seek
OpenAI
31 Solving Rubik’s Cube with a Robot Hand: Uncut
Solving Rubik’s Cube with a Robot Hand: Uncut
OpenAI
32 Solving Rubik’s Cube with a Robot Hand: Perturbations
Solving Rubik’s Cube with a Robot Hand: Perturbations
OpenAI
33 Solving Rubik’s Cube with a Robot Hand
Solving Rubik’s Cube with a Robot Hand
OpenAI
34 Music Generation | Christine Payne | OpenAI Scholars Demo Day 2018
Music Generation | Christine Payne | OpenAI Scholars Demo Day 2018
OpenAI
35 Deephypebot | Nadja Rhodes | OpenAI Scholars Demo Day 2018
Deephypebot | Nadja Rhodes | OpenAI Scholars Demo Day 2018
OpenAI
36 Physics Net | Ifu Aniemeka | OpenAI Scholars Demo Day 2018
Physics Net | Ifu Aniemeka | OpenAI Scholars Demo Day 2018
OpenAI
37 Art Composition Attributes + CycleGAN | Holly Grimm | OpenAI Scholars Demo Day 2018
Art Composition Attributes + CycleGAN | Holly Grimm | OpenAI Scholars Demo Day 2018
OpenAI
38 Generating Emotional Landscapes | Hannah Davis | OpenAI Scholars Demo Day 2018
Generating Emotional Landscapes | Hannah Davis | OpenAI Scholars Demo Day 2018
OpenAI
39 Looking For Grammar In All The Right Places | Alethea Power | OpenAI Scholars Demo Day 2020
Looking For Grammar In All The Right Places | Alethea Power | OpenAI Scholars Demo Day 2020
OpenAI
40 Semantic Parsing English to GraphQL | Andre Carerra | OpenAI Scholars Demo Day 2020
Semantic Parsing English to GraphQL | Andre Carerra | OpenAI Scholars Demo Day 2020
OpenAI
41 Long term credit assignment with temporal reward transp… | Cathy Yeh | OpenAI Scholars Demo Day 2020
Long term credit assignment with temporal reward transp… | Cathy Yeh | OpenAI Scholars Demo Day 2020
OpenAI
42 Social learning in independent multi-agent reinfor… | Kamal N’dousse | OpenAI Scholars Demo Day 2020
Social learning in independent multi-agent reinfor… | Kamal N’dousse | OpenAI Scholars Demo Day 2020
OpenAI
43 Quantifying Interpretability of Models Trained on Coi… | Jorge Orbay | OpenAI Scholars Demo Day 2020
Quantifying Interpretability of Models Trained on Coi… | Jorge Orbay | OpenAI Scholars Demo Day 2020
OpenAI
44 Towards Epileptic Seizure Prediction with Deep Network | Kata Slama | OpenAI Scholars Demo Day 2020
Towards Epileptic Seizure Prediction with Deep Network | Kata Slama | OpenAI Scholars Demo Day 2020
OpenAI
45 Universal Adversarial Perturbations and Language M… | Pamela Mishkin | OpenAI Scholars Demo Day 2020
Universal Adversarial Perturbations and Language M… | Pamela Mishkin | OpenAI Scholars Demo Day 2020
OpenAI
46 Introductions by Sam Altman & Greg Brockman | OpenAI Scholars Demo Day 2020
Introductions by Sam Altman & Greg Brockman | OpenAI Scholars Demo Day 2020
OpenAI
47 Introduction by Sam Altman | OpenAI Scholars Demo Day 2021
Introduction by Sam Altman | OpenAI Scholars Demo Day 2021
OpenAI
48 Breaking Contrastive Models with the SET Card Game | Legg Yeung | OpenAI Scholars Demo Day 2021
Breaking Contrastive Models with the SET Card Game | Legg Yeung | OpenAI Scholars Demo Day 2021
OpenAI
49 Large Scale Reward Modeling | Jonathan Ward | OpenAI Scholars Demo Day 2021
Large Scale Reward Modeling | Jonathan Ward | OpenAI Scholars Demo Day 2021
OpenAI
50 Words to Bytes: Exploring Language Tokenizations | Sam Gbafa | OpenAI Scholars Demo Day 2021
Words to Bytes: Exploring Language Tokenizations | Sam Gbafa | OpenAI Scholars Demo Day 2021
OpenAI
51 Learning Multiple Modes of Behavior in a Continuous… | Tyna Eloundou | OpenAI Scholars Demo Day 2021
Learning Multiple Modes of Behavior in a Continuous… | Tyna Eloundou | OpenAI Scholars Demo Day 2021
OpenAI
52 Scaling Laws for Language Transfer Learning | Christina Kim | OpenAI Scholars Demo Day 2021
Scaling Laws for Language Transfer Learning | Christina Kim | OpenAI Scholars Demo Day 2021
OpenAI
53 Contrastive Language Encoding | Ellie Kitanidis | OpenAI Scholars Demo Day 2021
Contrastive Language Encoding | Ellie Kitanidis | OpenAI Scholars Demo Day 2021
OpenAI
54 Characterizing Test Time Compute on Graph Structur… | Kudzo Ahegbebu | OpenAI Scholars Demo Day 2021
Characterizing Test Time Compute on Graph Structur… | Kudzo Ahegbebu | OpenAI Scholars Demo Day 2021
OpenAI
55 Studying Scaling Laws for Transformer Architecture … | Shola Oyedele | OpenAI Scholars Demo Day 2021
Studying Scaling Laws for Transformer Architecture … | Shola Oyedele | OpenAI Scholars Demo Day 2021
OpenAI
56 Feedback Loops in Opinion Modeling | Danielle Ensign | OpenAI Scholars Demo Day 2021
Feedback Loops in Opinion Modeling | Danielle Ensign | OpenAI Scholars Demo Day 2021
OpenAI
57 Creating a Space Game with OpenAI Codex
Creating a Space Game with OpenAI Codex
OpenAI
58 “Hello World” with OpenAI Codex
“Hello World” with OpenAI Codex
OpenAI
59 Talking to Your Computer with OpenAI Codex
Talking to Your Computer with OpenAI Codex
OpenAI
60 Data Science with OpenAI Codex
Data Science with OpenAI Codex
OpenAI

This video demonstrates the effectiveness of using semantic trees in place of sentences for natural language processing, showcasing a significant reduction in error rate and training time on a semantic relatedness task. The approach uses dependency trees and LSTMs, with the trees represented in sequential form using depth-first search. The video also discusses the use of GloVe for word embedding and the creation of custom symbols for parentheses.

Key Takeaways
  1. Build a semantic tree for a given sentence
  2. Represent the tree in sequential form using depth-first search
  3. Use an LSTM to train on the sequential representation of the tree
  4. Evaluate the model on a semantic relatedness task
  5. Compare the results to a model trained on regular sentences
💡 Using semantic trees in place of sentences can significantly improve the performance of natural language processing models on semantic relatedness tasks.

Related AI Lessons

The AI Hype Cycle: Calm Before the Next Breakthrough?
Understand the AI hype cycle to anticipate the next breakthrough and make informed decisions
Medium · Programming
AI won’t replace scientists. It will make the current model of science obsolete
AI is not replacing scientists, but rather making the current model of science obsolete, enabling new forms of discovery and collaboration
Medium · Data Science
The End of Knowledge: Why Artificial Intelligence Is Changing Not Only What We Know, but What It…
AI is changing the concept of knowledge and how we acquire it, making us question what it means to know something
Medium · AI
Japan Gave the World Robots, Bullet Trains, and PlayStation. So Why Is It Losing the AI Race?
Japan, a pioneer in technology, is struggling to keep up in the AI race, and understanding the reasons behind this can provide valuable insights for other countries and businesses
Medium · AI
Up next
Motorist saved by human chain | 9 News Australia
9 News Australia
Watch →