Proof that AI Understands? ๐Ÿ‘€ Andrew Ng on LLMs building mental models, Othello GPT, Geoffrey Hinton

Wes Roth ยท Beginner ยท๐Ÿง  Large Language Models ยท2y ago

Key Takeaways

The video discusses how large language models (LLMs) build world models and convey learning and understanding of the world, with a focus on Othello GPT and its ability to predict legal moves in the game of Othello without explicit training, featuring insights from Andrew Ng and Geoffrey Hinton.

Full Transcript

yeah I think it's fairly urgent for the researchers to come to a consensus about whether these big chatbots like gpg4 or Bard actually understand what they're saying um there's clearly some people believe they do and some people believe they're just stochastic parrots I believe the large language models and other large AI models are building a world model or building something that looks a lot like a world model so my guts is that I do believe to the extent this building World model is conveying learning some understanding of the world so there's been a lot of discussion recently whether or not llm models these AI models behind tragic BT and others do they understand can they reason do they think what are words you want to use the question is is our children is this an intelligence that is in some way similar to humans or is this just a very Advanced autocomplete model that's spitting out statistically likely words one after the other and he posts a suite saying do large language models really understand the world or just give the appearance of understanding and then he talks about some evidence from Othello GPT shows that llms build models of how the world Works which makes me comfortable saying that they do understand as yesterday's positive report card shows children's do learn do larger language models understand the world as a scientist and engineer I've avoided asking whether an AI system understands anything there's no widely agreed upon scientific tests for whether system really understands as opposed to appearing to understand just as such no tests exist for Consciousness or sentience as I discussed in an earlier letter this makes the question of understanding a matter of philosophy rather than science but with this caveat I believe that LMS build sufficiently complex models of the world that I feel comfortable saying that to some extent they do understand the world so Othello a board game in which two players take turns placing game pieces on an 8x8 grid I'm not familiar with this game but they do have some diagrams on the paper that they wrote so that might be helpful so during training the network saw only sequences of moves it wasn't explicitly told that these were moves on a square 8x8 board or the rules of the game after training on a large data set of such moves it did a decent job of predicting what the next move should be the key question is did the network make these predictions by building a world model that is did it discover that there was an 8x8 board and a specific set of rules for placing pieces on it that underpin these moves the authors demonstrated convincingly that the answer is yes specifically given a sequence of moves the Network's hidden unit activations appear to capture a representation of the current board position as well as available legal moves this shows that rather than being a stochastic parrot that tried only to mimic the statistics of straining data the network did indeed build a world model while this study used Othello I have little doubt that LMS trained on human text also build World models you know with that let's take a look at this paper it's called emergent World representations exploring a sequence model trained on a synthetic task and so here we have people from Harvard MIT Northeastern University language models show a surprising range of capabilities but the source of their apparent competence isn't clear do these networks just memorize a collection of service statistics or do they rely on internal representation of the process that generates the sequences they see and so they have a variant of the GPT model made to predict legal moves in a simple board game a fellow after you actually I didn't realize this but I do know this game I just never heard of it as Othello and so here they basically describe the problem so they're basically saying that some people believe that these models these have seemingly good performance from memorizing certain statistics and they're just autocompletes and stochastic parrots and there's nothing really intelligent going on there the sort of the Counterpoint to that that the people believe is that they do build some sort of world model they do develop some understanding they do develop some emergent abilities and skills that they were not trained for and yet somehow they learned to do so they think of the board of Othello as the world right that grid 8x8 grade as the world but they don't tell the model that it is an eBay grid they keep that secret and in fact the model has no previous knowledge of the game or its rules all that it sees during training is a series of tokens derived from the game transcripts so as they pointed out here before or rather I guess those back on this page you know the moves might be D3 C5 F6 Etc so basically it's just these collections of numbers and letters that describe where on the board that move is taking place and they do not explicitly train the model to make good moves or to win the game nevertheless the model is able to generate legal AFL moves with high accuracy so in Othello like we said the world consists of the the current board position quick explanation of how the researchers approach this problem they want to figure out if Othello GPT a computer program designed for the game Othello truly understands the current state of the game board understanding the state of the game means knowing where all the black and white discs are on the board and what spaces are empty to do this they use a special tool called a probe imagine a probe as a detective that looks inside Othello gpt's thinking process it examines the thoughts or internal signals of the program and tries to predict specific features like the current state of the game board we train this detective the probe to predict the game's board state by looking at the signals inside Othello GPT after certain moves are made if the probe can accurately predict the state of the game board it means that Othello GPT is actually understanding and representing the game's current situation so one thing that we need to know to understand this paper is what a latent saliency map is to better understand what happens in the hidden layers of the neural networks we can use latent saliency Maps these Maps provide a visual representation of the parts of the neural networks engaged in decision making in these images for example the top row are actual photos and the AI is asked to determine if something is present in those images for example it's asked it it seems someone smiling or someone wearing glasses or if there is a blue square in the image below the image for that particular query is a latent saliency map that the AI used to to determine whether the image has those objects for example to determine if an image has someone smiling it focused on the area around the mouth and cheeks to determine if someone has bushy eyebrows it focused on the area around the eyes and eyebrows to find if there is a blue square if focused on the Blue Square in the image this shows that it's understanding what we want and it's looking in the right place to find it as a counter example there was an instance where a model That was supposed to detect if an image was that of a dog or a wolf started to sometimes mistake the two it was later determined that the model looked for items in the background of the image to determine if the object was a wolf or not images of dogs in a snowy forest for example would be classified as wolves by looking at a latent saliency map of that model we would be able to see that if it was making decisions based on what it found in the background instead of solely looking at the animal and so here's the sort of published Laden salinity Maps so each Supply shows a different game State and the top one prediction by the model is enclosed in the black box so it's all of these ones that's the thing that it thinks is the best move or at least a correct move it's the move it's predicting and so in conclusion our experience provided evidence that Othello gbt maintains a representation of game board states that is the Othello world to produce sequences it was trained on so I wanted to jump here really fast and just kind of like maybe over explain this thing a little bit just in case people are not quite getting what is happening because this is kind of important so here I have a notepad with you know it says E4 E5 D5 F4 Etc and then this so it has I believe 64 total moves this is a game of Othello or river C or whatever reverse whatever you want to call it so they made this into a sentence with each coordinate being a word so just like I would say I went to the store today this is that but in or fellow notation of how the players move so probably goes I think black goes first so it's black white black white Etc until no moves are left so they take this game this sentence and they make a whole book out of it so many many sentences I forget the exact where they're using thousands or Millions whatever a huge number of these games AKA sentences and they feed it into the machine and then we start asking the machine to predict move so let's say we get you know we play a game like this and we ask okay what do you think comes next and goes well I think H4 comes next this is not surprising everyone agrees that it can do this at this point no one's denying its predictive abilities because what it's doing is it's sort of predicting it's using statistics to predict next move and it's getting pretty good at it but where the communities kind of splitting up is that there's part of them they're saying it's just this it's just statistics it's just it's just like a parrot that is just kind of repeating what it's heard and there's nothing else going on beyond that and so the other side of the debate and as you see Andrew Noe and Jeffrey Hinton they talk about this they're also saying yeah it is just statistics but there seems to be some emerging skills and properties that are occurring that really seem like this thing is understanding what it's doing it's building certain mental models that showed that it's deducing that it's inferring some information that we never told it about that we never trained it to do and so when we start probing into it and what we begin to realize that you know let's say we get to right here prediction black E6 I mean this is a fake game I made up it has nothing to do with this but let's say you're over here the next thing it predicts is E6 so it's like this is the next move fine that makes total sense but here's where it gets weird we can start asking it to predict the state of all these other squares on the board and again we never told it that there was a board we never told it there were squares winner explain what the different states were the different states are you know it's either empty or has a black tile on it or it has a white tile on it so but that's what they mean by state is like what what's there you know nothing white or black and so if this was just a statistical prediction machine you know and it spits out E6 is the next move and we're like okay but what's on A1 it would have no idea it would say what do you mean what's on a one right well there there wouldn't be any concept of it thinking that there's something there or something isn't there it might not even realize that this is where it is it might not realize that there's a board like there's it has no way of of knowing this this information was not given to it and yet somehow it is able to visualize it predict where all these other pieces are and not only that when it's trying to make its next move these kind of colors show what squares it's looking at to make those predictions so for example I believe they said that red is high blue is low so the very red squares it means that those are really taken into account in predicting the next move whereas the squares are marked blue not so much so for example let's say it's predicting that the white piece will be placed right here what squares is it looking at to make that decision well it's not this one it doesn't here at all if this is empty or if it's white or if it's red it cares not right it's looking kind of like this these squares to see it's looking at the red squares to see that prediction and so the contribution by what they mean by contribution how much do the squares contribute to its prediction the contribution is higher when changing the internal representation of square makes the prediction prediction less likely what's funny is this reminded me of an episode from the office if you've ever watched that show where Michael the boss who's supposed to be kind of not the smartest person in the room he starts acting a little bit weird and one of the employees Oscar who's one probably one of the smarter characters he has an idea he thinks that the reason that Michael is acting weird is that he's aware that he can receive a certain bonus from the company if he completes some objective that kind of goes against the rest of the employees so he asks them a question that normally Michael would not be able to answer do you know do I know what I think you know no does anyone happen to know what 15 of 4 300 is 645 Michael's a genius right why did you say dollars because that is how my mind works what's 15 of 200 thank you everyone Michael is returning the Surplus so he can keep 15 as a bonus so the idea here is similar here's what we told it here's the information we gave it we didn't tell it about a board we didn't tell it about a square we didn't tell it about tiles or whatever so later when we ask what's the state of this particular square and goes oh there's a black tile on it that means it knows about the tile and the board and the square it figured it out it figured it out from this from just rows and rows and lines and lines of this and nothing else but why is this important like who cares if they're building mental models or not here's a video clip that I think kind of shines some light in this this is Jeffrey Hinton so he's often referred to as the Godfather of AI he was the original person that really double down on neural networks specifically something that resembled the human brain the neural network of the human brain as being sort of the key that's going to unlock Ai and he held on to that belief for many decades when it was not the popular belief like he was going sort of against consensus against you know all the other experts in the field and he held on to that belief and he said no it's going to be neural networks and so it's you know 2023 now and uh we're kind of like yeah the dude was right let's take a look at this because these two guys are some of the brightest Minds in Ai and uh it is important to know what they think so let's dive in and uh my name is Wes Roth thank you for watching yeah I think it's fairly urgent for the researchers to come to a consensus about whether these big chatbots like gpc4 or Bard actually understand what they're saying um there's clearly some people believe they do and some people believe they're just stochastic parrots and so long as we have those difference um we're not going to be able to come to a consensus about dangers and so I think it's sort of urgent from for the research Community to address this issue of whether they understand or not and I think both of us believe they do understand but people we respect a lot like Yan think they don't really understand and it's crucial to resolve this issue and we may not be able to come to success about other issues until we've resolved that issue my view is that I believe the large language models and other large AI models are building a world model or building something that looks a lot like the world model so my guts is that I do believe to the extent this building a world model is conveying learning some understanding of the world and one aspect of this is the idea it's just statistics and we all agree that in some sense it has to be just statistics all these things have is the statistics of their input but many people who think it's just statistics of thinking in terms of things like trigram models or counting co-occurrence frequencies of words and it's not just that we believe that this process of creating features of embeddings and then interactions between features um is actually understanding once you've once you've taken the raw data of symbol streams and you can now predict the next symbol not by things like trigrams but by huge numbers of features interacting in very complicated ways to predict the features of the next word and from that make a prediction about the probabilities of next words the point is that is understanding at least I believe that is understanding I believe that's what our brains are doing too that they're not just stochastic parrots

Original Description

#gpt4 #deepmind #openai ๐Ÿ”ฅ Get my A.I. + Business Newsletter (free): https://natural20.com/ https://twitter.com/AndrewYNg/status/1689693276234989569 https://www.deeplearning.ai/the-batch/issue-209/ https://arxiv.org/abs/2210.13382 [RELATED VIDEOS] Minecraft AI - SELF-IMPROVING ๐Ÿคฏ autonomous agent: https://www.youtube.com/watch?v=7yI4yfYftfM 25 ChatGPTs play a videogame... https://youtu.be/GwsRu9yLXnw GPT-4 leaked! ๐Ÿ”ฅ All details exposed ๐Ÿ”ฅ It is over... https://youtu.be/GwsRu9yLXnw LLMs as Tool Makers [LATM] - GPT-4 *UPGRADES* lower AI Models. https://www.youtube.com/watch?v=qWI1AJ2nSDY Sam Altman on UBI and Massive Job Losses from AI Automation https://www.youtube.com/watch?v=5Nsqv3FWXio [TIMELINE] [00:00] Geoffrey Hinton [00:33] Andrew Ng [03:06] Emergent World Representations: Othello-GPT [07:46] TLDR summary [14:28] Geoffrey Hinton and Andrew Ng Discuss
Watch on YouTube โ†— (saves to browser)
Sign in to unlock AI tutor explanation ยท โšก30

Playlist

Uploads from Wes Roth ยท Wes Roth ยท 54 of 60

1 Which Vanguard index fund to buy? (hint: it's the one Warren Buffett recommends)
Which Vanguard index fund to buy? (hint: it's the one Warren Buffett recommends)
Wes Roth
2 What does PALANTIR do - Palantir Stock, Founder, Controversy Explained Simply (plus why I'm BUYING).
What does PALANTIR do - Palantir Stock, Founder, Controversy Explained Simply (plus why I'm BUYING).
Wes Roth
3 Paypal misinformation fine ($2,500) - Close Your Accounts ASAP!
Paypal misinformation fine ($2,500) - Close Your Accounts ASAP!
Wes Roth
4 China Was Just Sent Back to the Dark Ages  |  US starts aggressively cutting ties
China Was Just Sent Back to the Dark Ages | US starts aggressively cutting ties
Wes Roth
5 ChatGPT Business Ideas - How I Use ChatGPT to make money
ChatGPT Business Ideas - How I Use ChatGPT to make money
Wes Roth
6 ChatGPT Explained - The AI revolution is happening right now... [ chat gpt ]
ChatGPT Explained - The AI revolution is happening right now... [ chat gpt ]
Wes Roth
7 ChatGPT Banned - New York blocking network access to ChatGPT
ChatGPT Banned - New York blocking network access to ChatGPT
Wes Roth
8 ChatGPT Trading - this [INSANE] tool A.I. built for me
ChatGPT Trading - this [INSANE] tool A.I. built for me
Wes Roth
9 Small Business Grants for ChatGPT and A.I. (similar to PPP and EIDL in 2023) |
Small Business Grants for ChatGPT and A.I. (similar to PPP and EIDL in 2023) |
Wes Roth
10 How to Make Passive Income with ChatGPT AI
How to Make Passive Income with ChatGPT AI
Wes Roth
11 OpenAIโ€™s GPT-4 Artificial Intelligence = AGI? TRILLIONS of Parameters Plus THIS
OpenAIโ€™s GPT-4 Artificial Intelligence = AGI? TRILLIONS of Parameters Plus THIS
Wes Roth
12 How Nvidia AI Robot Trained 42 Years In 32 Hours And Did THIS | Google DeepMind AlphaCode
How Nvidia AI Robot Trained 42 Years In 32 Hours And Did THIS | Google DeepMind AlphaCode
Wes Roth
13 John Carmack | AGI by 2030 | Will John Carmack's AI company be the one to make it?
John Carmack | AGI by 2030 | Will John Carmack's AI company be the one to make it?
Wes Roth
14 AI Small Business Grants
AI Small Business Grants
Wes Roth
15 Elon Musk attacks OpenAI - here's Sam Altman's response
Elon Musk attacks OpenAI - here's Sam Altman's response
Wes Roth
16 Bill Gates on ChatGPT and OpenAI "The Age of AI has begun"
Bill Gates on ChatGPT and OpenAI "The Age of AI has begun"
Wes Roth
17 Sparks of AGI | Microsoft Researchers claim GPT-4 Is showing "Artificial General Intelligence"
Sparks of AGI | Microsoft Researchers claim GPT-4 Is showing "Artificial General Intelligence"
Wes Roth
18 Elon Musk and Others Call for Pause on AI as GPT-4 shows signs of AGI.
Elon Musk and Others Call for Pause on AI as GPT-4 shows signs of AGI.
Wes Roth
19 Comparing GPT-4 and Google's Bard AI - Who is getting closer to AGI?
Comparing GPT-4 and Google's Bard AI - Who is getting closer to AGI?
Wes Roth
20 Sam Altman on UBI, OpenAI to $100 TRILLION and Massive Job Losses from AI Automation
Sam Altman on UBI, OpenAI to $100 TRILLION and Massive Job Losses from AI Automation
Wes Roth
21 25 ChatGPTs play a videogame...
25 ChatGPTs play a videogame...
Wes Roth
22 NVIDIA's new AI: Better Games, Art and... better life?
NVIDIA's new AI: Better Games, Art and... better life?
Wes Roth
23 Google AI Documents Leak about "Google and OpenAI"
Google AI Documents Leak about "Google and OpenAI"
Wes Roth
24 PaLM 2 vs GPT-4 | why Google is having a hard time catching up...
PaLM 2 vs GPT-4 | why Google is having a hard time catching up...
Wes Roth
25 How To Access ChatGPT Plugins | They are LIVE! (but hidden)
How To Access ChatGPT Plugins | They are LIVE! (but hidden)
Wes Roth
26 Sam Altman to Congress "America HAS to lead the world in AI"...
Sam Altman to Congress "America HAS to lead the world in AI"...
Wes Roth
27 Sam Altman Opening Statement to Congress on AI Regulation
Sam Altman Opening Statement to Congress on AI Regulation
Wes Roth
28 Sam Altman Congress Hearing "AI is the Biggest Threat to Human Race"
Sam Altman Congress Hearing "AI is the Biggest Threat to Human Race"
Wes Roth
29 Tree of Thoughts - GPT-4 Reasoning is Improved 900%
Tree of Thoughts - GPT-4 Reasoning is Improved 900%
Wes Roth
30 Governance of Superintelligence | OpenAI proposes measures for safe AI development.
Governance of Superintelligence | OpenAI proposes measures for safe AI development.
Wes Roth
31 Model Evaluation For Extreme Risks of AI | Google DeepMind and OpenAI Paper
Model Evaluation For Extreme Risks of AI | Google DeepMind and OpenAI Paper
Wes Roth
32 Minecraft AI - NVIDIA uses GPT-4 to create a SELF-IMPROVING ๐Ÿคฏ autonomous agent.
Minecraft AI - NVIDIA uses GPT-4 to create a SELF-IMPROVING ๐Ÿคฏ autonomous agent.
Wes Roth
33 AI Human Extinction Risk - Experts Warn of "Serious Risk"
AI Human Extinction Risk - Experts Warn of "Serious Risk"
Wes Roth
34 LoRA - Low-rank Adaption of AI Large Language Models: LoRA and QLoRA Explained Simply
LoRA - Low-rank Adaption of AI Large Language Models: LoRA and QLoRA Explained Simply
Wes Roth
35 99.3% of ChatGPT Performance with OpenSource AI - [QLoRA paper]
99.3% of ChatGPT Performance with OpenSource AI - [QLoRA paper]
Wes Roth
36 AlphaFold2 Explained | Google's DeepMind Solves Protein Folding
AlphaFold2 Explained | Google's DeepMind Solves Protein Folding
Wes Roth
37 Illumina AI - ChatGPT for your genome...
Illumina AI - ChatGPT for your genome...
Wes Roth
38 Text to Video Invasion! Runway AI releases GEN 2 text to video.
Text to Video Invasion! Runway AI releases GEN 2 text to video.
Wes Roth
39 LLMs as Tool Makers [LATM] - GPT-4 *UPGRADES* lower AI Models.
LLMs as Tool Makers [LATM] - GPT-4 *UPGRADES* lower AI Models.
Wes Roth
40 AlphaDev - DeepMind AI Discovers Better Algorithms for Foundational Computing
AlphaDev - DeepMind AI Discovers Better Algorithms for Foundational Computing
Wes Roth
41 OpenAI GPT-4 Function Calling: *HUGE* Potential
OpenAI GPT-4 Function Calling: *HUGE* Potential
Wes Roth
42 GPT-4 leaked! ๐Ÿ”ฅ All details exposed ๐Ÿ”ฅ It is over...
GPT-4 leaked! ๐Ÿ”ฅ All details exposed ๐Ÿ”ฅ It is over...
Wes Roth
43 Elon Musk announced XAI - the answer to OpenAI = X.AI
Elon Musk announced XAI - the answer to OpenAI = X.AI
Wes Roth
44 Andrej Karpathy GPT - Advice for building AI agents
Andrej Karpathy GPT - Advice for building AI agents
Wes Roth
45 TEST TO SEE IF AI CAN MAKE $1,000,000   (modern Turing test)
TEST TO SEE IF AI CAN MAKE $1,000,000 (modern Turing test)
Wes Roth
46 ChatGPT custom instructions are *POWERFUL*  Replace AutoGPT and BabyAGI?
ChatGPT custom instructions are *POWERFUL* Replace AutoGPT and BabyAGI?
Wes Roth
47 WORLDCOIN LAUNCH is starting! Backed by Sam Altman of OpenAI.
WORLDCOIN LAUNCH is starting! Backed by Sam Altman of OpenAI.
Wes Roth
48 WORLDCOIN ORB - I went to L.A. to get my eye scanned for WorldCoin [my experience]
WORLDCOIN ORB - I went to L.A. to get my eye scanned for WorldCoin [my experience]
Wes Roth
49 The Biggest Week of AI News In Months!
The Biggest Week of AI News In Months!
Wes Roth
50 Google Deepmind RT 2 - Using LLMs to Build Thinking, Learning Robots
Google Deepmind RT 2 - Using LLMs to Build Thinking, Learning Robots
Wes Roth
51 AI News is Getting *WEIRD* Human Brain Matter in Chips. OpenAI tutorial. Amazon unleashed it's AI.
AI News is Getting *WEIRD* Human Brain Matter in Chips. OpenAI tutorial. Amazon unleashed it's AI.
Wes Roth
52 GPT 5 release date ๐Ÿ”ฅ might be closer than we think | OpenAI applies for GPT-5 Trademark in the US.
GPT 5 release date ๐Ÿ”ฅ might be closer than we think | OpenAI applies for GPT-5 Trademark in the US.
Wes Roth
53 AI Agents Simulate a Town ๐Ÿคฏ Generative Agents: Interactive Simulacra of Human Behavior.
AI Agents Simulate a Town ๐Ÿคฏ Generative Agents: Interactive Simulacra of Human Behavior.
Wes Roth
โ–ถ Proof that AI Understands? ๐Ÿ‘€ Andrew Ng on LLMs building mental models, Othello GPT,  Geoffrey Hinton
Proof that AI Understands? ๐Ÿ‘€ Andrew Ng on LLMs building mental models, Othello GPT, Geoffrey Hinton
Wes Roth
55 OpenAI acquires Biomes ๐Ÿ‘€ an open-source MMORPG. ChatGPT plus Minecraft? ๐Ÿ”ฅ
OpenAI acquires Biomes ๐Ÿ‘€ an open-source MMORPG. ChatGPT plus Minecraft? ๐Ÿ”ฅ
Wes Roth
56 OpenAI announces FINETUNING ๐Ÿ‘€ for ChatGPT
OpenAI announces FINETUNING ๐Ÿ‘€ for ChatGPT
Wes Roth
57 Autonomous AI Agents - why YOU should be building them... and HOW.
Autonomous AI Agents - why YOU should be building them... and HOW.
Wes Roth
58 ChatGPT Enterprise - OpenAI launches the next BIG thing
ChatGPT Enterprise - OpenAI launches the next BIG thing
Wes Roth
59 HOODWINKED -  AI gets away with MURDER ๐Ÿ‘€ GPT-4 is an effective killer...
HOODWINKED - AI gets away with MURDER ๐Ÿ‘€ GPT-4 is an effective killer...
Wes Roth
60 Install Open Interpreter in 2 min | The free, open source CODE INTERPRETER!
Install Open Interpreter in 2 min | The free, open source CODE INTERPRETER!
Wes Roth

This video explores the capabilities of large language models (LLMs) in building world models and conveying learning and understanding of the world, with a focus on Othello GPT and its ability to predict legal moves in the game of Othello without explicit training. The video features insights from Andrew Ng and Geoffrey Hinton, highlighting the importance of neural networks in AI and the potential of LLMs in various applications. By watching this video, viewers can gain a deeper understanding of

Key Takeaways
  1. Watch the video to understand the basics of LLMs and their capabilities
  2. Read the papers referenced in the video to learn more about Othello GPT and its applications
  3. Experiment with LLMs and fine-tuning techniques to build and optimize LLM-based systems
  4. Explore the use of latent saliency maps and probes to analyze LLM decision-making processes
  5. Consider the potential applications of LLMs in various fields, such as game playing and natural language processing
๐Ÿ’ก LLMs are capable of building world models and conveying learning and understanding of the world, and are not just stochastic parrots or autocompletes.
๐Ÿ”’ Pro feature: Ask AI to explain this lesson โ†’

Related Reads

๐Ÿ“ฐ
LLM Tokens Explained: Cost, Memory, Speed and Context Windows
Understand LLM tokens and their impact on cost, memory, speed, and context windows to optimize your language model usage
Medium ยท AI
๐Ÿ“ฐ
5 Best Time-Aware Memory Layers for Long-Term AI Agents (2026 Guide)
Learn about the top time-aware memory layers for building long-term AI agents and improve their performance in stateful interactions
Medium ยท Machine Learning
๐Ÿ“ฐ
5 Best Time-Aware Memory Layers for Long-Term AI Agents (2026 Guide)
Learn about the 5 best time-aware memory layers for long-term AI agents and improve their performance in autonomous tasks
Medium ยท LLM
๐Ÿ“ฐ
Arquitetura Cognitiva de Baixa Latรชncia: Padrรฃo RAG com Cache
Learn how to implement low-latency cognitive architecture using the RAG pattern with cache to overcome corporate adoption hurdles of Generative AI
Medium ยท Python
Up next
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)
Watch โ†’