Building LLM Agents with Tool Use

Jay Alammar · Beginner ·🤖 AI Agents & Automation ·2y ago

Skills: LLM Engineering90%Tool Use & Function Calling85%Agent Foundations80%

Key Takeaways

Building LLM agents with tool use, integrating Cohere's Command models with external tools like search engines and APIs, and utilizing Langchain for execution.

Full Transcript

by now a lot of people are used to the nature of large language models as generators of text you know they take a text prompt and they generate a response so you give them a question for example and they give you an answer or you ask them to write you an article about something and they produce that article you've come across maybe the concept of rag which is this ability of a large language model to go and get information from a external data source for example like searching the web or searching uh a specific data set or company database for example tool use just takes this to the next level and instead of just saying that we can only retrieve information from this external Source why not have it do other kinds of things why not use it as a tool and here we'll look at an example um so there are multiple tools so web search is only one of a lot of things but we'll look at an example here of a data analyst agent and so this is this fascinating idea about an llm backed agent so now it's useful to not think of this as just communicating with the language model itself rather uh you're talking with a a software system that is better called an agent that is powered by a language model and that is able to use these external tools like web search like a python interpreter like a calculator like other sort of apis that are that are possible out there and we will look at if you go to Lang chain there are a large number of of apis that you can or or tools that that that you can use so let's look at this example here of this data uh analyst agent and we'll look at the code right after but conceptually it's this piece of software or llm backed agent that takes a user request and then it doesn't directly just goes and answers the the request because these agents are built to solve more complex tasks they start with outputting a plan and then start executing steps uh of that plan and reflecting at each each time and then finally they they go out and and produce the result so let's look at at the example here of this agent and so we'll go through the code in a in a bit but here we can give this command to this uh this agent and say create a plot of the number of full-time employees at the three tech companies with the highest market cap in the United States in 2024 right so this requires a number of steps searching and then plotting and but how can you plot when you're a text generation model well this text generation model is able to write code that is one capability that command R plus brings to the table but then it also can if we give it this tool that is a python interpreter it's able to create graphing code and then give you that plot and this is the final output of of the model so the data sources where it found the the information uh and then this is the plot that it sort of plotted so this is just the input and the output but we'll go through the the uh successive steps of how that is is is generated and so here you can see the final plot this notebook uh starts out with uh installing Lang chain um and then coh here um together and then we go through the setup so you need to put in your API key here uh if you don't have one you can register and sort of sign up and copy it and paste it here now we can Define our tools we said we had these two tools so let's start with the first one so in LLY chain there's this web search Tool uh powered by Tav this API so you need to go on here and get the API key and plug it here for for for tavali um and then this is the name of the tool and then a description of the tool and this is our second tool the python interpreter um also it's imported from from Lang chain experimental utilities uh it's called the name of it is uh we'll name it Python reppel and this is a description of what it does it executes python code and and Returns the result so now we've defined our two tools here we start to um Define and declare our agent and sort of how it works so let's say we run this and so this is a react agent this is the method of uh this is one method of of of agents that are backed by llms this is not to be confused with uh the JavaScript uh framework um after we've declared our agent and and set it up we can now give it the request that we have and here we can start to sort of to see it um thinking and outputting the the results and so first it says I will search for the three tech companies with the highest market cap in the US uh in 2024 then I will search for the number of full-time employees and plot the data using python so this is the plan of the model that it did in the first SP so after it generated this description of its plan now it's starting to execute the plan um and saying okay now let me invoke this tool named internet search uh and give it this query top three tech companies highest market cap and then this is one of the results that it's got and then it's the model is reflecting on the result that it's got so this is the let's say the web page and this is some of its content um and it says I found that the three tech companies with the highest market cap are Microsoft Apple and Nvidia now I will search for the number of full-time employees so this is what's called multihop uh tool use or multihop uh rag where it's not necessarily uh all done in one step of searching or one step of of tool use so the model sort of realizes that it needs more information and so it will call the web uh Search tool again and now it says Okay internet search Microsoft full-time employees and then another search of apple and then another search of Nvidia and so now the model is or the agent uh is doing four successive searches to find the information that you sort of um ask for and then here after it retrieves all of that information it says okay I have the information now this is what it's like now I will write and execute python code to uh plot this data so now it throws it says okay this is code I'm going to send to the python interpreter code and then this is the code that it it wants to um use to sort of create the plot and this is the final output that includes that data and this is the plot that was generated by by Matt plot lib so you can try different kinds of um tools here you can go to Lang chain and there are tons and tons of different tools that you can set your model up um with and have it sort of U experiment with and uh you know build more and more complex uh systems this way notice here that if you for example ask the model a simple question like like how are you it it does not have to uh search for it it says okay I'll just directly answer I don't need to search for this and so this is the answer um I'm doing well thank you for asking uh so here you get to see that now the llm is really able to reason on okay how do I tackle different kinds of problems and this is an absolutely fascinating way of of thinking thinking about the kinds of new applications that you can build with with with language models um I hope you find this useful um try it out let us know what you think

Original Description

Tool use is a method whichs allows developers to connect Cohere's Command models to external tools like search engines, APIs, databases, and other software tools. Just like how Retrieval-Augmented Generation (RAG) allows a model to use an external data source to improve factual generation, tool use is a capability that allows retrieving data from multiple sources. But it goes beyond simply retrieving information and is able to use software tools to execute code, or even create entries in a CRM system. In this video, we'll see how we can use two tools to create a simple data analyst agent that is able to search the web and run code in a python interpreter. This agent uses Cohere's Command R+ mode and Langchain. Find the code here at Colab: https://colab.research.google.com/github/cohere-ai/notebooks/blob/main/notebooks/agents/Data_Analyst_Agent_Cohere_and_Langchain.ipynb Github: https://github.com/cohere-ai/notebooks/blob/main/notebooks/agents/Data_Analyst_Agent_Cohere_and_Langchain.ipynb LangChain Tools: https://python.langchain.com/docs/integrations/tools/

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Jay Alammar · Jay Alammar · 36 of 38

← Previous Next →

Jay's Visual Intro to AI

Jay's Visual Intro to AI

Making Money from AI by Predicting Sales - Jay's Intro to AI Part 2

Making Money from AI by Predicting Sales - Jay's Intro to AI Part 2

How GPT3 Works - Easily Explained with Animations

How GPT3 Works - Easily Explained with Animations

The Narrated Transformer Language Model

The Narrated Transformer Language Model

My Visualization Tools (my Apple Keynote setup for visualizations and animations)

My Visualization Tools (my Apple Keynote setup for visualizations and animations)

Explainable AI Cheat Sheet - Five Key Categories

Explainable AI Cheat Sheet - Five Key Categories

The Unreasonable Effectiveness of RNNs (Article and Visualization Commentary) [2015 article]

The Unreasonable Effectiveness of RNNs (Article and Visualization Commentary) [2015 article]

Neural Activations & Dataset Examples

Neural Activations & Dataset Examples

Up and Down the Ladder of Abstraction [interactive article by Bret Victor, 2011]

Up and Down the Ladder of Abstraction [interactive article by Bret Victor, 2011]

Probing Classifiers: A Gentle Intro (Explainable AI for Deep Learning)

Probing Classifiers: A Gentle Intro (Explainable AI for Deep Learning)

Inspecting Neural Networks with CCA - A Gentle Intro (Explainable AI for Deep Learning)

Inspecting Neural Networks with CCA - A Gentle Intro (Explainable AI for Deep Learning)

Language Processing with BERT: The 3 Minute Intro (Deep learning for NLP)

Language Processing with BERT: The 3 Minute Intro (Deep learning for NLP)

Behavioral Testing of ML Models (Unit tests for machine learning)

Behavioral Testing of ML Models (Unit tests for machine learning)

Favorite AI/ML Books: Intro to ML with Python (Book Review)

Favorite AI/ML Books: Intro to ML with Python (Book Review)

Favorite Python Books: Effective Python

Favorite Python Books: Effective Python

Favorite Stats Books: Seven Pillars of Statistical Wisdom

Favorite Stats Books: Seven Pillars of Statistical Wisdom

Understanding Animal Languages - Seeing Voices 2

Understanding Animal Languages - Seeing Voices 2

How digital assistants like Siri work #shorts

How digital assistants like Siri work #shorts

Writing Code in Jupyter Notebooks #shorts

Writing Code in Jupyter Notebooks #shorts

Experience Grounds Language: Improving language models beyond the world of text

Experience Grounds Language: Improving language models beyond the world of text

pandas for data science in python #shorts

pandas for data science in python #shorts

The Illustrated Retrieval Transformer

The Illustrated Retrieval Transformer

AI Image Generation is MIND BLOWING! #shorts

AI Image Generation is MIND BLOWING! #shorts

A Generalist Agent (Gato) - DeepMind's single model learns 600 tasks

A Generalist Agent (Gato) - DeepMind's single model learns 600 tasks

The Illustrated Word2vec - A Gentle Intro to Word Embeddings in Machine Learning

The Illustrated Word2vec - A Gentle Intro to Word Embeddings in Machine Learning

AI Art Explained: How AI Generates Images (Stable Diffusion, Midjourney, and DALLE)

AI Art Explained: How AI Generates Images (Stable Diffusion, Midjourney, and DALLE)

What is Generative AI? 4 Important Things to Know (about ChatGPT, MidJourney, Cohere & future AIs)

What is Generative AI? 4 Important Things to Know (about ChatGPT, MidJourney, Cohere & future AIs)

AI is Eating The World - This is Where YOU Can Use it to Compete (AI Product Moats)

AI is Eating The World - This is Where YOU Can Use it to Compete (AI Product Moats)

What is LangChain? Where does it fit with LLMs like ChatGPT and Cohere? #shorts

What is LangChain? Where does it fit with LLMs like ChatGPT and Cohere? #shorts

Are language models with more parameters better? #shorts #chatgpt

Are language models with more parameters better? #shorts #chatgpt

How to manage LLM prompts with tools like LangChain #languagemodels #chatgpt

How to manage LLM prompts with tools like LangChain #languagemodels #chatgpt

What is Llama Index? how does it help in building LLM applications? #languagemodels #chatgpt

What is Llama Index? how does it help in building LLM applications? #languagemodels #chatgpt

prompt chains are important for building large language model applications

prompt chains are important for building large language model applications

ChatGPT has Never Seen a SINGLE Word (Despite Reading Most of The Internet). Meet LLM Tokenizers.

ChatGPT has Never Seen a SINGLE Word (Despite Reading Most of The Internet). Meet LLM Tokenizers.

What makes LLM tokenizers different from each other? GPT4 vs. FlanT5 Vs. Starcoder Vs. BERT and more

What makes LLM tokenizers different from each other? GPT4 vs. FlanT5 Vs. Starcoder Vs. BERT and more

Building LLM Agents with Tool Use

Building LLM Agents with Tool Use

SWE-Bench authors reflect on the state of LLM agents at Neurips 2024

SWE-Bench authors reflect on the state of LLM agents at Neurips 2024

Learn how ChatGPT and DeepSeek models work: How Transformer LLMs Work [Free Course]

Learn how ChatGPT and DeepSeek models work: How Transformer LLMs Work [Free Course]

This video teaches how to build LLM agents with tool use, allowing developers to connect Cohere's Command models to external tools, and execute code in Python interpreters. The example showcases a simple data analyst agent that searches the web and runs code, utilizing Langchain and Cohere's Command R+ mode.

Key Takeaways

Connect Cohere's Command models to external tools like search engines and APIs
Utilize Langchain for tool execution
Design a simple data analyst agent using Cohere's Command R+ mode
Execute code in a Python interpreter
Create entries in a CRM system

💡 Tool use allows LLM agents to go beyond simply retrieving information and execute code, or create entries in external systems, enabling more complex and dynamic interactions.

🔒 Pro feature: Ask AI to explain this lesson →

More on: LLM Engineering

View skill →

Build an LLM and RAG-based Chat Application using AlloyDB and LangChain

FULLY LOCAL Mistral AI PDF Processing [Hands-on Tutorial]

FULLY LOCAL Mistral AI PDF Processing [Hands-on Tutorial]

Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation

Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation

Ultimate Guide: Deploy Google ADK Agents to Vertex AI & Cloud Run (Step-by-Step Tutorial)

Ultimate Guide: Deploy Google ADK Agents to Vertex AI & Cloud Run (Step-by-Step Tutorial)

Shane | LLM Implementation

How to Make an Asteroids Game Bot (LIVE)

How to Make an Asteroids Game Bot (LIVE)

Using Claude Code + Nano Banana Pro To Create a Dataset of Engineering Drawings

Using Claude Code + Nano Banana Pro To Create a Dataset of Engineering Drawings

Automata Learning Lab

Related AI Lessons

How I Passed the Claude Certified Architect Exam (CCA-F)

Learn how to pass the Claude Certified Architect Exam with a proven preparation strategy and study resources

Smart Traffic Systems: Understanding and Configuring the Core Logic

Learn to build and configure smart traffic systems using core logic and AI-powered tools to reduce congestion and improve traffic flow

Dev.to · Mike Clarke

Your IDE Is Becoming the AI Control Plane

Learn how AI decisions are made across various systems in an organization and the importance of a unified control plane

Spring AI + MCP Bridge Tutorial — Connect External Tool Servers to ChatClient (2026)

Learn to connect external tool servers to ChatClient using Spring AI and MCP Bridge in this 2026 tutorial

Building Great Agent Skills: The Missing Manual