Build Computing Olympiad Agents with LangGraph
In this tutorial, we create Olympiad programming agents using LangGraph, drawing upon the techniques and benchmark dataset introduced in the paper "Can Language Models Solve Olympiad Programming?" by Quan Shi, Michael Tang, Karthik Narasimhan, and Shunyu Yao. #AI #LangGraph #llm
Throughout the tutorial, we learn how to enhance the agent's performance by incorporating three key techniques:
1. Reflection: In the first part, we develop a zero-shot tool calling agent and prompt it to reflect on the test case results, enabling it to correct its initial errors. This agent is comparable to the one reported in the paper, achieving a pass rate of 12.38 on the USACO benchmark.
2. Retrieval: In the second part, we introduce an initial retrieval step, functioning as an "episodic memory" for the agent. This step retrieves high-quality few-shot examples from our corpora of programming problems, assisting in solving the bronze level question. The agent created in this part is similar to the one benchmarked at 20.2 in the paper.
3. Human-in-the-loop: In the third part, we utilize the `interrupt_after` feature to allow the user to copilot the agent, guiding it towards a better answer. The benchmark performance in this scenario is limited only by the competitiveness of the human collaborator.
By the end of this tutorial, you will have a solid understanding of how to build agents in LangGraph, leveraging advanced techniques such as reflection, retrieval, and human-in-the-loop interaction.
Chapters:
00:00 Introduction
01:30 Overview of the USACO Dataset and Benchmark Performance
02:38 Introduce the 3 techniques
06:01 Part 1: Zero-Shot Agent with Reflection Capabilities
12:27 Review Zero-Shot Results
13:33 Part 2: Add Episodic Memory Retrieval
18:48 Review Results of Augmented Agent
21:06 Part 3: Add Human-in-the-Loop
26:43 Review Copilot Results
27:44 Conclusion
Additional Resources:
- Research Paper: https://arxiv.org/abs/2404.10952v1
- Tutorial Code: https://langchain-ai.gi
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
Playlist
Uploads from LangChain · LangChain · 0 of 60
← Previous
Next →
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
Chat With Your Documents Using LangChain + JavaScript
LangChain
LangChain SQL Webinar
LangChain
LangChain "OpenAI functions" Webinar
LangChain
LangSmith Launch
LangChain
LangChain x Pinecone: Supercharging Llama-2 with RAG
LangChain
LangChain Expression Language
LangChain
Building LLM applications with LangChain with Lance
LangChain
Benchmarking Question/Answering Over CSV Data
LangChain
LangChain "RAG Evaluation" Webinar
LangChain
Fine-tuning in Your Voice Webinar
LangChain
Tabular Data Retrieval
LangChain
Building an LLM Application with Audio by AssemblyAI
LangChain
Superagent Deepdive Webinar
LangChain
Lessons from Deploying LLMs with LangSmith
LangChain
Shortwave Assistant Deepdive Webinar
LangChain
Cognitive Architectures for Language Agents
LangChain
Effectively Building with LLMs in the Browser with Jacob
LangChain
Data Privacy for LLMs
LangChain
"Theory of Mind" Webinar with Plastic Labs
LangChain
LangChain Templates
LangChain
Using Natural Language to Query Postgres with Jacob
LangChain
Building a Research Assistant from Scratch
LangChain
Benchmarking RAG over LangChain Docs
LangChain
Skeleton-of-Thought: Building a New Template from Scratch
LangChain
Benchmarking Methods for Semi-Structured RAG
LangChain
LangSmith Highlights: Getting Started
LangChain
LangSmith Highlights: Debugging
LangChain
LangSmith Highlights: Datasets
LangChain
LangSmith Highlights: Evaluation
LangChain
LangSmith Highlights: Human Annotation
LangChain
LangSmith Highlights: Monitoring
LangChain
LangSmith Highlights: Hub
LangChain
SQL Research Assistant
LangChain
Getting Started with Multi-Modal LLMs
LangChain
Build a Full Stack RAG App With TypeScript
LangChain
Auto-Prompt Builder (with Hosted LangServe)
LangChain
LangChain v0.1.0 Launch: Introduction
LangChain
LangChain v0.1.0 Launch: Observability
LangChain
LangChain v0.1.0 Launch: Integrations
LangChain
LangChain v0.1.0 Launch: Composability
LangChain
LangChain v0.1.0 Launch: Streaming
LangChain
LangChain v0.1.0 Launch: Output Parsing
LangChain
LangChain v0.1.0 Launch: Retrieval
LangChain
LangChain v0.1.0 Launch: Agents
LangChain
Build and Deploy a RAG app with Pinecone Serverless
LangChain
Hosted LangServe + LangChain Templates
LangChain
LangGraph: Intro
LangChain
LangGraph: Agent Executor
LangChain
LangGraph: Chat Agent Executor
LangChain
LangGraph: Human-in-the-Loop
LangChain
LangGraph: Dynamically Returning a Tool Output Directly
LangChain
LangGraph: Respond in a Specific Format
LangChain
LangGraph: Managing Agent Steps
LangChain
LangGraph: Force-Calling a Tool
LangChain
LangGraph: Multi-Agent Workflows
LangChain
Streaming Events: Introducing a new `stream_events` method
LangChain
Building a web RAG chatbot: using LangChain, Exa (prev. Metaphor), LangSmith, and Hosted Langserve
LangChain
OpenGPTs
LangChain
Open Source RAG with Nomic's New Embedding Model (and ChromaDB and Ollama)
LangChain
LangGraph: Persistence
LangChain
More on: LLM Engineering
View skill →Related AI Lessons
⚡
⚡
⚡
⚡
The ABCs of reading medical research and review papers these days
Medium · LLM
#1 DevLog Meta-research: I Got Tired of Tab Chaos While Reading Research Papers.
Dev.to AI
How to Set Up a Karpathy-Style Wiki for Your Research Field
Medium · AI
The Non-Optimality of Scientific Knowledge: Path Dependence, Lock-In, and The Local Minimum Trap
ArXiv cs.AI
Chapters (10)
Introduction
1:30
Overview of the USACO Dataset and Benchmark Performance
2:38
Introduce the 3 techniques
6:01
Part 1: Zero-Shot Agent with Reflection Capabilities
12:27
Review Zero-Shot Results
13:33
Part 2: Add Episodic Memory Retrieval
18:48
Review Results of Augmented Agent
21:06
Part 3: Add Human-in-the-Loop
26:43
Review Copilot Results
27:44
Conclusion
🎓
Tutor Explanation
DeepCamp AI