LangSmith Highlights: Human Annotation
Key Takeaways
The video demonstrates how to add human feedback to annotate runs in LangSmith, including tagging a run with feedback and checking out the annotation queue.
Full Transcript
One of the things that we help you do in LangSmith is add human feedback to annotate your runs. We just showed how you can use automatic evaluation to have LLMs grade your runs, or to programmatically auto-evaluate each of your runs, but there's really no substitute for a human adding annotation feedback on runs as well. You might do this for a couple of reasons. Maybe there's some kind of measure that's hard to evaluate automatically, or maybe you've used auto evals on thousands of runs and you want a human to pick through a small subset of those runs to make sure that your LLM grader is still doing a good job. I'll show how to do that in this video.

This is a test run. We have some feedback already recorded on each of these runs: things like correctness, helpfulness, and sensitivity, as well as embedding cosine distance. What we're going to do is pick all of the runs that had a correct score, grab them all, and send them to an annotation queue. We're going to add this second human review annotation queue, and now all of these runs will be queued up so that we can easily go through and add our own feedback.

We can see here all of the tags this run already has, but maybe we want a different kind of feedback, like creativity, which is harder for LLMs to grade, and it can have a score of 1 to 5. I'm just making this up, but this one has a creativity score of two. Maybe you have a rubric that a human evaluator wants to follow. This one is done. I can now add a score of, again, creativity to this run here; we'll give it a score of five. This one is done.

You can see how I can quickly add tags to each of these runs and add some additional feedback manually to the ones that are in my queue. So if I'm supporting a flow of making sure that each of the runs has a good response, you can do that pretty seamlessly within your annotation queue. And we're all caught up, meaning I have no more to review. This is really helpful if you're in a support role or you're helping curate datasets, to make sure that you have the appropriate tags and feedback on each of your runs.
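The workflow in the video (filter runs by an existing score, send them to a queue, add a human score to each, finish when the queue is empty) can be sketched in plain Python. This is a minimal in-memory model and not the LangSmith SDK: the `Run` and `AnnotationQueue` classes, the run IDs, the queue name, and the convention that a correctness score of 1 means "correct" are all assumptions made for illustration.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class Run:
    """Hypothetical stand-in for a traced run and its feedback scores."""
    run_id: str
    feedback: dict = field(default_factory=dict)  # feedback key -> score


class AnnotationQueue:
    """Hypothetical in-memory review queue (not the LangSmith client)."""

    def __init__(self, name):
        self.name = name
        self._pending = deque()

    def add_runs(self, runs):
        """Queue runs for human review, in the order given."""
        self._pending.extend(runs)

    def next_run(self):
        """Return the next run to review, or None when all caught up."""
        return self._pending[0] if self._pending else None

    def annotate_and_finish(self, key, score, low=1, high=5):
        """Record a human score on the current run and mark it done."""
        if not low <= score <= high:
            raise ValueError(f"{key} must be between {low} and {high}")
        run = self._pending.popleft()
        run.feedback[key] = score
        return run

    def __len__(self):
        return len(self._pending)


# Made-up runs that already carry automatic feedback.
runs = [
    Run("run-a", {"correctness": 1, "helpfulness": 1}),
    Run("run-b", {"correctness": 0, "helpfulness": 1}),
    Run("run-c", {"correctness": 1, "helpfulness": 0}),
]

# Select the runs the automatic evaluator marked correct (assumed: score 1).
correct_runs = [r for r in runs if r.feedback.get("correctness") == 1]

queue = AnnotationQueue("second human review")
queue.add_runs(correct_runs)

# A reviewer works through the queue, adding a creativity score (1 to 5).
human_scores = {"run-a": 2, "run-c": 5}
while (run := queue.next_run()) is not None:
    queue.annotate_and_finish("creativity", human_scores[run.run_id])

all_caught_up = len(queue) == 0
```

Keeping the human score under its own key (`creativity` here) leaves the automatic scores untouched, so the two kinds of feedback can later be compared side by side on the same run.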
Original Description
See how to:
-Tag a run with feedback
-Check out your annotation queue
Log in or sign up for LangSmith (BETA): https://smith.langchain.com/