Building and Evaluating Data Agents

External: Coursera Courses ↗ · Coursera

Open Course on External: Coursera

Free to audit · Opens on External: Coursera

Building and Evaluating Data Agents

Coursera · Intermediate ·🤖 AI Agents & Automation ·3mo ago

Key Takeaways

Designs digital systems from logic gates to processors

Original Description

Learn how to build and evaluate a data agent in “Building and Evaluating Data Agents,” a course created in collaboration with Snowflake, and taught by Anupam Datta, AI Research Lead, and Josha Reini, Developer Advocate at Snowflake. You’ll design a data agent that connects to data sources (databases, files) and performs web searches to respond to users’ queries. The agent will consist of sub-agents, each specialized in connecting to a particular data source, and other sub-agents that summarize or visualize the results. To answer a particular query, the agent will use a planner that identifies which sub-agents to call and in what order. You’ll add observability to the agent’s workflow and evaluate the quality of its output. Using an LLM-as-a-judge approach, you’ll assess whether the final answer is relevant to the user’s query and grounded in the collected data. You’ll also evaluate the process by determining whether the agent’s goal, plan, and actions (GPA) are all aligned. Finally, you’ll apply inline evaluations to evaluate the agent’s performance during runtime. At every retrieval step, you’ll evaluate if the collected data is relevant to the user’s query. The agent will use this evaluation score to decide if it needs to adjust its plan. What you’ll do, in detail: Understand what data agents are and how they can be trustworthy when their goal, plan, and actions are properly aligned. Build a data agent that plans, performs web searches ,and visualizes or summarizes the results, using a multi-agent workflow implemented in LangGraph. Expand the agent’s capabilities by adding a Cortex sub-agent that retrieves information from structured and unstructured data stored in Snowflake. Add tracing to the agent’s workflow to log the steps it takes to answer a query. Evaluate the context relevance of the retrieved results, the groundedness of the final answer, and its relevance to the user’s query. Measure the alignment of the agent’s goal, plan, and actions (GPA) b
Watch on External: Coursera ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

The Hidden Cost of Repetitive Online Work (And How AI Is Changing It)
Discover how AI can help reduce the hidden costs of repetitive online work and improve business efficiency
Medium · AI
How to Hack an AI Agent (And How to Stop It)
Learn how to identify and mitigate 7 common AI agent attack patterns, including prompt injection and data exfiltration, to secure your AI systems
Dev.to · Dockfix Labs
Give any MCP agent ground-truth: measured ground motion for US addresses with SibFly
Learn how to provide MCP agents with ground-truth data on measured ground motion for US addresses using SibFly's hosted MCP server
Dev.to AI
How a 3-Line Loop Costs $5,000 at 2 AM (And the Code Pattern to Fix It)
Learn how a simple 3-line loop can cost $5,000 at 2 AM and discover the code pattern to fix it, ensuring secure autonomous workflows
Dev.to AI
Up next
Building Great Agent Skills: The Missing Manual
AI Engineer
Watch →