AI Agent Memory: Chat History & Semantic Caching | Lino Tadros | Azure Cosmos DB Conf 2026

Microsoft Developer · Advanced ·🤖 AI Agents & Automation ·4h ago
AI agents are only as intelligent as their ability to remember. Without persistent memory, every conversation starts from scratch — costing tokens, increasing latency, and delivering disconnected experiences. Lino Tadros (Microsoft Regional Director, Founder & Principal Architect at The Training Boss) shows how to use Azure Cosmos DB as the unified memory layer for agentic AI applications. You'll build two essential containers from scratch: • Chat History — preserves multi-turn conversations across sessions and devices • Semantic Cache — uses vector search to return prior LLM completions for semantically similar prompts, avoiding redundant API calls See the data modeling decisions that matter: partition key strategy for multi-tenant workloads, document design that balances write distribution with query locality, vector indexes tuned for fast similarity search, and TTL policies for automatic lifecycle management. Walk away with working code patterns you can apply immediately. 👤 Connect with Lino Tadros 📝 Distinguished executive leader and renowned technical expert in AI, Machine Learning, and IoT. Leads cross-functional architectural teams to award-winning performance by developing strategic roadmaps and powering enterprise-wide projects. Serves as board member and advisor for multiple corporations delivering strategic guidance on product line developments and business solutions. Industry influencer and mastermind of strategic programs and innovations leading modernization efforts to alter the global IT landscape as Microsoft Regional Director. Partnered with Microsoft to consult major corporations on Azure integrations; trained over 1,000 global employees and architects across US, Canada, Europe, Middle East, and Australia. Invited into elite Microsoft Regional Director program as Top 1% of global SMEs—maintains direct line of communication to Microsoft executive leaders and Office of the President for Technical and Business Influence. Piloted multi-million
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

BizNode now has 7 tiers from $20 to $1500. API-hosted tiers need zero installation — your bot runs on BizNode...
BizNode offers 7 tiers of AI-powered automation, allowing businesses to work smarter with around-the-clock intelligence, starting at $20/month
Dev.to AI
OpenAI Really Wants Codex to Shut Up About Goblins
Learn how OpenAI's coding agent Codex is instructed to avoid discussing irrelevant creatures like goblins, and why this matters for AI development
Wired AI
Why Companies Will Stop Asking “Do You Know AI?” and Start Asking This Instead
Companies will shift from asking about AI knowledge to inquiring about ability to architect integrated systems using MCP, RAG, and Agents, highlighting the need for professionals to understand these technologies.
Medium · RAG
MCP, A2A, AND THE AGENT INTERNET
Learn about the Model Context Protocol (MCP) and A2A protocol and their impact on the agent internet and enterprise AI
Medium · AI
Up next
ChatGPT’s New Workspace 24/7 AI Agents is INSANE
Julian Goldie SEO
Watch →