AI Memory Patterns: Save Tokens, Cut Costs | Chander Dhall | Azure Cosmos DB Conf 2026

Microsoft Developer · Intermediate ·🤖 AI Agents & Automation ·4h ago
AI systems must retain conversation history, tool outputs, and user context across multiple turns. Basic approaches can quickly inflate token usage or lose critical context. In this session, Chander Dhall (CEO of Cazton, 15-time Microsoft MVP) explores three memory patterns implemented using Azure Cosmos DB NoSQL: 1. Sliding Window Memory — summarization for recent and older turns 2. Hierarchical Memory — recent context, compressed history, and long-term tiers with intelligent retrieval 3. Entity-Based Memory Graphs — extracts and stores structured facts for precise recall You'll leave with concrete code patterns, guidance on selecting the right approach, and a reusable Cosmos DB schema for AI agent memory management. 👤 Connect with Chander Dhall 📝 Chander Dhall, CEO of Cazton, is a fifteen-time awarded Microsoft (AI) MVP, Microsoft Regional Director, Google Developer Expert, Azure Cosmos DB Cosmonaut and world-renowned technology leader in architecting and implementing solutions. In 2025, he was recognized as one of only twenty AI MVPs in the United States. He's not only rescued software development, cloud, big data, and AI teams, but also implemented successful projects under tight deadlines and difficult business constraints. His company, Cazton, has a proven track record of not just saving the client millions of dollars, but also providing expedited delivery time. Cazton's clients include Google, Microsoft, Thomson Reuters, Broadcom, AT&T, Dell, Bank of America, NBC Universal, American Express, Fandango, LinkedIn, VMware, McKesson, Macquarie Bank, and many other Fortune 500, mid-size and startup companies. In the field of AI, Chander has been at the forefront of building and deploying enterprise-grade solutions that leverage cutting-edge technologies like OpenAI's GPT models, Google's Gemini, Cohere's Command and Anthropic's Claude. He specializes in designing and fine-tuning generative AI models for real-world applications such as conversational AI, pred
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

BizNode now has 7 tiers from $20 to $1500. API-hosted tiers need zero installation — your bot runs on BizNode...
BizNode offers 7 tiers of AI-powered automation, allowing businesses to work smarter with around-the-clock intelligence, starting at $20/month
Dev.to AI
OpenAI Really Wants Codex to Shut Up About Goblins
Learn how OpenAI's coding agent Codex is instructed to avoid discussing irrelevant creatures like goblins, and why this matters for AI development
Wired AI
Why Companies Will Stop Asking “Do You Know AI?” and Start Asking This Instead
Companies will shift from asking about AI knowledge to inquiring about ability to architect integrated systems using MCP, RAG, and Agents, highlighting the need for professionals to understand these technologies.
Medium · RAG
MCP, A2A, AND THE AGENT INTERNET
Learn about the Model Context Protocol (MCP) and A2A protocol and their impact on the agent internet and enterprise AI
Medium · AI
Up next
ChatGPT’s New Workspace 24/7 AI Agents is INSANE
Julian Goldie SEO
Watch →