Beyond Code Coverage: Functionality Testing with Playwright — Marlene Mhangami, Microsoft

AI Engineer · Intermediate · 🧠 Large Language Models · 1mo ago

Key Takeaways

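Demonstrates a test-first workflow in which an LLM agent writes failing Playwright tests against the expected user-facing behavior, then generates the code to pass them.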

Original Description

When an LLM writes your tests, it tends to write tests that confirm what the code does rather than tests that verify what the user experiences. Your test suite goes green. The app still breaks in ways none of those tests would catch. Marlene Mhangami from Microsoft makes the case for flipping the order: get the agent to write failing Playwright tests against the expected behavior first, then generate code to pass them. The demo runs this live with GitHub Copilot and the Playwright MCP server on a toy store search feature, with the browser open so you can watch the agent click through filters and validate results in real time.

Speaker info:
- https://x.com/marlene_zw
- https://www.linkedin.com/in/marlenemhangami/
- https://github.com/marlenemhangami
