Beyond Code Coverage: Functionality Testing with Playwright — Marlene Mhangami, Microsoft
Key Takeaways
Demonstrates functionality testing with Playwright and LLMs
Original Description
When an LLM writes your tests, it tends to write tests that confirm what the code does rather than tests that verify what the user experiences. Your test suite goes green. The app still breaks in ways none of those tests would catch.
Marlene Mhangami from Microsoft makes the case for flipping the order: get the agent to write failing Playwright tests against the expected behavior first, then generate code to pass them. The demo runs this live with GitHub Copilot and the Playwright MCP server on a toy store search feature, with the browser open so you can watch the agent click through filters and validate results in real time.
Speaker info:
- https://x.com/marlene_zw
- https://www.linkedin.com/in/marlenemhangami/
- https://github.com/marlenemhangami
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
More on: LLM Engineering
View skill →Related Reads
📰
📰
📰
📰
Chinese AI Models: The OpenAI Alternatives Every Developer Should Know
Dev.to AI
Benchmarking Chinese LLM APIs: DeepSeek V3 vs Qwen3 vs Kimi K2 — A Developer's Guide (2026)
Dev.to AI
Sematic Coherance
Dev.to · Claire Goldbeg
Building an AI Study Buddy with Persistent Memory Using Cognee
Medium · LLM
🎓
Tutor Explanation
DeepCamp AI