Did You Forget What I Asked? Prospective Memory Failures in Large Language Models
📰 ArXiv cs.AI
Large language models often fail to follow formatting instructions while performing demanding tasks: compliance drops by 2-21% under concurrent task load
Action Steps
- Identify the limitations of large language models in following formatting instructions under concurrent task load
- Use a prospective-memory-inspired lens from cognitive psychology to analyze model behavior
- Design controlled paradigms to test model compliance with formatting constraints
- Evaluate model performance across multiple task complexities and model families
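The evaluation step above can be sketched as a small compliance harness. This is a minimal, illustrative sketch, not the paper's actual methodology: the `<answer>…</answer>` formatting constraint, the sample outputs, and the low/high-load split are all assumptions invented for the example.

```python
import re

# Hypothetical formatting constraint: the whole response must be wrapped
# in <answer>...</answer> tags. This constraint is an assumption for
# illustration, not taken from the paper.
CONSTRAINT = re.compile(r"^<answer>.+</answer>$", re.DOTALL)

def complies(output: str) -> bool:
    """True if a model output satisfies the formatting constraint."""
    return bool(CONSTRAINT.match(output.strip()))

def compliance_rate(outputs: list[str]) -> float:
    """Fraction of outputs that follow the required format."""
    return sum(complies(o) for o in outputs) / len(outputs)

# Fabricated toy outputs under low vs. high concurrent task load.
low_load = ["<answer>42</answer>", "<answer>Paris</answer>"]
high_load = ["<answer>42</answer>", "The answer is Paris."]  # format forgotten

print(f"low-load compliance:  {compliance_rate(low_load):.0%}")   # 100%
print(f"high-load compliance: {compliance_rate(high_load):.0%}")  # 50%
```

Comparing compliance rates across load conditions (and across model families) is the kind of controlled measurement the action steps describe.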
Who Needs to Know This
AI engineers and researchers can use these findings to improve model performance and reliability, while product managers can factor the limitation into interface design
Key Insight
💡 Large language models' ability to follow formatting instructions degrades significantly under concurrent task load
Share This
🤖 Large language models struggle with formatting instructions under load 📊
DeepCamp AI