Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory

📰 ArXiv cs.AI

Researchers introduce a framework to evaluate LLMs' metacognitive efficiency using Signal Detection Theory, distinguishing between knowledge and self-awareness of knowledge

advanced Published 27 Mar 2026

Action Steps

Decompose LLM evaluation into Type-1 sensitivity (knowledge) and Type-2 metacognitive sensitivity (self-awareness of knowledge)
Apply Signal Detection Theory to calculate meta-d' and M-ratio, providing a more accurate assessment of LLMs' metacognitive efficiency
Use the M-ratio to compare the metacognitive abilities of different LLMs, informing model selection and development decisions
Integrate this framework into existing evaluation pipelines to improve the reliability and trustworthiness of LLMs

Who Needs to Know This

AI researchers and engineers benefit from this framework as it provides a more nuanced understanding of LLMs' capabilities, allowing for more effective model evaluation and improvement

Key Insight

💡 LLMs' metacognitive efficiency can be measured and improved using Signal Detection Theory, enhancing their reliability and trustworthiness