Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory

📰 ArXiv cs.AI

Researchers introduce a framework to evaluate LLMs' metacognitive efficiency using Signal Detection Theory, distinguishing between knowledge and self-awareness of knowledge

advanced Published 27 Mar 2026
Action Steps
  1. Decompose LLM evaluation into Type-1 sensitivity (knowledge) and Type-2 metacognitive sensitivity (self-awareness of knowledge)
  2. Apply Signal Detection Theory to calculate meta-d' and M-ratio, providing a more accurate assessment of LLMs' metacognitive efficiency
  3. Use the M-ratio to compare the metacognitive abilities of different LLMs, informing model selection and development decisions
  4. Integrate this framework into existing evaluation pipelines to improve the reliability and trustworthiness of LLMs
Who Needs to Know This

AI researchers and engineers benefit from this framework as it provides a more nuanced understanding of LLMs' capabilities, allowing for more effective model evaluation and improvement

Key Insight

💡 LLMs' metacognitive efficiency can be measured and improved using Signal Detection Theory, enhancing their reliability and trustworthiness

Share This
🤖 New framework for evaluating LLMs' metacognitive efficiency using Signal Detection Theory! 📊
Read full paper → ← Back to News