Code-QA-Bench: Separating Code Reasoning from Documentation Memorization in Repository-Level QA

📰 ArXiv cs.AI

arXiv:2605.29277v1 Announce Type: cross Abstract: We present Code-QA-Bench, a fully automated framework for synthesizing repository-level code understanding benchmarks that separates genuine code comprehension from documentation recall and pretraining memorization. The framework makes two methodological contributions: (1) an answer-first generation pipeline where a tool-equipped agent explores source code to produce verified gold answers before deriving questions, ensuring every task is grounded

Published 29 May 2026
Read full paper → ← Back to Reads