ReCUBE Benchmark Reveals GPT-5 Scores Only 37.6% on Repository-Level Code Generation
📰 Dev.to · gentic news
Researchers introduce ReCUBE, a benchmark isolating LLMs' ability to use repository-wide context for code generation. GPT-5 achieves just a 37.57% str
DeepCamp AI