General365: Benchmarking General Reasoning in Large Language Models Across Diverse and Challenging Tasks

📰 ArXiv cs.AI

arXiv:2604.11778v1 Announce Type: cross Abstract: Contemporary large language models (LLMs) have demonstrated remarkable reasoning capabilities, particularly in specialized domains like mathematics and physics. However, their ability to generalize these reasoning skills to more general and broader contexts--often termed general reasoning--remains under-explored. Unlike domain-specific reasoning, general reasoning relies less on expert knowledge but still presents formidable reasoning challenges,

Published 14 Apr 2026

Read full paper → ← Back to Reads