Should We be Pedantic About Reasoning Errors in Machine Translation?
📰 ArXiv cs.AI
arXiv:2604.09890v1 Announce Type: cross Abstract: Across multiple language pairings (English $\to$ \{Spanish, French, German, Mandarin, Japanese, Urdu, Cantonese\}), we find reasoning errors in translation. To quantify how often these reasoning errors occur, we leverage an automated annotation protocol for reasoning evaluation wherein the goal is to detect if a reasoning step is any of three error categories: (1) source sentence-misaligned, (2) model hypothesis-misaligned, or (3) reasoning trace
DeepCamp AI