Should We be Pedantic About Reasoning Errors in Machine Translation?

📰 ArXiv cs.AI

arXiv:2604.09890v1 Announce Type: cross Abstract: Across multiple language pairings (English $\to$ \{Spanish, French, German, Mandarin, Japanese, Urdu, Cantonese\}), we find reasoning errors in translation. To quantify how often these reasoning errors occur, we leverage an automated annotation protocol for reasoning evaluation wherein the goal is to detect if a reasoning step is any of three error categories: (1) source sentence-misaligned, (2) model hypothesis-misaligned, or (3) reasoning trace

Published 14 Apr 2026

Read full paper → ← Back to Reads