CSPO: Alleviating Reward Ambiguity for Structured Table-to-LaTeX Generation

📰 ArXiv cs.AI

arXiv:2604.10918v1 Announce Type: new Abstract: Tables contain rich structured information, yet when stored as images their contents remain "locked" within pixels. Converting table images into LaTeX code enables faithful digitization and reuse, but current multimodal large language models (MLLMs) often fail to preserve structural, style, or content fidelity. Conventional post-training with reinforcement learning (RL) typically relies on a single aggregated reward, leading to reward ambiguity tha

Published 14 Apr 2026
Read full paper → ← Back to Reads