LLM-Based Multi-Reference Evaluation for Efficient and Robust Assessment of Phrase Break Annotations
📰 ArXiv cs.AI
arXiv:2606.21098v1 Announce Type: cross Abstract: Reliable evaluation of phrase break annotations is crucial, as subtle variations in prosodic boundaries directly affect the clarity and naturalness of speech. However, existing approaches exhibit major limitations: single-reference evaluation assumes a unique gold phrasing for an utterance despite multiple valid phrasings, while human judgment, though flexible, is labor-intensive and unscalable. To address these, we propose LLM-based Multi-Refere
DeepCamp AI