Nano-EmoX: Unifying Multimodal Emotional Intelligence from Perception to Empathy

📰 ArXiv cs.AI

arXiv:2603.02123v3 Announce Type: replace Abstract: The development of affective multimodal language models (MLMs) has long been constrained by a gap between low-level perception and high-level interaction, leading to fragmented affective capabilities and limited generalization. To bridge this gap, we propose a cognitively inspired three-level hierarchy that organizes affective tasks according to their cognitive depth-perception, understanding, and interaction-and provides a unified conceptual f

Published 14 Apr 2026
Read full paper → ← Back to Reads