MM-StanceDet: Retrieval-Augmented Multi-modal Multi-agent Stance Detection
📰 ArXiv cs.AI
arXiv:2604.27934v1 Announce Type: new Abstract: Multimodal Stance Detection (MSD) is crucial for understanding public discourse, yet effectively fusing text and image, especially with conflicting signals, remains challenging. Existing methods often face difficulties with contextual grounding, cross-modal interpretation ambiguity, and single-pass reasoning fragility. To address these, we propose Retrieval-Augmented Multi-modal Multi-agent Stance Detection (MM-StanceDet), a novel multi-agent frame
DeepCamp AI