Insider Attacks in Multi-Agent LLM Consensus Systems
📰 ArXiv cs.AI
arXiv:2605.08268v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly deployed in multi-agent systems where agents communicate in natural language to solve tasks jointly. A key capability in such systems is consensus formation, where agents iteratively exchange messages and update decisions to reach a shared outcome. However, most existing multi-agent LLM frameworks assume that all participating agents are aligned with the system objective. In practice, a malicious insi
DeepCamp AI