Insider Attacks in Multi-Agent LLM Consensus Systems

📰 ArXiv cs.AI

arXiv:2605.08268v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly deployed in multi-agent systems where agents communicate in natural language to solve tasks jointly. A key capability in such systems is consensus formation, where agents iteratively exchange messages and update decisions to reach a shared outcome. However, most existing multi-agent LLM frameworks assume that all participating agents are aligned with the system objective. In practice, a malicious insi

Published 12 May 2026

Read full paper → ← Back to Reads