Malice in Agentland: Down the Rabbit Hole of Backdoors in the AI Supply Chain
📰 ArXiv cs.AI
arXiv:2510.05159v4 Announce Type: replace-cross Abstract: While finetuning AI agents on interaction data -- such as web browsing or tool use -- improves their capabilities, it also introduces critical security vulnerabilities within the agentic AI supply chain. We show that adversaries can effectively poison the data collection pipeline at multiple stages to embed hard-to-detect backdoors that, when triggered, cause unsafe or malicious behavior. We formalize three realistic threat models across
DeepCamp AI