AgentDrift: Unsafe Recommendation Drift Under Tool Corruption Hidden by Ranking Metrics in LLM Agents

📰 ArXiv cs.AI

AgentDrift reveals unsafe recommendation drift in LLM agents due to tool corruption, hidden by ranking metrics

advanced Published 26 Mar 2026

Action Steps

Identify potential tool corruption in LLM agents
Evaluate the impact of tool corruption on recommendation safety using paired-trajectory protocols
Decompose divergence into information-channel and decision-process components to understand the sources of unsafe drift
Develop and implement safety-focused evaluation metrics to complement ranking-quality metrics

Who Needs to Know This

ML researchers and engineers working on LLM agents benefit from understanding AgentDrift, as it highlights the importance of evaluating safety in high-stakes domains. This knowledge can inform the development of more robust and reliable LLM agents

Key Insight

💡 Ranking metrics can hide unsafe recommendation drift in LLM agents, emphasizing the need for safety-focused evaluation