Structured Security Auditing and Robustness Enhancement for Untrusted Agent Skills
📰 ArXiv cs.AI
arXiv:2604.25109v1 Announce Type: cross Abstract: Agent Skills package SKILL.md files, scripts, reference documents, and repository context into reusable capability units, turning pre-load auditing from single-prompt filtering into cross-file security review. Existing guardrails often flag risk but recover malicious intent inconsistently under semantics-preserving rewrites. This paper formulates pre-load auditing for untrusted Agent Skills as a robust three-way classification task and introduces
DeepCamp AI