Can Open-Source LLM Agents Replace Static Application Security Testing Tools? An Empirical Assessment

📰 ArXiv cs.AI

Learn how open-source LLM agents perform in static application security testing and their potential to replace traditional tools

advanced Published 11 Jun 2026

Action Steps

Evaluate the performance of open-source LLM agents using metrics such as precision, recall, and false positive count
Compare the results with traditional static application security testing tools
Configure and fine-tune LLM agents for specific use cases and applications
Assess the potential of LLM agents to replace or augment traditional security testing tools
Apply LLM agents to real-world applications and monitor their performance over time

Who Needs to Know This

Cybersecurity teams and developers can benefit from understanding the capabilities and limitations of open-source LLM agents in identifying vulnerabilities and improving application security

Key Insight

💡 Open-source LLM agents show promise in static application security testing, but their performance varies depending on the model and configuration

Full Article

Title: Can Open-Source LLM Agents Replace Static Application Security Testing Tools? An Empirical Assessment

Abstract:
arXiv:2606.11672v1 Announce Type: cross Abstract: This paper explores the value of agentic AI tools for cybersecurity purposes. We evaluate the efficacy of a general-purpose GenAI Large Language Model- (GenAI-) based agent when powered by three different Ollama-hosted general-purpose open source models. We assess each agent's performance using precision, recall, false positive count, and a calculated composite score based upon the interplay of the captured metrics, against the baseline performan

Read full paper → ← Back to Reads