Anthropic CVP Run 3 — Does Claude's Safety Stack Scale Down to Haiku 4.5?

📰 Dev.to AI

Learn how Anthropic's Cyber Verification Program scales down to smaller models like Haiku 4.5, ensuring safety and security

advanced Published 23 Apr 2026

Action Steps

Run the Anthropic CVP test suite on a smaller Claude model like Haiku 4.5
Evaluate the results against a set of 13-prompt agent-attack suite
Configure the test environment to mimic real-world scenarios
Test for exploit content execution and secret leakage
Analyze the scope notes for honest assessment of the model's safety

Who Needs to Know This

Cybersecurity researchers and AI engineers can benefit from understanding the Anthropic CVP and its application to smaller models, enhancing their ability to evaluate and improve AI safety

Key Insight

💡 The Anthropic CVP can effectively evaluate the safety of smaller AI models, providing a robust framework for responsible cybersecurity assessment