cyber · LOW · 2026-04-23 22:59 UTC

Anthropic CVP Run 3 — Does Claude's Safety Stack Scale Down to Haiku 4.5?

TL;DR: Tested Anthropic's smallest production Claude (Haiku 4.5) against the same 13-prompt agent-attack suite from Run 2 (Opus 4.7). Result: 13/13 clean. Zero exploit content executed. Zero secrets leaked. Honest scope notes inside. The Cyber Verification Program is a narrow, authorized lane Anthro

