Anthropic Launches Claude Fable 5: Mythos-Class AI With Cybersecurity Guardrails
Summary
Anthropic released Claude Fable 5, a powerful AI model with safety restrictions that automatically switch to a less capable version when users try to use it for high-risk tasks like cybersecurity or biology. The company tested these safeguards extensively through internal testing and external bug bounty programs (paying security researchers to find vulnerabilities) spanning over 1,000 hours, and no universal jailbreaks (methods to bypass the restrictions) were discovered.
Classification
Affected Vendors
Related Issues
Original source: https://www.securityweek.com/anthropic-launches-claude-fable-5-mythos-class-ai-with-cybersecurity-guardrails/
First tracked: June 9, 2026 at 02:00 PM
Classified by LLM (prompt v3) · confidence: 92%