Anthropic Releases Claude Fable 5, Its Most Powerful AI Yet, With Cyber Safeguards
Summary
Anthropic released Claude Fable 5, a powerful AI model with safety classifiers (separate AI systems that monitor for misuse) that block cybersecurity-related requests by routing them to a weaker model instead of refusing them outright. The company also released Claude Mythos 5, an identical but unrestricted version for vetted cybersecurity professionals, because the underlying model is so effective at finding software vulnerabilities that giving it to the general public without controls could help attackers.
Solution / Mitigation
Anthropic stated it will narrow the safeguards and cut false positives after launch. The company also plans to make any remaining universal jailbreaks (prompts that completely bypass safety measures) slow and costly enough to catch before they are used at scale.
Classification
Affected Vendors
Related Issues
Original source: https://thehackernews.com/2026/06/anthropic-releases-claude-fable-5-its.html
First tracked: June 10, 2026 at 08:00 AM
Classified by LLM (prompt v3) · confidence: 92%