In the Wake of Anthropic’s Mythos, OpenAI Has a New Cybersecurity Model—and Strategy
Summary
OpenAI announced GPT-5.4-Cyber, a new AI model designed specifically for cybersecurity professionals, along with a three-part strategy to manage risks as AI becomes more powerful. The announcement comes after competitor Anthropic released a more limited version of its Claude Mythos model, citing concerns that advanced AI could be exploited by attackers, though OpenAI argues that current safeguards are sufficient for broad deployment of today's models.
Solution / Mitigation
OpenAI's strategy includes three components: (1) 'know your customer' validation systems combined with Trusted Access for Cyber (TAC), an automated system introduced in February that allows controlled access to new models; (2) iterative deployment, a careful process of releasing and refining capabilities while monitoring for resilience to jailbreaks (techniques that trick AI into ignoring its safety guidelines) and other adversarial attacks; and (3) investments supporting software security and digital defense, including the Codex Security application security AI agent, a cybersecurity grants program begun in 2023, a donation to the Linux Foundation for open source security, and the Preparedness Framework designed to assess and defend against severe harm from advanced AI capabilities.
Classification
Affected Vendors
Related Issues
Original source: https://www.wired.com/story/in-the-wake-of-anthropics-mythos-openai-has-a-new-cybersecurity-model-and-strategy/
First tracked: April 14, 2026 at 08:00 PM
Classified by LLM (prompt v3) · confidence: 85%