aisecwatch.com
DashboardVulnerabilitiesNewsResearchArchiveStatsDatasetFor devs
Subscribe
aisecwatch.com

Real-time AI security monitoring. Tracking AI-related vulnerabilities, safety and security incidents, privacy risks, research developments, and policy changes.

Navigation

VulnerabilitiesNewsResearchDigest ArchiveNewsletter ArchiveSubscribeData SourcesStatisticsDatasetAPIIntegrationsWidgetRSS Feed

Maintained by

Truong (Jack) Luu

Information Systems Researcher

AI Sec Watch

The security intelligence platform for AI teams

AI security threats move fast and get buried under hype and noise. Built by an Information Systems Security researcher to help security teams and developers stay ahead of vulnerabilities, privacy incidents, safety research, and policy developments.

Independent research. No sponsors, no paywalls, no conflicts of interest.

[TOTAL_TRACKED]
3,710
[LAST_24H]
1
[LAST_7D]
1
Daily BriefingSunday, May 17, 2026

No new AI/LLM security issues were identified today.

Latest Intel

page 171/371
VIEW ALL
01

'Silent failure at scale': The AI risk that can tip the business world into disorder

safetyindustry
Mar 1, 2026

AI systems are becoming too complex for humans to fully understand or predict their behavior, creating risks of 'silent failures at scale' where mistakes accumulate quietly over time without obvious crashes or alerts. As companies deploy AI to handle critical business operations like approving transactions and managing customer service, gaps between expected and actual system performance are causing real damage, such as a beverage manufacturer's AI producing hundreds of thousands of excess cans when it misidentified holiday packaging.

CNBC Technology
02

Hackers Weaponize Claude Code in Mexican Government Cyberattack

security
Mar 1, 2026

Attackers used Claude (an AI assistant made by Anthropic) to write exploits (code that takes advantage of security flaws), create hacking tools, and automatically steal over 150GB of data from Mexican government systems. This demonstrates how AI models can be misused for cyberattacks when someone gains unauthorized access to them.

SecurityWeek
03

Quoting claude.com/import-memory

securitysafety
Mar 1, 2026

A user requested that Claude export all stored memories and learned context about them in a specific format to migrate to another service. The request asked Claude to list personal details, behavioral preferences, instructions, projects, and tools with verbatim preservation and no summarization, then confirm if the export was complete.

Simon Willison's Weblog
04

The trap Anthropic built for itself

policysafety
Feb 28, 2026

Anthropic, an AI company founded in 2021, lost a $200 million Pentagon contract and faced a federal ban after refusing to allow its technology to be used for mass surveillance or autonomous weapons systems. According to physicist Max Tegmark, Anthropic and other major AI companies like OpenAI and Google DeepMind have contributed to this crisis by resisting binding regulation and repeatedly breaking their own safety promises, most recently when Anthropic dropped its core commitment not to release powerful AI systems until confident they would not cause harm.

TechCrunch
05

Anthropic’s Claude rises to No. 2 in the App Store following Pentagon dispute

policy
Feb 28, 2026

Anthropic's Claude AI chatbot has risen to the second most popular free app in Apple's US App Store, jumping from outside the top 100 in late January to number two by early February. This surge in downloads followed a public dispute where Anthropic negotiated with the Pentagon over safeguards to prevent its AI from being used for mass domestic surveillance or fully autonomous weapons, which led President Trump to direct federal agencies to stop using Anthropic products.

TechCrunch
06

The billion-dollar infrastructure deals powering the AI boom

industry
Feb 28, 2026

AI companies are spending billions of dollars on computing infrastructure to power AI models, with estimates of $3-4 trillion by the end of the decade. Major tech companies like Microsoft, Google, Oracle, and Amazon are competing to provide cloud services and specialized hardware to AI labs, leading to massive deals such as Oracle's $300 billion agreement with OpenAI and Microsoft's $14 billion investment in the company. This infrastructure race is straining power grids and pushing building capacity to its limits as the industry races to meet the enormous computing demands of AI training.

TechCrunch
07

Anthropic's Claude hits No. 2 on Apple's top free apps list after Pentagon rejection

policy
Feb 28, 2026

Anthropic's Claude AI app jumped to the No. 2 position on Apple's free apps chart after the Trump administration and Department of Defense moved to block government agencies from using the company's technology, citing concerns about Anthropic's refusal to support mass domestic surveillance or fully autonomous weapons. The surge in popularity suggests consumers are responding positively to Anthropic's ethical stance, even as the Pentagon designated the company a supply-chain risk (a classification that prevents defense contractors from using its tools).

CNBC Technology
08

ClawJacked Flaw Lets Malicious Sites Hijack Local OpenClaw AI Agents via WebSocket

security
Feb 28, 2026

OpenClaw fixed a high-severity vulnerability called ClawJacked that let malicious websites hijack local AI agents by exploiting a missing rate-limiting mechanism on the gateway's WebSocket server (a protocol for two-way communication between browsers and servers). An attacker could trick a developer into visiting a malicious site, then use JavaScript to brute-force the gateway password, auto-register as a trusted device, and gain complete control over the AI agent to steal data and execute commands.

Fix: OpenClaw released version 2026.2.25 on February 26, 2026, which fixed the vulnerability. Users are advised to "apply the latest updates as soon as possible, periodically audit access granted to AI agents, and enforce appropriate governance controls for non-human (aka agentic) identities."

The Hacker News
09

OpenAI to work with Pentagon after Anthropic dropped by Trump over company’s ethics concerns

policy
Feb 28, 2026

OpenAI announced a deal to provide AI technology to classified US military networks, shortly after the Trump administration ended its relationship with Anthropic (a competing AI company that makes Claude) over ethics disagreements. Anthropic had wanted guarantees that its AI would not be used for mass surveillance or autonomous weapons systems (systems that can select and attack targets without human decision-making).

The Guardian Technology
10

OpenAI’s Sam Altman announces Pentagon deal with ‘technical safeguards’

policysecurity
Feb 28, 2026

OpenAI announced a deal allowing the Department of Defense to use its AI models on classified networks, following a dispute where rival Anthropic refused to agree to unrestricted military use without safeguards against mass domestic surveillance and fully autonomous weapons. Sam Altman stated that OpenAI's agreement includes technical protections addressing these same concerns, with OpenAI building a 'safety stack' (a set of security and control measures) and deploying engineers to ensure the models behave correctly.

Fix: According to Altman, OpenAI will 'build technical safeguards to ensure our models behave as they should' and will 'deploy engineers with the Pentagon to help with our models and to ensure their safety.' Additionally, the government will allow OpenAI to build its own 'safety stack to prevent misuse' and 'if the model refuses to do a task, then the government would not force OpenAI to make it do that task.'

TechCrunch
Prev1...169170171172173...371Next