aisecwatch.com
DashboardVulnerabilitiesNewsResearchArchiveStatsDatasetFor devs
Subscribe
aisecwatch.com

Real-time AI security monitoring. Tracking AI-related vulnerabilities, safety and security incidents, privacy risks, research developments, and policy changes.

Navigation

VulnerabilitiesNewsResearchDigest ArchiveNewsletter ArchiveSubscribeData SourcesStatisticsDatasetAPIIntegrationsWidgetRSS Feed

Maintained by

Truong (Jack) Luu

Information Systems Researcher

AI Sec Watch

The security intelligence platform for AI teams

AI security threats move fast and get buried under hype and noise. Built by an Information Systems Security researcher to help security teams and developers stay ahead of vulnerabilities, privacy incidents, safety research, and policy developments.

Independent research. No sponsors, no paywalls, no conflicts of interest.

[TOTAL_TRACKED]
4,505
[LAST_24H]
5
[LAST_7D]
161
Daily BriefingSaturday, June 6, 2026
>

OpenAI Launches Lockdown Mode to Counter Prompt Injection: OpenAI introduced Lockdown Mode, a security feature that disables web browsing, image retrieval, deep research, and agent capabilities to mitigate prompt injection attacks (when malicious instructions hidden in webpages or uploaded content manipulate an AI's responses). The feature reduces but does not eliminate exfiltration risks, as attacks may still occur through cached content or uploaded files.

>

AI Agent Discovers 21 Zero-Days in FFmpeg as Chrome Patches Record 429 Bugs: An AI security agent uncovered 21 previously unknown vulnerabilities (zero-days, or security flaws unknown to the public) in FFmpeg, a widely-used media library, while Google released Chrome 149 with 429 security patches in a single update. The development underscores how AI tools are accelerating vulnerability discovery, creating pressure on security teams to remediate flaws at unprecedented speed.

Latest Intel

page 1/451
VIEW ALL
01

OpenAI unveils Lockdown Mode to protect sensitive data from prompt injection attacks

securitysafety
Critical This Week5 issues
critical

CVE-2026-32625: LibreChat is an enhanced ChatGPT clone that supports multiple AI providers. In versions up to and including 0.8.3, the M

CVE-2026-32625NVD/CVE DatabaseJun 2, 2026
Jun 2, 2026
>

Meta's AI Account Support Tool Exploited for Account Takeover: Meta's AI-powered account support tool, launched in March to automate password resets and other functions, has been discovered and exploited by attackers to take over user accounts. The vulnerability represents a concerning case of AI automation introducing new attack vectors in critical authentication systems.

>

Persistent Backdoor Attack Targets Personalized Federated Learning: Researchers identified a stealthy backdoor attack (hidden malicious code inserted into AI systems) specifically designed to compromise personalized federated learning (a privacy-focused method where multiple computers train an AI model together without sharing raw data). The attack is engineered to evade detection and persist across training cycles.

Jun 6, 2026

OpenAI introduced Lockdown Mode, a new security feature designed to protect against prompt injection attacks (when malicious instructions are hidden in webpages or uploaded content to manipulate an AI's responses). The feature disables several ChatGPT capabilities including live web browsing, image retrieval, deep research, and agent mode to reduce the risk of sensitive data being exposed, though OpenAI acknowledges that prompt injections could still occur through cached content or uploaded files.

Fix: OpenAI's explicit mitigation is Lockdown Mode, which "will disable live web browsing (so you can only access cached content), the retrieval and display of images from the web (you can still generate images), deep research, and agent mode." The feature is being rolled out to ChatGPT Business accounts and eligible personal accounts. OpenAI states the goal is "to reduce the likelihood that sensitive data gets shared in the process."

TechCrunch (Security)
02

SemAlign-PFL:Exploring stealthy and persistent backdoor attacks against personalized federated learning

securityresearch
Jun 6, 2026

Researchers discovered a new type of backdoor attack (hidden malicious code inserted into AI systems) that works against personalized federated learning (a privacy-focused method where multiple computers train an AI model together without sharing raw data). The attack is designed to be stealthy and persistent, meaning it can hide from detection and remain in the system over time.

Elsevier Security Journals
03

Meta made its own AI-generated clickbait news feed

safety
Jun 6, 2026

Meta has added a 'For You' section to its standalone Meta AI app that generates clickbait-style news articles using AI, complete with AI-created topics, images, and text. The app previously featured a 'Discover' feed showing AI-generated images and conversations from users who were sometimes unaware their content was public, but this has been replaced with a standard chatbot interface.

The Verge (AI)
04

New ChatGPT Lockdown Mode Limits Tools That Could Enable Data Exfiltration

securitysafety
Jun 6, 2026

OpenAI has launched Lockdown Mode, a security feature for ChatGPT that reduces the risk of data exfiltration from prompt injection attacks (tricking an AI by hiding malicious instructions in its input) by limiting tools that connect to external services. The mode disables features like web browsing, image retrieval, file downloads, and certain agent capabilities to block potential pathways attackers could use to steal sensitive data, though it does not completely eliminate all exfiltration risks.

Fix: OpenAI recommends enabling Lockdown Mode, described as "an optional advanced security setting that limits many tools and capabilities in OpenAI products that can connect to the web or external services." The feature specifically disables live web browsing, image support, deep research agent mode, canvas networking, and file downloads. Additionally, OpenAI has launched a new account management feature that enables users to "review active ChatGPT sessions and log out of individual or all sessions if signs of unauthorized account activity are detected."

The Hacker News
05

Here comes new Siri again

industry
Jun 6, 2026

Apple is preparing to reintroduce an updated version of Siri at WWDC, building on a redesign first shown in 2024 that included a new visual appearance, additional voice options, and the ability to route questions to ChatGPT (a large language model made by OpenAI). Apple has faced criticism because promised AI features under the "Apple Intelligence" branding were delayed, and the company is now settling a lawsuit over misleading marketing around these capabilities.

The Verge (AI)
06

Crypto-Funded Chinese Peptide Labs Are Booming

securityprivacy
Jun 6, 2026

Meta has embedded dormant face recognition code (technology that identifies people by matching their faces to stored images) called NameTag in over 50 million phones through its Ray-Ban and Oakley smart glasses app, despite previously abandoning this technology after settling biometric privacy lawsuits. Additionally, Meta's AI-powered account support tool, which was introduced in March to automate functions like password resets, has been discovered by hackers who can exploit it to take over user accounts.

Wired (Security)
07

AI Agent Uncovers 21 Zero-Days in FFmpeg; Chrome Patches Record 429 Bugs

securityresearch
Jun 6, 2026

An AI security agent discovered 21 previously unknown vulnerabilities (zero-days, or security flaws unknown to the public) in FFmpeg, a widely-used media library, while Google released Chrome 149 with a record 429 security patches in a single update. The article highlights how AI tools are finding vulnerabilities faster and cheaper than before, forcing security teams and software maintainers to work harder to keep up with the increased pace of bug discoveries.

Fix: For FFmpeg: pull the fixed upstream build or your distribution's security update as soon as it lands, and prioritize patching anything that processes untrusted RTSP (Real Time Streaming Protocol, a video streaming standard) or AV1-over-RTP (video compression format over network packets). Also check and patch embedded FFmpeg copies in Python packages, container images, and appliances. For Chrome: update to version 149.0.7827.53 on Linux or 149.0.7827.53/54 on Windows and macOS, or confirm auto-update has completed.

The Hacker News
08

OpenAI Help: Lockdown Mode

securitysafety
Jun 5, 2026

OpenAI has released Lockdown Mode, a security feature that prevents the final stage of data exfiltration (stealing and sending sensitive information) from prompt injection attacks (tricking an AI by hiding malicious instructions in its input) by blocking outbound network requests. However, Lockdown Mode does not stop prompt injections from appearing in the content ChatGPT processes, meaning attackers can still manipulate the AI's responses through cached web content or uploaded files.

Fix: Enable Lockdown Mode, which is rolling out to eligible personal accounts (Free, Go, Plus, and Pro tiers) and self-serve ChatGPT Business accounts. According to the source, Lockdown Mode uses deterministic mechanisms (fixed, rule-based processes) to restrict exfiltration vectors, rather than relying on AI systems to detect attacks.

Simon Willison's Weblog
09

Trump administration, OpenAI discussing possible government stake in the AI startup

policyindustry
Jun 5, 2026

OpenAI CEO Sam Altman and the White House are discussing a possible government stake in OpenAI, with talks ongoing for over a year. As part of the potential agreement, OpenAI could donate equity to create a 'Public Wealth Fund' that would invest in long-term assets and allow citizens to share in the financial benefits of AI growth. No official investment terms have been decided, and all details remain subject to change.

CNBC Technology
10

CVE-2025-31133, CVE-2025-52565, CVE-2025-52881 - runc container issues

security
Jun 5, 2026

Three security vulnerabilities (CVE-2025-31133, CVE-2025-52565, CVE-2025-52881) were found in runc, a component used by container management systems (tools that package and run isolated software environments). AWS says these issues don't create cross-customer risk because AWS doesn't rely on containers as a security boundary (a protective barrier between different users). AWS customers using containers to isolate their own internal workloads should contact their operating system vendor for updates.

Fix: AWS recommends applying all security patches and software version updates as a best practice. Customers using containers to isolate workloads within their own environments should contact their operating system vendor for any updates or instructions necessary to mitigate these issues.

AWS Security Bulletins
123...451Next
critical

CVE-2026-47117: OpenMed before 1.5.2 contains a remote code execution vulnerability in the PII privacy-filter model loading path. The pr

CVE-2026-47117NVD/CVE DatabaseJun 2, 2026
Jun 2, 2026
critical

Flowise’s MCP implementation can run ghost commands

CSO OnlineJun 1, 2026
Jun 1, 2026
critical

OpenAI Codex Authentication Tokens Stolen in codexui-android npm Supply Chain Attack

The Hacker NewsJun 1, 2026
Jun 1, 2026
high

CVE-2026-0830 - Command Injection in Kiro GitLab Merge Request Helper

AWS Security BulletinsJun 5, 2026
Jun 5, 2026