aisecwatch.com
DashboardVulnerabilitiesNewsResearchArchiveStatsDatasetFor devs
Subscribe
aisecwatch.com

Real-time AI security monitoring. Tracking AI-related vulnerabilities, safety and security incidents, privacy risks, research developments, and policy changes.

Navigation

VulnerabilitiesNewsResearchDigest ArchiveNewsletter ArchiveSubscribeData SourcesStatisticsDatasetAPIIntegrationsWidgetRSS Feed

Maintained by

Truong (Jack) Luu

Information Systems Researcher

AI Sec Watch

The security intelligence platform for AI teams

AI security threats move fast and get buried under hype and noise. Built by an Information Systems Security researcher to help security teams and developers stay ahead of vulnerabilities, privacy incidents, safety research, and policy developments.

Independent research. No sponsors, no paywalls, no conflicts of interest.

[TOTAL_TRACKED]
5,047
[LAST_24H]
5
[LAST_7D]
146
Daily BriefingSaturday, June 27, 2026
>

AI Coding Agents Vulnerable to DNS-Based Malware Injection: Researchers demonstrated that AI coding assistants can be manipulated through a social engineering chain where benign setup instructions trigger errors, prompting the AI to execute a suggested fix command that covertly retrieves and runs malicious code from attacker-controlled DNS records (the system that translates domain names to IP addresses). The attack is particularly insidious because the malicious payload never appears in the repository itself, evading traditional code review.

>

OpenAI Releases GPT-5.6 Sol With Enhanced Cybersecurity Controls: OpenAI launched a limited preview of GPT-5.6 Sol, its most capable model optimized for vulnerability research and patch development, featuring reinforced defenses against jailbreaks (techniques to circumvent safety restrictions) and guardrails to prevent offensive cyber operations. The company acknowledges the model may over-block legitimate security research requests during preview due to the dual-use nature of advanced cybersecurity capabilities.

Latest Intel

page 6/505
VIEW ALL
01

GHSA-4vp2-6q8c-pvq2: @anthropic-ai/claude-code has an Insecure Temporary File in /copy Command that Enables Response Disclosure and Symlink-Based File Write

security
Jun 25, 2026
Critical This Week5 issues
critical

CVE-2026-50549: Cursor is a code editor built for programming with AI. Prior to 3.0, Cursor runs agent terminal commands in a sandbox by

CVE-2026-50549NVD/CVE DatabaseJun 25, 2026
Jun 25, 2026

Claude Code's `/copy` command had a serious security flaw where it saved responses to an easily guessable file location (`/tmp/claude/response.md`) that any user on the system could read, potentially exposing secrets or credentials. An attacker could also create a symlink (a shortcut to another file) at that location to trick the command into overwriting any file they chose. This vulnerability required the attacker and a privileged user to be on the same computer.

Fix: Users on standard Claude Code auto-update have already received this fix. Users performing manual updates are advised to update to the latest version.

GitHub Advisory Database
02

New macOS malware embeds fake errors to confuse AI analysis tools

securitysafety
Jun 25, 2026

A macOS malware called "Gaslight" uses prompt injection (tricking an AI by hiding instructions in its input) to confuse AI-powered malware analysis tools by embedding fake error messages, crash reports, and debugging data within the executable file. The malware contains 38 fabricated system messages designed to make LLM (large language model)-assisted analysis tools question their own sessions or stop analyzing the malware, rather than trying to evade detection in sandboxes (isolated test environments). Researchers attribute the malware to a North Korean-linked threat actor, and while it hasn't been shown to successfully bypass current AI analysis platforms, it suggests attackers are developing new anti-analysis techniques targeting AI-based security tools.

BleepingComputer
03

CVE-2026-54036: LibreChat is an enhanced ChatGPT clone that supports multiple AI providers. Prior to 0.8.4-rc1, the GET /api/auth/2fa/en

security
Jun 25, 2026

LibreChat, a ChatGPT-like application supporting multiple AI providers, has a security flaw in versions before 0.8.4-rc1 where an attacker with a valid session token (a code that proves you're logged in) can disable a user's two-factor authentication (2FA, an extra security layer requiring a second verification step) without permission. The attacker can overwrite the TOTP secret (a code used to generate login verification codes) and backup codes, then disable 2FA entirely, locking the real owner out of their account.

Fix: This vulnerability is fixed in 0.8.4-rc1.

NVD/CVE Database
04

Computer-Use and TOCTOU: What You Click Is Not What You Get!

securityresearch
Jun 25, 2026

A TOCTOU attack (time-of-check to time-of-use, a type of race condition where a system checks something and then uses it, but the situation changes in between) can trick AI agents that control computers by changing what's on the screen while the AI is thinking. For example, an attacker can swap out a button with a different one, or overlay a fake button on top of a real one, so the AI clicks something it didn't intend to, like sending an email or visiting a malicious site.

Fix: "Ensure that the UI hasn't changed before taking an action." Anthropic addressed this in Claude Computer-Use by implementing a check to "ensure that pixels haven't changed before action," according to Felix Rieseberg's announcement when the feature shipped.

Embrace The Red
05

Understanding Hallucinations in Large Visual and Language Models

researchsafety
Jun 25, 2026

This academic survey examines hallucinations in large visual and language models, which are instances where AI systems generate false or nonsensical information that appears plausible. The paper, published in ACM Computing Surveys in October 2026, provides a comprehensive overview spanning 36 pages of research on this problem affecting both language models (AI systems trained on text) and multimodal models (AI systems that process both images and text).

ACM Digital Library (TOPS, DTRAP, CSUR)
06

Interesting Paper Exploring Prompt Injection

researchsafety
Jun 25, 2026

A research paper shows that large language models (LLMs) are vulnerable to prompt injection attacks (tricks where attackers hide malicious instructions in text input) because they rely on role tags (formatting markers that separate different instruction blocks) as their main security mechanism, but these tags don't actually reflect how the model processes information internally. The researchers conclude that unless LLMs develop a genuine ability to understand and maintain role boundaries, prompt injection attacks will remain difficult to prevent permanently.

Schneier on Security
07

Rethinking the balance between AI oversight and innovation

policyindustry
Jun 25, 2026

CIOs face pressure to rapidly adopt AI across their organizations to prove business value, but must balance this speed with managing new security and governance risks. AI introduces unique challenges because its behavior is indeterminate (unpredictable and hard to verify like traditional technology) and employees are eager to use it without oversight, creating what's called shadow use (unauthorized use of tools that bypasses IT controls). Organizations should clarify their specific business goals and conduct a risk assessment before implementing AI rather than adopting it out of fear of falling behind.

CSO Online
08

New Gaslight macOS Malware Uses Prompt Injection to Disrupt AI-Assisted Analysis

securitysafety
Jun 25, 2026

A new malware called Gaslight, created by North Korea-aligned hackers, targets macOS systems and uses prompt injection (tricking an AI by hiding instructions in its input) to disrupt AI tools that analyze malware. The malware embeds fake system-failure messages designed to confuse AI-assisted analysis tools, while also stealing sensitive data like browser histories and passwords through a command-and-control (C2, a server that lets attackers remotely control infected computers) channel powered by Telegram.

The Hacker News
09

Anthropic's latest hiring spree reveals where it's building AI data centers next

industry
Jun 25, 2026

Anthropic, a major AI company, is rapidly expanding its data center operations in Asia-Pacific by hiring 13 people, with eight positions in Australia and Japan, to handle increasing demand for its AI products. The company is building infrastructure in these regions because they offer advantages like renewable energy, political stability, and security benefits, though Australia's copyright laws present a potential obstacle to large-scale expansion.

CNBC Technology
10

How agents are transforming work

industry
Jun 24, 2026

Agentic AI (AI systems that can work independently on tasks for extended periods, rather than just answering single questions) is transforming how people work by handling longer, more complex tasks instead of short interactions. At OpenAI, a tool called Codex shifted from being used mainly by engineers to becoming the primary AI tool across all departments, including non-technical ones like Legal and Recruiting, with usage growing dramatically over the past year. Users increasingly delegate tasks that would take humans hours or even days to complete, with some users running dozens of hours of parallel agent tasks in a single day.

OpenAI Blog
Prev1...45678...505Next
critical

CVE-2026-50548: Cursor is a code editor built for programming with AI. Prior to 3.0, Cursor runs agent terminal commands in a sandbox by

CVE-2026-50548NVD/CVE DatabaseJun 25, 2026
Jun 25, 2026
critical

CVE-2026-55413: ToolJet is the open-source foundation am AI-native platform for building and deploying internal tools, workflows and AI

CVE-2026-55413NVD/CVE DatabaseJun 25, 2026
Jun 25, 2026
critical

CVE-2026-12537: Improper Neutralization used in an OS Command in the container launcher in Google Gemini CLI (versions prior to 0.39.1)

CVE-2026-12537NVD/CVE DatabaseJun 24, 2026
Jun 24, 2026
high

Clean GitHub repo tricks AI coding agents into running malware

BleepingComputerJun 27, 2026
Jun 27, 2026