aisecwatch.com
DashboardVulnerabilitiesNewsResearchArchiveStatsDatasetFor devs
Subscribe
aisecwatch.com

Real-time AI security monitoring. Tracking AI-related vulnerabilities, safety and security incidents, privacy risks, research developments, and policy changes.

Navigation

VulnerabilitiesNewsResearchDigest ArchiveNewsletter ArchiveSubscribeData SourcesStatisticsDatasetAPIIntegrationsWidgetRSS Feed

Maintained by

Truong (Jack) Luu

Information Systems Researcher

AI Sec Watch

The security intelligence platform for AI teams

AI security threats move fast and get buried under hype and noise. Built by an Information Systems Security researcher to help security teams and developers stay ahead of vulnerabilities, privacy incidents, safety research, and policy developments.

Independent research. No sponsors, no paywalls, no conflicts of interest.

[TOTAL_TRACKED]
5,048
[LAST_24H]
3
[LAST_7D]
147
Daily BriefingSaturday, June 27, 2026
>

AI Coding Agents Exploited via DNS-Hidden Malware: Researchers demonstrated a novel attack vector where AI coding assistants like Claude Code can be socially engineered through benign repository instructions to execute malicious payloads retrieved from DNS records (the system that translates domain names to IP addresses), bypassing traditional code review since no suspicious code appears in the repository itself. This highlights a new class of supply chain risk unique to autonomous agents that execute commands without human verification.

>

OpenAI Deploys GPT-5.6 Sol with Hardened Cyber Controls: OpenAI released a limited preview of GPT-5.6 Sol specifically tuned for cybersecurity tasks including vulnerability research and patch development, featuring enhanced jailbreak resistance (defenses against prompts designed to bypass safety restrictions) and guardrails targeting offensive cyber use cases, though the company acknowledges the dual-use controls may over-block legitimate security work during the preview period.

Latest Intel

page 13/505
VIEW ALL
01

OpenAI rolls out AI-led push to fix open-source software flaws

securityindustry
Critical This Week5 issues
critical

CVE-2026-50549: Cursor is a code editor built for programming with AI. Prior to 3.0, Cursor runs agent terminal commands in a sandbox by

CVE-2026-50549NVD/CVE DatabaseJun 25, 2026
Jun 25, 2026
>

Margaret Atwood Flags Hallucination Risk in LLMs: Author Margaret Atwood publicly criticized Claude for generating factually incorrect information about a TV show, underscoring the persistent hallucination problem (when large language models confidently generate plausible but false information) inherent in systems trained on unverified or low-quality data.

Jun 23, 2026

OpenAI launched Patch the Planet, a program that uses AI to find and fix vulnerabilities (security flaws) in widely-used open-source software (code that anyone can access and modify) with help from cybersecurity firm Trail of Bits. The program combines AI-assisted vulnerability research with human review to develop tested fixes and coordinate their disclosure through existing project channels. The initiative has already identified hundreds of security issues and merged dozens of patches across projects like Python, Go, and cURL.

Fix: The source describes the Patch the Planet program itself as the mitigation approach: AI-assisted vulnerability research is used alongside human review by Trail of Bits engineers who filter out false positives and duplicate reports before sending findings to maintainers. Additionally, the source recommends that CISOs implement governance controls before deploying AI-assisted vulnerability research, including what one analyst calls a 'Safety Relevance Layer' that requires every AI-generated finding to pass automated verification with dynamic proof-of-concept validation and strong false-positive filtering before reaching a human analyst, plus predefined escalation paths and notification timelines for disclosed flaws in external dependencies.

CSO Online
02

Australia ‘sleepwalking’ into AI crisis and ‘tech bro free-for-all’, says Greens senator

policy
Jun 23, 2026

Australian politicians are raising concerns that the country is unprepared for AI development, with calls to prevent large tech companies from using Australian content to train AI models (teach AI systems by feeding them data) and to pause approval of new datacenters until proper regulations exist. The debate reflects worry that AI is advancing faster than government safeguards can keep up.

The Guardian Technology
03

OpenAI Expands Daybreak With GPT-5.5-Cyber to Help Defenders Patch Security Flaws

securityindustry
Jun 22, 2026

OpenAI released an improved GPT-5.5-Cyber model and updated Codex Security plugin (a tool for finding and fixing code problems) to help security defenders find and patch software vulnerabilities more quickly. The company is also launching Patch the Planet, a partnership with Trail of Bits to secure open-source projects, because AI models are now finding vulnerabilities faster than developers can fix them, shifting the bottleneck from discovery to patching.

Fix: OpenAI is providing the improved GPT-5.5-Cyber model to trusted defenders as part of the Daybreak initiative. The updated Codex Security plugin allows developers to run deep scans, generate reports with severity levels and affected code locations, generate codebase-specific patches for review, and facilitate patch generation at scale. The Patch the Planet initiative lets security engineers review and validate findings, work with projects to develop patches and tests, and help build reusable vulnerability discovery workflows.

The Hacker News
04

Change your cyber risk strategy to meet AI threats, Five Eyes countries warn CSOs

policysecurity
Jun 22, 2026

Five Eyes cybersecurity agencies (US, UK, Canada, Australia, New Zealand) warn that threat actors are increasingly using AI to bypass security defenses, with capabilities advancing in months rather than years, so organizations must urgently update their cyber risk strategies. They recommend that business leaders treat cybersecurity as core business risk, get security fundamentals right, use AI deliberately to strengthen defenses, and take practical actions like reducing attack surface, accelerating security patches, and preparing breach response plans. However, some experts criticize the guidance as too generic and lacking specific advice on AI-related risks.

CSO Online
05

How Omio is building the future of conversational travel

industry
Jun 22, 2026

Omio, a travel platform connecting millions of travelers with transportation options, is using conversational AI (AI that understands natural language questions from users) to let people book trips by simply describing where they want to go, rather than searching through websites. The company launched this capability through ChatGPT in 2023 by connecting OpenAI's language models directly to its real-time transportation data, and it is now using similar AI tools internally to help engineers and other employees work more efficiently.

OpenAI Blog
06

Prompt Injection as Role Confusion

securityresearch
Jun 22, 2026

Researchers discovered that AI models struggle to distinguish between their own internal instructions (wrapped in tags like <system> and <think>) and untrusted user input (wrapped in <user> tags), a problem called role confusion. The models pay more attention to the writing style of text than its actual meaning, allowing attackers to craft jailbreaks (unauthorized bypasses of safety rules) by mimicking the style of internal thinking blocks. However, rewriting malicious text in a different style (called 'destyling') significantly reduced attack success rates from 61% to 10%, showing that format changes can help models better distinguish between trusted and untrusted content.

Fix: The source explicitly mentions 'destyling' as having material impact: 'destyling causes average attack success in our dataset to plunge from 61% to 10%.' Destyling is described as 'rewriting text in a slightly different way such that it looked less like the expected format in a role tag.' However, the source does not present this as an implemented solution or official mitigation—only as a research finding about what reduces attack effectiveness. No deployed fix, patch, or official defense mechanism is described in the text.

Simon Willison's Weblog
07

Porting the Moebius 0.2B image inpainting model to run in the browser with Claude Code

industry
Jun 22, 2026

This article describes successfully porting the Moebius image inpainting model (a small AI model that can remove objects from images and fill in the missing areas) to run in a web browser using WebGPU (a graphics technology that lets browsers use GPU acceleration). The author used Claude Code, an AI coding agent, to help convert the model from Python and NVIDIA CUDA (specialized GPU software for training AI) into a web-compatible format using ONNX Runtime Web.

Simon Willison's Weblog
08

GHSA-gfq7-5x4g-3xhf: @budibase/backend-core has potential SSRF DNS rebinding bypass in outbound fetch validation

security
Jun 22, 2026

Budibase has a DNS rebinding vulnerability (a type of attack where DNS lookups return different IP addresses at different times) in its SSRF protection. The software checks if a hostname is safe by looking up its IP address and checking a blacklist, but then performs a separate DNS lookup when actually connecting. An attacker controlling DNS can return a public IP during the safety check and a private/internal IP during the actual connection, allowing them to access internal services like localhost or cloud metadata endpoints.

GitHub Advisory Database
09

CVE-2026-55443: LangChain is a framework for building agents and LLM-powered applications. Prior to 1.3.9, several LangChain components

security
Jun 22, 2026

LangChain, a framework for building AI agents and applications powered by large language models, had a vulnerability before version 1.3.9 where several components that work with file paths did not properly restrict access to files. This meant attackers could use glob patterns (wildcards for matching multiple files), symlinks (shortcuts to files), or specially crafted paths to read files outside the intended directory, especially when an AI system processes untrusted input. The vulnerability allowed unauthorized file disclosure.

Fix: This vulnerability is fixed in version 1.3.9.

NVD/CVE Database
10

Anthropic says Claude may want to see your ID

privacypolicy
Jun 22, 2026

Anthropic updated its privacy policy to allow Claude users to appeal account flags by uploading government-issued ID documents and biometric data (facial scans and face geometry templates, which are digital measurements of facial features). The policy applies only to a small subset of users whose accounts are flagged for fraud rather than immediately banned, and Anthropic says it uses this verification to comply with various legal requirements and security measures.

TechCrunch (Security)
Prev1...1112131415...505Next
critical

CVE-2026-50548: Cursor is a code editor built for programming with AI. Prior to 3.0, Cursor runs agent terminal commands in a sandbox by

CVE-2026-50548NVD/CVE DatabaseJun 25, 2026
Jun 25, 2026
critical

CVE-2026-55413: ToolJet is the open-source foundation am AI-native platform for building and deploying internal tools, workflows and AI

CVE-2026-55413NVD/CVE DatabaseJun 25, 2026
Jun 25, 2026
critical

CVE-2026-12537: Improper Neutralization used in an OS Command in the container launcher in Google Gemini CLI (versions prior to 0.39.1)

CVE-2026-12537NVD/CVE DatabaseJun 24, 2026
Jun 24, 2026
high

Clean GitHub repo tricks AI coding agents into running malware

BleepingComputerJun 27, 2026
Jun 27, 2026