aisecwatch.com
DashboardVulnerabilitiesNewsResearchArchiveStatsDatasetFor devs
Subscribe
aisecwatch.com

Real-time AI security monitoring. Tracking AI-related vulnerabilities, safety and security incidents, privacy risks, research developments, and policy changes.

Navigation

VulnerabilitiesNewsResearchDigest ArchiveNewsletter ArchiveSubscribeData SourcesStatisticsDatasetAPIIntegrationsWidgetRSS Feed

Maintained by

Truong (Jack) Luu

Information Systems Researcher

AI Sec Watch

The security intelligence platform for AI teams

AI security threats move fast and get buried under hype and noise. Built by an Information Systems Security researcher to help security teams and developers stay ahead of vulnerabilities, privacy incidents, safety research, and policy developments.

Independent research. No sponsors, no paywalls, no conflicts of interest.

[TOTAL_TRACKED]
5,048
[LAST_24H]
3
[LAST_7D]
147
Daily BriefingSaturday, June 27, 2026
>

AI Coding Agents Exploited via DNS-Hidden Malware: Researchers demonstrated a novel attack vector where AI coding assistants like Claude Code can be socially engineered through benign repository instructions to execute malicious payloads retrieved from DNS records (the system that translates domain names to IP addresses), bypassing traditional code review since no suspicious code appears in the repository itself. This highlights a new class of supply chain risk unique to autonomous agents that execute commands without human verification.

>

OpenAI Deploys GPT-5.6 Sol with Hardened Cyber Controls: OpenAI released a limited preview of GPT-5.6 Sol specifically tuned for cybersecurity tasks including vulnerability research and patch development, featuring enhanced jailbreak resistance (defenses against prompts designed to bypass safety restrictions) and guardrails targeting offensive cyber use cases, though the company acknowledges the dual-use controls may over-block legitimate security work during the preview period.

Latest Intel

page 12/505
VIEW ALL
01

Something’s off with Midjourney’s pivot to body scanners

industry
Jun 23, 2026

Midjourney, an AI company known for its image generator, announced a new medical imaging product: an experimental ultrasound scanner that would immerse users in water to produce detailed body images similar to MRI (magnetic resonance imaging, a medical scanning technique). Medical imaging experts expressed skepticism about the technology, saying Midjourney has not yet shown sufficient public evidence to support its claims that the system could match or exceed MRI capabilities.

Critical This Week5 issues
critical

CVE-2026-50549: Cursor is a code editor built for programming with AI. Prior to 3.0, Cursor runs agent terminal commands in a sandbox by

CVE-2026-50549NVD/CVE DatabaseJun 25, 2026
Jun 25, 2026
>

Margaret Atwood Flags Hallucination Risk in LLMs: Author Margaret Atwood publicly criticized Claude for generating factually incorrect information about a TV show, underscoring the persistent hallucination problem (when large language models confidently generate plausible but false information) inherent in systems trained on unverified or low-quality data.

The Verge (AI)
02

AI PACs pour $20 million into New York Democratic primary in AI regulation battle

policyindustry
Jun 23, 2026

AI companies are spending over $20 million in a New York congressional race between AI safety advocate Alex Bores and two other candidates, with competing super PACs (political action committees, groups that raise unlimited money for political causes) backing different approaches to AI regulation. Leading the Future, backed by companies like OpenAI and Andreessen Horowitz, opposes Bores and favors lighter regulation, while Public First Action, funded by Anthropic, supports Bores and advocates for stricter safety requirements built into AI models from the start. This race has become a proxy battle over whether the U.S. government should heavily regulate the AI industry or allow it to develop with fewer restrictions.

CNBC Technology
03

Fake AI Agent Skill Passed Security Scans and Reportedly Reached 26,000 Agents

securitysafety
Jun 23, 2026

A security firm created a fake AI agent skill (a bundle of instructions that agents load and follow) that bypassed all security scanners and reached approximately 26,000 agents by exploiting a structural weakness: scanners only check the skill's initial package, but attackers can change the external webpage the skill points to after it passes review. The fake skill appeared legitimate through inherited GitHub credibility and targeted ads, demonstrating that current trust signals and scanning tools fail to catch sophisticated attacks.

Fix: Treat skills as software, not text, by vetting what a skill points to externally, not just what ships inside it. Route new skills through a single source you control and re-check them when anything changes since a clean result at install does not stay clean if the skill connects to a link someone else can edit. Additionally, pin versions, hold agents to the least privilege (minimum access needed to function), and assume any external instruction an agent fetches runs with the agent's full access level.

The Hacker News
04

Helping build shared standards for advanced AI

policy
Jun 23, 2026

Advanced AI models offer benefits like stronger cybersecurity and faster scientific discovery, but they also pose safety and security risks if their capabilities aren't properly understood or safeguarded. To address this, OpenAI helped found the Appia Foundation (an organization hosted by the Linux Foundation), which will create open technical standards and assessment criteria that allow different organizations and governments to evaluate and trust AI systems using a shared language and consistent methods.

Fix: The source discusses initiatives to build standards and governance frameworks rather than fixing a specific vulnerability. Explicitly mentioned approaches include: developing open, modular specifications through Appia, establishing a strengthened Center for AI Standards and Innovation (CAISI), creating a 'shared playbook for trustworthy third-party evaluations' that requires disclosure of the system tested, tool access, evaluation methods, available resources, and validation checks, and implementing OpenAI's Preparedness Framework and Frontier Governance Framework to operationalize risk management practices around risk assessment, model reporting, security controls, and incident response.

OpenAI Blog
05

AI Threat Readiness Pillar 4: Detect and contain threats in real-time

security
Jun 23, 2026

Traditional security detection tools were not designed to handle AI-era threats, which move faster and create new attack surfaces through prompt injection (tricking AI by hiding instructions in its input), coding agents accessing codebases, and cloud-native AI services. The document argues that manual investigation by security analysts is too slow when the time between initial access and damage can shrink to minutes, requiring instead real-time detection with automated investigation and containment rather than human-driven responses.

Wiz Research Blog
06

The Download: the future of chipmaking and Anthropic’s government clash

securitypolicy
Jun 23, 2026

This newsletter roundup covers several AI and tech developments, including ASML's $400 million lithography machine (a tool that uses extreme-ultraviolet light to pattern features on computer chips) that dominates global chipmaking, tensions between Anthropic and the US government over export controls on an AI coding model, and Meta pausing an AI training program that tracked workers' keystrokes after sensitive data was leaked.

MIT Technology Review
07

AI in the classroom prompts tide of concern from US parents and experts

policyindustry
Jun 23, 2026

Some parents and education experts are concerned that using AI chatbots (software programs that simulate conversation) like Google Gemini in classrooms may discourage independent thinking, with critics arguing there is little evidence these tools actually help students learn. One parent in New York objected to an assignment where students used an AI chatbot for feedback instead of discussing improvements with peers or teachers.

The Guardian Technology
08

Agentic AI: The Weapon That No Longer Needs a Warrior

securitysafety
Jun 23, 2026

Agentic AI (artificial intelligence systems that can independently execute tasks without human intervention at each step) represents a major shift in cybersecurity threats because it allows attackers to move from using AI as a drafting tool to using it as an autonomous weapon that can plan and carry out attacks on its own. This technology lowers the barrier to entry for unskilled attackers while dramatically speeding up campaigns from experienced ones, creating a broader threat landscape where attackers can now operate at speeds and scales that were previously impossible.

The Hacker News
09

OpenAI Refocuses Cybersecurity Efforts on Patching Over Discovery

securityindustry
Jun 23, 2026

OpenAI expanded its Daybreak cybersecurity initiative to focus on fixing vulnerabilities faster rather than just finding them, arguing that AI models have made vulnerability discovery so fast that security teams are overwhelmed by the volume of findings. The company released an updated Codex Security plugin (a tool that scans code and generates patches) and GPT-5.5-Cyber (a specialized AI model for security work), along with Patch the Planet, a program that deploys security experts to help open source projects validate and fix vulnerabilities.

Fix: OpenAI released an updated Codex Security plugin that 'can scan entire codebases, trace attack paths, construct threat models, validate findings, generate patches, and export results into existing vulnerability management pipelines via SARIF files and CodeQL queries.' The company also launched GPT-5.5-Cyber, described as capable of 'sustain[ing] analysis across large codebases, assess[ing] whether vulnerable code is actually reachable, and carry[ing] work through to patch development and testing.' Additionally, Patch the Planet deploys expert security researchers to work with open source project maintainers to handle 'validation, deduplication, and patch development.'

SecurityWeek
10

Anthropic’s Fable 5 Model Jailbroken Within Days

securitysafety
Jun 23, 2026

Anthropic's Fable 5 model was successfully jailbroken (tricked into bypassing its safety restrictions) shortly after its release, despite the company's claims that it had been thoroughly tested for security. The source criticizes overconfident security statements, noting that even rigorous testing cannot guarantee that vulnerabilities will not be discovered.

Schneier on Security
Prev1...1011121314...505Next
critical

CVE-2026-50548: Cursor is a code editor built for programming with AI. Prior to 3.0, Cursor runs agent terminal commands in a sandbox by

CVE-2026-50548NVD/CVE DatabaseJun 25, 2026
Jun 25, 2026
critical

CVE-2026-55413: ToolJet is the open-source foundation am AI-native platform for building and deploying internal tools, workflows and AI

CVE-2026-55413NVD/CVE DatabaseJun 25, 2026
Jun 25, 2026
critical

CVE-2026-12537: Improper Neutralization used in an OS Command in the container launcher in Google Gemini CLI (versions prior to 0.39.1)

CVE-2026-12537NVD/CVE DatabaseJun 24, 2026
Jun 24, 2026
high

Clean GitHub repo tricks AI coding agents into running malware

BleepingComputerJun 27, 2026
Jun 27, 2026