aisecwatch.com
DashboardVulnerabilitiesNewsResearchArchiveStatsDatasetFor devs
Subscribe
aisecwatch.com

Real-time AI security monitoring. Tracking AI-related vulnerabilities, safety and security incidents, privacy risks, research developments, and policy changes.

Navigation

VulnerabilitiesNewsResearchDigest ArchiveNewsletter ArchiveSubscribeData SourcesStatisticsDatasetAPIIntegrationsWidgetRSS Feed

Maintained by

Truong (Jack) Luu

Information Systems Researcher

AI Sec Watch

The security intelligence platform for AI teams

AI security threats move fast and get buried under hype and noise. Built by an Information Systems Security researcher to help security teams and developers stay ahead of vulnerabilities, privacy incidents, safety research, and policy developments.

Independent research. No sponsors, no paywalls, no conflicts of interest.

[TOTAL_TRACKED]
5,048
[LAST_24H]
4
[LAST_7D]
147
Daily BriefingSaturday, June 27, 2026
>

AI Coding Agents Exploited via DNS-Hidden Malware: Researchers demonstrated a novel attack vector where AI coding assistants like Claude Code can be socially engineered through benign repository instructions to execute malicious payloads retrieved from DNS records (the system that translates domain names to IP addresses), bypassing traditional code review since no suspicious code appears in the repository itself. This highlights a new class of supply chain risk unique to autonomous agents that execute commands without human verification.

>

OpenAI Deploys GPT-5.6 Sol with Hardened Cyber Controls: OpenAI released a limited preview of GPT-5.6 Sol specifically tuned for cybersecurity tasks including vulnerability research and patch development, featuring enhanced jailbreak resistance (defenses against prompts designed to bypass safety restrictions) and guardrails targeting offensive cyber use cases, though the company acknowledges the dual-use controls may over-block legitimate security work during the preview period.

Latest Intel

page 1/505
VIEW ALL
01

Margaret Atwood says the problem with AI is ‘garbage in, garbage out’

safety
Jun 27, 2026

Author Margaret Atwood criticized AI chatbots after using Claude (an AI assistant made by Anthropic) to search for information about a TV show, only to receive incorrect information. She highlighted a fundamental problem with large language models (AI systems trained on vast amounts of text data that generate responses word-by-word), arguing that when they're trained on poor quality or inaccurate data, they produce unreliable outputs, a principle she described as 'garbage in, garbage out.'

Critical This Week5 issues
critical

CVE-2026-50549: Cursor is a code editor built for programming with AI. Prior to 3.0, Cursor runs agent terminal commands in a sandbox by

CVE-2026-50549NVD/CVE DatabaseJun 25, 2026
Jun 25, 2026
>

Margaret Atwood Flags Hallucination Risk in LLMs: Author Margaret Atwood publicly criticized Claude for generating factually incorrect information about a TV show, underscoring the persistent hallucination problem (when large language models confidently generate plausible but false information) inherent in systems trained on unverified or low-quality data.

The Verge (AI)
02

Clean GitHub repo tricks AI coding agents into running malware

securitysafety
Jun 27, 2026

Researchers at Mozilla's security platform discovered that AI coding agents like Claude Code can be tricked into running malware hidden inside a seemingly clean GitHub repository through a social engineering chain: a harmless-looking setup instruction causes an error, the AI automatically runs a suggested fix command, which then secretly fetches and executes malicious code from a DNS record (a server lookup system) controlled by the attacker. This attack is particularly dangerous because it leaves no suspicious code in the repository itself and the AI agent never directly evaluates the malicious payload.

Fix: According to 0DIN researchers, "AI agents should disclose the full execution chain of setup commands, including scripts and code fetched dynamically at runtime" to prevent such exploitation.

BleepingComputer
03

OpenAI Previews GPT-5.6 Sol With Restricted Access and Stronger Cyber Safeguards

securitysafety
Jun 27, 2026

OpenAI released three versions of GPT-5.6 (Sol, Terra, and Luna) as a limited preview, with Sol being the most powerful and designed for cybersecurity tasks like vulnerability research and patch development. The model includes stronger safety protections, including hardened defenses against jailbreaks (attempts to bypass safety restrictions) and guardrails to block offensive cyber activities, though OpenAI warns some legitimate requests may be blocked during the preview phase due to the dual-use nature of the technology (capabilities that can be used for both defensive and harmful purposes).

Fix: OpenAI stated it has implemented 'our most robust safety stack to date' with 'strengthened protections for higher-risk activity, sensitive cyber requests, and repeated misuse' and spent 'multiple weeks finding weaknesses, pressure-testing our system, and hardening it against real-world attacks.' The company also noted it is 'swiftly remediating newly discovered jailbreaks' and enforcing 'strong guardrails that block offensive activity.'

The Hacker News
04

Anthropic’s Mythos 5 is back

policy
Jun 26, 2026

Anthropic's Mythos 5 AI model has been allowed to resume operations for a limited group of organizations after a two-week negotiation with the Trump administration, according to a government letter. However, Fable 5, the public version of the Mythos-class model, remains unavailable with no clear timeline for when it might be released to the public.

The Verge (AI)
05

Trump admin allows Anthropic to release Mythos AI model to some companies, government agencies: Reports

policy
Jun 26, 2026

The U.S. government allowed Anthropic to release its Mythos 5 AI model to about 100 companies and federal agencies after a two-week standoff, during which Anthropic had disabled access to its latest models due to export control restrictions (government rules limiting what technology can be shared internationally). The Commerce Department said the decision was made to keep America competitive in AI while protecting national security.

CNBC Technology
06

China's Zhipu is closing in on top U.S. AI models with Anthropic and OpenAI held back

industry
Jun 26, 2026

Zhipu's GLM 5.2, a Chinese open source AI model (a model that can be freely downloaded and modified), has achieved performance comparable to top U.S. models like Anthropic's Opus 4.8 while costing significantly less, making it attractive to companies concerned about AI spending. Unlike proprietary models from OpenAI and Anthropic that face government restrictions, GLM 5.2 can be run on companies' own servers without risk of being revoked, positioning open source AI as a more reliable and cost-effective alternative for enterprise use.

CNBC Technology
07

What happened after 2,000 people tried to hack my AI assistant

securitysafety
Jun 26, 2026

A researcher ran a public challenge where 2,000 people attempted to hack an AI assistant by sending emails containing prompt injection attacks (tricks to make an AI ignore its safety rules and reveal secrets). After 6,000 total attempts, nobody successfully leaked the system's secrets, suggesting that modern AI models are becoming more resistant to these attacks through better training.

Simon Willison's Weblog
08

Cybersecurity firms targeted by fraudulent OpenAI organization invites

security
Jun 26, 2026

Attackers are creating fake OpenAI organizations impersonating real companies and sending legitimate-looking invitations to employees to trick them into sharing sensitive information like source code and internal documents in chats. The fraudulent invitations come from OpenAI's real email servers and include payment methods attached, making them difficult to spot even though OpenAI includes a warning that the inviter's email domain doesn't match the recipient's company.

Fix: Push Security recommends training employees to verify unexpected organization invitations and monitoring SaaS (software-as-a-service, cloud-based applications) organization memberships to reduce the risk of these types of attacks.

BleepingComputer
09

Cisco Adds NHI to Security Stack With Astrix, WideField Acquisitions

securityindustry
Jun 26, 2026

Cisco is acquiring companies called Astrix and WideField to add NHI (network hygiene intelligence, which monitors and maintains network health) to its security products. The company believes that securing AI agents (autonomous software programs that perform tasks with minimal human input) requires making identity, which verifies who or what is accessing a system, the main control system.

Dark Reading
10

GHSA-2jc5-xhx8-qj6h: fluent-plugin-opentelemetry Has Denial of Service (DoS) via Large Payloads and Decompression Bombs in `in_opentelemetry`

security
Jun 26, 2026

The fluent-plugin-opentelemetry plugin's HTTP input lacks size limits, allowing attackers to send huge or highly compressed files that consume excessive memory when decompressed, causing a DoS (denial of service, a type of attack that makes a service unavailable) attack by crashing the Fluentd logging process. If the OpenTelemetry endpoint (a connection point that accepts telemetry data) is exposed to untrusted networks, an attacker can exploit this to disrupt all log collection on the affected server.

Fix: Upgrade to v0.5.3. If immediate upgrade is not possible, restrict network access to the OpenTelemetry ingestion port (default 4318) using firewall rules to only trusted networks, or place a reverse proxy like Nginx in front of Fluentd to handle decompression and enforce strict size limits on both compressed and uncompressed request bodies before sending traffic to Fluentd.

GitHub Advisory Database
123...505Next
critical

CVE-2026-50548: Cursor is a code editor built for programming with AI. Prior to 3.0, Cursor runs agent terminal commands in a sandbox by

CVE-2026-50548NVD/CVE DatabaseJun 25, 2026
Jun 25, 2026
critical

CVE-2026-55413: ToolJet is the open-source foundation am AI-native platform for building and deploying internal tools, workflows and AI

CVE-2026-55413NVD/CVE DatabaseJun 25, 2026
Jun 25, 2026
critical

CVE-2026-12537: Improper Neutralization used in an OS Command in the container launcher in Google Gemini CLI (versions prior to 0.39.1)

CVE-2026-12537NVD/CVE DatabaseJun 24, 2026
Jun 24, 2026
high

Clean GitHub repo tricks AI coding agents into running malware

BleepingComputerJun 27, 2026
Jun 27, 2026