AI Sec Watch: A Security Intelligence Platform for AI Systems

Luu, T.J.

How we contain Claude across products

infonews

securitysafety

May 30, 2026

Anthropic published documentation explaining how they use multiple containment techniques to restrict what Claude can do across their products. They use process sandboxes (isolated execution environments), virtual machines (complete simulated computers), filesystem boundaries (limiting file access), and egress controls (preventing unauthorized data transfer) to prevent AI agents from accessing credentials, exfiltrating data (stealing information), or reaching unintended systems, even if a user, the AI model, or an attacker tries to find workarounds.

Fix: Anthropic implements containment through: gVisor for Claude.ai, Seatbelt (macOS) and Bubblewrap (Linux) for Claude Code, and full VMs using Apple's Virtualization framework (macOS) or HCS (Windows) for Claude Cowork. They also prevent credentials from entering sandboxes in the first place, ensuring they cannot be exfiltrated regardless of how an agent tries to access them.

Simon Willison's Weblog

Anthropic’s alliance with pope on AI harms: all in good faith or ‘Vatican-washing?’

infonews

policyindustry

Russia-aligned crime group Greyvibe extensively uses AI in attacks

highnews

security

May 29, 2026

Researchers discovered Greyvibe, a Russia-aligned crime group that uses large language models (LLMs, AI systems trained to generate text) extensively throughout its cyberattacks against Ukrainian targets, including government and military organizations. The group has used generative AI to create spear phishing emails (fraudulent messages pretending to come from trusted sources), malicious scripts, and custom malware programs like PhantomRelay and LegionRelay (remote access trojans, or RATs, which are tools that let attackers control compromised computers). Greyvibe has conducted multiple campaigns since August 2025 using various attack methods, from fake websites to ClickFix-style attacks (tricks that convince users to run malicious commands on their computers).

Microsoft and security researcher’s dueling posts about cybersecurity disclosures get nasty

infonews

security

May 29, 2026

A cybersecurity researcher named Nightmare Eclipse and Microsoft had a public conflict over responsible disclosure practices, with the researcher publishing vulnerability details after claiming Microsoft ignored his reports, while Microsoft argued that uncoordinated disclosures (releasing bug information before patches are available) create unnecessary risk for users. Tom Gallagher, a Microsoft security executive, acknowledged the debate over whether current patching practices fit today's landscape but stated the company is not currently changing its policies, though it will continue to evaluate them.

ChatGPT share links abused to host fake outage pages to deliver malware

highnews

security

May 29, 2026

Attackers are abusing ChatGPT's share feature (which lets users publish rendered content on legitimate OpenAI URLs) to display fake outage pages that trick users into downloading malware disguised as the ChatGPT desktop application. The "LLMShare" campaign uses Google ads to direct people to these malicious shared pages, which appear to come from OpenAI's domain but actually deliver malware-infected downloads through a fake installation portal.

ChatGPhish Vulnerability Turns ChatGPT Web Summaries Into a Phishing Surface

highnews

securitysafety

DNS-AID will make AI agents easier to discover, says Linux Foundation

infonews

industry

May 29, 2026

The Linux Foundation is promoting DNS-AID, a new standard that allows AI agents (autonomous programs that can act independently) to find and communicate with each other using DNS (the system that translates website names into IP addresses) instead of requiring separate proprietary registries. DNS-AID enables agents and MCP (Model Context Protocol, a standard for how agents exchange information) servers to use the existing internet infrastructure as a vendor-neutral directory, with domain owners creating a special DNS address at _index._agents.{domain} as a discovery point.

SpaceX skeptics have added reason for concern after Musk comments diverge from IPO filing

infonews

industrypolicy

Attackers Use LLM Agent for Post-Exploitation After Marimo CVE-2026-39987 Exploit

highnews

securitysafety

Dan Ives: Anthropic’s growth is 'just the tip of the spear' for AI rally

infonews

industry

May 29, 2026

Anthropic, an AI company, recently achieved a $965 billion valuation after securing $65 billion in funding, and analyst Dan Ives believes investor interest in AI is far from peaked and will expand to data layer companies (companies that manage and organize data). Ives predicts a major market rally with several large public offerings planned for 2026, though some analysts warn this could signal a market peak similar to the dot-com bubble of the late 1990s.

EU seeks to 'intensify' talks with U.S. on advanced cyber AI models, official tells CNBC, amid Mythos concerns

infonews

policysecurity

How Braintrust turns customer requests into code with Codex

infonews

industry

May 29, 2026

Braintrust, an AI observability company, uses Codex (OpenAI's code-generation AI model) to quickly turn customer feature requests into working preview branches in minutes, with half the team adopting it within one month. The speed of Codex enables faster feedback loops with customers and allows engineers to test ideas in real time rather than letting requests sit in a backlog. Codex's ability to handle large amounts of text output without slowing down makes it more effective than other models for this workflow.

Boston Children’s uses AI to unlock new diagnoses

infonews

industry

May 29, 2026

Boston Children's Hospital integrated AI (artificial intelligence) across its entire organization as a core part of clinical and operational work, rather than treating it as a separate experiment. By building an enterprise AI layer (a shared, secure internal AI system used across teams) and redesigning workflows in areas like supply chain and surgical scheduling, the hospital has diagnosed over 40 previously unresolved rare conditions, saved approximately 60,000 hours of staff time, and enabled more than one-third of employees to use AI daily in their work.

‘Like a billionaire on acid’: Star Wars director Gareth Edwards comes out in favour of AI

infonews

industry

May 29, 2026

Film director Gareth Edwards publicly endorsed generative AI (software that creates content like images or text from descriptions) for movie-making at an Amazon event, comparing it favorably to traditional CGI (computer-generated imagery) and calling it a tool as fundamental as a camera. Edwards argued that filmmakers have no reason to avoid adopting AI since it can help with creative work and will eventually surpass CGI in quality.

What 2,000 Exposed Vibe-Coded Apps Reveal About the Limits of Most Security Stacks

infonews

securitypolicy

Adobe’s conversational AI agent is a mediocre design intern

infonews

industry

May 29, 2026

Adobe's Firefly AI Assistant is a conversational AI agent designed to automate tasks within Adobe's design software while keeping users in control of the creative process, unlike traditional AI image generators that work independently. The assistant acts as a multitasking middleman that can operate design apps on behalf of users, though early testing suggests the results are not particularly impressive despite the tool's thoughtful approach to preserving creative control.

Cybersecurity trends in SEC filings

infonews

policy

May 29, 2026

In 2023, the SEC required public companies to disclose cybersecurity risk management in their annual filings, prompting an analysis of the top 200 S&P companies' cybersecurity leadership structures. The analysis found that Chief Information Security Officers (CISOs) lead cybersecurity at over 70% of companies with an average of 23 years of experience, most commonly reporting to the Chief Information Officer, while the Audit Committee oversees cybersecurity at about 60% of companies, and the NIST Cybersecurity Framework (a set of best practices for managing cyber risks) is the most referenced security standard.

GDPR set the tone for regulatory action — and the AI fine pushback to come

infonews

policy

May 29, 2026

Big tech companies are legally challenging GDPR (General Data Protection Regulation, Europe's data protection law) fines, with nearly 40% of the €7.1 billion in fines announced over eight years either annulled or under appeal. While GDPR successfully established a global 72-hour breach notification standard (the requirement that organizations tell people within three days if their data is stolen), experts note the framework has structural weaknesses that companies exploit in court, and upcoming AI regulations may face similar challenges.

Shadow AI: The Hidden Risk Expanding Across the Enterprise

infonews

securitypolicy

Strengthening societal resilience with Rosalind Biodefense

infonews

policyindustry

Industry News

Industry News

How we contain Claude across products

Anthropic’s alliance with pope on AI harms: all in good faith or ‘Vatican-washing?’

Russia-aligned crime group Greyvibe extensively uses AI in attacks

Microsoft and security researcher’s dueling posts about cybersecurity disclosures get nasty

ChatGPT share links abused to host fake outage pages to deliver malware

ChatGPhish Vulnerability Turns ChatGPT Web Summaries Into a Phishing Surface

DNS-AID will make AI agents easier to discover, says Linux Foundation

SpaceX skeptics have added reason for concern after Musk comments diverge from IPO filing

Attackers Use LLM Agent for Post-Exploitation After Marimo CVE-2026-39987 Exploit

Dan Ives: Anthropic’s growth is 'just the tip of the spear' for AI rally

EU seeks to 'intensify' talks with U.S. on advanced cyber AI models, official tells CNBC, amid Mythos concerns

How Braintrust turns customer requests into code with Codex

Boston Children’s uses AI to unlock new diagnoses

‘Like a billionaire on acid’: Star Wars director Gareth Edwards comes out in favour of AI

What 2,000 Exposed Vibe-Coded Apps Reveal About the Limits of Most Security Stacks

Adobe’s conversational AI agent is a mediocre design intern

Cybersecurity trends in SEC filings

GDPR set the tone for regulatory action — and the AI fine pushback to come

Shadow AI: The Hidden Risk Expanding Across the Enterprise

Strengthening societal resilience with Rosalind Biodefense

How we contain Claude across products

Anthropic’s alliance with pope on AI harms: all in good faith or ‘Vatican-washing?’

Russia-aligned crime group Greyvibe extensively uses AI in attacks

Microsoft and security researcher’s dueling posts about cybersecurity disclosures get nasty

ChatGPT share links abused to host fake outage pages to deliver malware

ChatGPhish Vulnerability Turns ChatGPT Web Summaries Into a Phishing Surface

DNS-AID will make AI agents easier to discover, says Linux Foundation

SpaceX skeptics have added reason for concern after Musk comments diverge from IPO filing

Attackers Use LLM Agent for Post-Exploitation After Marimo CVE-2026-39987 Exploit

Dan Ives: Anthropic’s growth is 'just the tip of the spear' for AI rally

EU seeks to 'intensify' talks with U.S. on advanced cyber AI models, official tells CNBC, amid Mythos concerns

How Braintrust turns customer requests into code with Codex

Boston Children’s uses AI to unlock new diagnoses

‘Like a billionaire on acid’: Star Wars director Gareth Edwards comes out in favour of AI

What 2,000 Exposed Vibe-Coded Apps Reveal About the Limits of Most Security Stacks

Adobe’s conversational AI agent is a mediocre design intern

Cybersecurity trends in SEC filings

GDPR set the tone for regulatory action — and the AI fine pushback to come

Shadow AI: The Hidden Risk Expanding Across the Enterprise

Strengthening societal resilience with Rosalind Biodefense