aisecwatch.com

Real-time AI security monitoring. Tracking AI-related vulnerabilities, safety and security incidents, privacy risks, research developments, and policy changes.


Maintained by Truong (Jack) Luu, Information Systems Researcher

AI Sec Watch

The security intelligence platform for AI teams

AI security threats move fast and get buried under hype and noise. AI Sec Watch, built by an information systems security researcher, helps security teams and developers stay ahead of vulnerabilities, privacy incidents, safety research, and policy developments.

Independent research. No sponsors, no paywalls, no conflicts of interest.

Total tracked: 3,710 · Last 24 hours: 1 · Last 7 days: 1
Daily Briefing: Friday, May 8, 2026

Critical RCE Vulnerabilities in LiteLLM Proxy Server: LiteLLM, a proxy server that forwards requests to AI model APIs, disclosed three critical and high-severity flaws in versions 1.74.2 through 1.83.6. Two test endpoints allowed attackers with valid API keys to execute arbitrary code (running any commands an attacker wants) on the server by submitting malicious configurations or prompt templates without sandboxing (CVE-2026-42271, CVE-2026-42203, both critical), while a SQL injection flaw (inserting malicious code into database queries) let unauthenticated attackers read or modify stored API credentials (CVE-2026-42208, high).
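The SQL injection half of this disclosure is the classic failure mode of building queries by string concatenation. A minimal sketch of the vulnerable pattern and its parameterized fix (hypothetical schema and function names, not LiteLLM's actual code):

```python
import sqlite3

# Hypothetical credential store -- not LiteLLM's real schema.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE keys (alias TEXT, api_key TEXT)")
conn.execute("INSERT INTO keys VALUES ('prod', 'sk-secret')")

def lookup_vulnerable(alias: str):
    # Attacker-controlled input is spliced into the SQL text, so
    # alias = "x' OR '1'='1" changes the query's meaning.
    return conn.execute(
        f"SELECT api_key FROM keys WHERE alias = '{alias}'"
    ).fetchall()

def lookup_safe(alias: str):
    # Parameter binding keeps the input as data, never as SQL.
    return conn.execute(
        "SELECT api_key FROM keys WHERE alias = ?", (alias,)
    ).fetchall()

payload = "x' OR '1'='1"
assert lookup_vulnerable(payload) == [("sk-secret",)]  # injection dumps the key
assert lookup_safe(payload) == []                      # bound input matches nothing
```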


ClaudeBleed Exploit Allows Extension Hijacking in Chrome: Anthropic's Claude browser extension contains a vulnerability that allows malicious Chrome extensions to hijack it and perform unauthorized actions like exfiltrating files, sending emails, or stealing code from private repositories. The flaw stems from the extension trusting any script from claude.ai without verifying the actual caller, and while Anthropic released a partial fix in version 1.0.70 on May 6, researchers report it remains exploitable when the extension runs in privileged mode.

Latest Intel

01

Workspace agents

industry
Apr 22, 2026

Workspace agents are AI systems designed to automate repeatable workflows in your daily work by connecting to tools your team uses, rather than helping with one-off tasks. A workspace agent has three core components: a trigger (what starts it, like a schedule), a process with specialized skills (the steps it follows), and access to tools or systems (like Slack or a CRM). Unlike traditional deterministic workflows (where each step is explicitly defined and always the same), agents are probabilistic, meaning they use AI to interpret context and adjust their approach while staying within set instructions and guardrails.
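The three components described above can be sketched as a minimal data model (all names and structure are hypothetical, for illustration only):

```python
from dataclasses import dataclass, field
from typing import Callable

@dataclass
class WorkspaceAgent:
    # Trigger: what starts the agent (e.g. a cron-style schedule).
    trigger: str
    # Process: the ordered skills/steps the agent follows.
    skills: list[str] = field(default_factory=list)
    # Tools: external systems the agent may call (Slack, a CRM, ...).
    tools: dict[str, Callable[[str], str]] = field(default_factory=dict)

    def run(self, context: str) -> list[str]:
        # A deterministic workflow would hard-code each step; an agent
        # instead interprets `context` to decide how to apply its skills,
        # staying within the guardrails of its configured tools.
        results = []
        for skill in self.skills:
            for tool in self.tools.values():
                results.append(tool(f"{skill}: {context}"))
        return results

agent = WorkspaceAgent(
    trigger="every Monday 09:00",
    skills=["summarize new tickets"],
    tools={"slack": lambda msg: f"posted to #support: {msg}"},
)
```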


AI Systems Show Triple the High-Risk Vulnerabilities of Legacy Software: Penetration testing data reveals that AI and LLM systems have 32% of findings rated high-risk compared to just 13% for traditional software, with only 38% of high-risk AI issues getting resolved. Security experts attribute this gap to rapid deployment without mature controls, novel attack surfaces like prompt injection (tricking AI by hiding instructions in input), and fragmented responsibility for remediation across teams.


Model Context Protocol Emerging as Critical Security Blind Spot: Model Context Protocol (MCP, a plugin system connecting AI agents to external tools) has become a major vulnerability vector as organizations fail to scan for or monitor MCP-related risks. Recent supply chain attacks, such as the postmark-mcp npm package that exfiltrated emails from 300 organizations, demonstrate how attackers exploit widely-trusted MCP packages and hardcoded credentials in AI configurations to enable credential theft and supply chain compromises at scale.
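One failure mode above, hardcoded credentials in AI tool configurations, is straightforward to scan for. A naive sketch (hypothetical patterns, nowhere near a complete secret scanner):

```python
import json
import re

# Illustrative patterns for common credential shapes -- a real scanner
# would use entropy checks and vendor-specific token formats.
SECRET_PATTERNS = [
    re.compile(r"sk-[A-Za-z0-9]{20,}"),                  # API-key-like tokens
    re.compile(r"(?i)(?:password|secret)\s*[:=]\s*\S+"), # key=value secrets
]

def scan_mcp_config(text: str) -> list[str]:
    """Return suspicious substrings found in an MCP server config."""
    hits = []
    for pat in SECRET_PATTERNS:
        hits.extend(m.group(0) for m in pat.finditer(text))
    return hits

# Hypothetical MCP config with a hardcoded token baked into its env block.
config = json.dumps({
    "mcpServers": {
        "mail": {"command": "npx", "env": {"API_TOKEN": "sk-" + "a" * 24}}
    }
})
```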

OpenAI Blog
02

Anthropic’s most dangerous AI model just fell into the wrong hands

security
Apr 22, 2026

Anthropic's Mythos AI model, a tool designed to find security weaknesses in software, was accessed by unauthorized users through a private online forum using a contractor's credentials and basic internet research techniques. The model is capable of identifying and exploiting vulnerabilities (security flaws) in major operating systems and web browsers, which is why Anthropic warned it could be dangerous if misused.

The Verge (AI)
03

Anthropic bets on EPSS for the coming bug surge

security, industry
Apr 22, 2026

AI tools like Anthropic's Mythos can find software vulnerabilities much faster than before, creating a problem: security teams must decide which vulnerabilities to fix first among thousands of options. Anthropic recommends using EPSS (Exploit Prediction Scoring System, a machine learning model that predicts which vulnerabilities are likely to be exploited in the next 30 days) to prioritize which vulnerabilities need immediate attention, similar to how weather forecasters predict whether you'll need an umbrella.

Fix: According to Anthropic's guidance: 'Patching the KEV (CISA's Known Exploited Vulnerabilities catalog) list first, and then everything above a chosen EPSS threshold will help you turn thousands of open CVEs into a manageable queue.' EPSS scores are machine-generated, published daily for every CVE, and have been incorporated into more than 120 security vendors' products.
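The guidance above reduces to a simple sort-and-filter. A sketch of that triage logic (hypothetical CVE records; in practice KEV membership comes from CISA's catalog and EPSS scores from FIRST's daily feed):

```python
# Hypothetical findings for illustration only.
findings = [
    {"cve": "CVE-2026-0001", "kev": True,  "epss": 0.02},
    {"cve": "CVE-2026-0002", "kev": False, "epss": 0.91},
    {"cve": "CVE-2026-0003", "kev": False, "epss": 0.004},
]

def triage(findings, epss_threshold=0.10):
    # KEV entries first (known-exploited, patch regardless of score),
    # then everything whose predicted exploitation odds exceed the
    # chosen threshold, highest score first.
    kev = [f for f in findings if f["kev"]]
    likely = sorted(
        (f for f in findings if not f["kev"] and f["epss"] >= epss_threshold),
        key=lambda f: f["epss"],
        reverse=True,
    )
    return kev + likely

queue = triage(findings)
# CVE-2026-0003 falls below the threshold and drops out of the queue.
```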

CSO Online
04

Anthropic investigates report of rogue access to hack-enabling Mythos AI

security
Apr 22, 2026

Anthropic is investigating a report that unauthorized users gained access to Mythos, an AI model designed to detect cybersecurity vulnerabilities that the company has kept private because it could be misused to enable cyber-attacks. A small group of people allegedly accessed the model without permission, prompting the company to look into the incident.

The Guardian Technology
05

Cohere AI Terrarium Sandbox Flaw Enables Root Code Execution, Container Escape

security
Apr 22, 2026

Terrarium, a Python sandbox developed by Cohere AI for running untrusted code in containers, has a critical vulnerability (CVE-2026-5752, CVSS 9.3) that allows attackers to execute arbitrary code with root privileges through JavaScript prototype chain traversal (a technique where attackers manipulate how JavaScript looks up object properties to access restricted functionality). Since the project is no longer maintained, a patch is unlikely, but CERT/CC recommends several defensive measures.

Fix: CERT/CC advises the following mitigations:
- Disable features that allow users to submit code to the sandbox, if possible.
- Segment the network to limit the attack surface and prevent lateral movement.
- Deploy a Web Application Firewall to detect and block suspicious traffic, including exploit attempts.
- Monitor container activity for signs of suspicious behavior.
- Limit access to the container and its resources to authorized personnel only.
- Use a secure container orchestration tool to manage and secure containers.
- Ensure that dependencies are up to date and patched.
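Terrarium's flaw involved JavaScript prototype traversal, but the underlying idea, walking an object graph from a harmless value back to privileged functionality, applies to Python sandboxes too. A hypothetical illustration (not the actual Terrarium exploit) of why filtering banned names alone cannot contain untrusted code:

```python
def os_reachable_without_import() -> bool:
    # Start from an empty tuple: no imports, no banned keywords.
    root = ().__class__.__mro__[-1]  # <class 'object'>, the hierarchy root
    # Every loaded class descends from object; search for one whose
    # defining module exposes os-level functions via its globals
    # (e.g. os._wrap_close, defined in os.py, reaches os.system).
    for sub in root.__subclasses__():
        init = getattr(sub, "__init__", None)
        globs = getattr(init, "__globals__", None)
        if globs and "system" in globs:
            return True  # os.system is reachable with no "import os" anywhere
    return False
```

Because the escape is reachable through ordinary attribute lookups, CERT/CC's advice to disable code submission entirely is the only mitigation that closes this class of bypass.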

The Hacker News
06

Changes to GitHub Copilot Individual plans

industry
Apr 21, 2026

GitHub Copilot changed its pricing and usage limits for individual users because agentic workflows (AI agents that run long tasks automatically) consume far more computing resources than expected, with some users burning tokens (units of text processed by the AI) at much higher rates than before. The changes include pausing new individual plan signups, moving the most advanced Claude Opus 4.7 model to a more expensive $39/month tier, and switching to token-based usage limits tracked per session and per week instead of per-request charging.

Simon Willison's Weblog
07

Is Claude Code going to cost $100/month? Probably not - it's all very confusing

industry
Apr 21, 2026

Anthropic briefly updated its pricing page to move Claude Code (an AI coding agent feature) from the $20/month Pro plan to exclusive availability on $100-200/month Max plans, but quickly reverted the change after public backlash. Anthropic's Head of Growth claimed this was a test affecting only ~2% of new signups, though the change was widely visible and caused significant concern about affordability and lack of transparency.

Simon Willison's Weblog
08

Introducing OpenAI Privacy Filter

security, privacy
Apr 21, 2026

OpenAI released Privacy Filter, an open-weight AI model designed to detect and remove personally identifiable information (PII, such as names, addresses, phone numbers, and account details) from text. The model uses context-aware language understanding rather than simple pattern matching, can run locally on a user's device to keep sensitive data from being sent to servers, and achieves state-of-the-art performance on privacy detection benchmarks. Developers can use, fine-tune, and integrate Privacy Filter into their own applications to build stronger privacy protections into AI systems.
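For contrast with Privacy Filter's context-aware approach, the pattern-matching baseline it improves on looks roughly like this (hypothetical patterns; regexes catch rigidly formatted identifiers but miss names and context-dependent details entirely):

```python
import re

# Naive pattern-based redaction. It handles fixed-format identifiers
# but cannot recognize a personal name or infer that a phrase like
# "my badge number is 4471" is sensitive -- the gap a context-aware
# model is meant to close.
PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+"),
    "PHONE": re.compile(r"\b\d{3}[-.]\d{3}[-.]\d{4}\b"),
}

def redact(text: str) -> str:
    for label, pat in PATTERNS.items():
        text = pat.sub(f"[{label}]", text)
    return text

redact("Reach me at jane@example.com or 555-867-5309.")
# -> "Reach me at [EMAIL] or [PHONE]."
```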

OpenAI Blog
09

SpaceX cuts a deal to maybe buy Cursor for $60 billion

industry
Apr 21, 2026

SpaceX has announced a deal to either acquire Cursor, an AI-powered coding platform, for $60 billion or pay a $10 billion fee instead. This move aims to help xAI compete with other companies in the AI coding space, as major tech firms like Google and OpenAI are also investing heavily in their own AI programming tools.

The Verge (AI)
10

CVE-2026-40933: Flowise is a drag & drop user interface to build a customized large language model flow. Prior to 3.1.0, due to unsafe s

security
Apr 21, 2026

Flowise, a tool with a visual interface for building customized AI flows, has a vulnerability before version 3.1.0 where authenticated attackers can execute arbitrary commands on the server. The flaw exists in the MCP (model context protocol) adapter's handling of stdio commands, where input sanitization checks fail to prevent attackers from combining safe commands like "npx" with code execution arguments to run malicious commands on the underlying operating system.

Fix: Update Flowise to version 3.1.0 or later, where this vulnerability is fixed.
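The bypass described above, smuggling code-execution arguments past a command check, is a recurring pattern in tools that spawn stdio processes. A hedged sketch (hypothetical validator, not Flowise's actual code) of why validating the executable name alone is insufficient:

```python
import shlex

SAFE_COMMANDS = {"npx", "node"}

def naive_check(command_line: str) -> bool:
    # Broken: only the executable name is validated, so an allow-listed
    # launcher can still be handed arguments that run arbitrary code.
    return shlex.split(command_line)[0] in SAFE_COMMANDS

def stricter_check(command_line: str, allowed_args: set[str]) -> bool:
    # Validate every token: the executable must be allow-listed AND each
    # argument must come from a known-safe set -- no free-form flags
    # or package names.
    exe, *args = shlex.split(command_line)
    return exe in SAFE_COMMANDS and all(a in allowed_args for a in args)

# "npx --yes <package>" auto-installs and runs an arbitrary npm package,
# so a name-only check turns the launcher into a code-execution primitive.
payload = "npx --yes attacker-package"
```

With `naive_check`, the payload passes; `stricter_check` rejects it unless every argument is explicitly approved.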

NVD/CVE Database