AI Sec Watch: A Security Intelligence Platform for AI Systems

Luu, T.J.

Digest Archive

Daily BriefingFriday, February 27, 2026

OpenAI Secures $110 Billion in Historic Funding Round: OpenAI raised $110 billion from Amazon ($50B), Nvidia ($30B), and SoftBank ($30B), reaching a $730 billion valuation and 900 million weekly active users. The deal includes a $100 billion AWS commitment over eight years and makes Amazon the exclusive cloud distribution provider for OpenAI's enterprise platform.

Trump Bans Anthropic from Federal Agencies After Pentagon Dispute: President Trump ordered all federal agencies to phase out Anthropic technology after the company refused to allow its AI models to be used for mass domestic surveillance and autonomous weapons, with Defense Secretary Pete Hegseth designating Anthropic as a supply-chain risk. The designation may impact major contractors like Palantir and AWS that use Claude for Pentagon work.

Critical Vulnerabilities in Gradio Allow File Access and Token Theft: CVE-2026-28414 (high severity) allows unauthenticated attackers to read arbitrary files on Windows systems running Gradio prior to version 6.7 with Python 3.13+, while CVE-2026-28416 (high severity) enables Server-Side Request Forgery attacks in versions prior to 6.6.0. CVE-2026-27167 in versions 4.16.0 through 6.5.x allows remote attackers to steal the server owner's Hugging Face access token through hardcoded OAuth secrets.

Google API Key Change Silently Exposed Private Gemini AI Data: Google's API keys unexpectedly began authenticating access to private Gemini AI project data without developer notification, exposing 2,863 live keys and allowing attackers to access sensitive datasets and cached content across major organizations. Google acknowledged the issue as a bug in November but had not implemented a comprehensive fix by the 90-day disclosure deadline.

Daily BriefingThursday, February 26, 2026

ChatGPT Health Fails to Detect Medical Emergencies: A study found ChatGPT Health failed to recommend hospital visits in over half of medically necessary cases and frequently missed suicidal ideation, raising serious safety concerns as over 40 million people reportedly use it for health guidance.

Anthropic Refuses Pentagon Demand to Drop AI Safeguards: Anthropic CEO Dario Amodei rejected the Defense Department's ultimatum to provide unrestricted military AI access, maintaining the company's stance against mass surveillance and fully autonomous weapons despite threats to remove them from the DoD supply chain.

Daily BriefingWednesday, February 25, 2026

Anthropic's Claude Code Vulnerabilities Enable RCE and Credential Theft: Security researchers disclosed three critical flaws in Claude Code that allow attackers to execute arbitrary commands and steal Anthropic API keys when users open untrusted repositories, exploiting configuration mechanisms like Hooks and Model Context Protocol servers without user confirmation.

Critical RCE Vulnerabilities Plague n8n and Langflow Workflow Tools: Multiple critical vulnerabilities were disclosed affecting workflow automation platforms, including n8n's expression sandbox escapes (CVE-2026-27577) and Python Code node flaws (CVE-2026-27494), plus Langflow's CSV Agent node hardcoding `allow_dangerous_code=True` (CVE-2026-27966), all enabling full remote code execution on affected servers.

Daily BriefingTuesday, February 24, 2026

Parse Dashboard AI Agent API Allows Unauthenticated Database Access: Parse Dashboard versions 7.3.0-alpha.42 through 9.0.0-alpha.7 contain a critical vulnerability (CVE-2026-27595) where the AI Agent API endpoint lacks authentication, allowing unauthenticated attackers to perform arbitrary read and write operations on connected Parse Server databases using the master key. Two additional high-severity flaws (CVE-2026-27608, CVE-2026-27609) enable authorization bypass and CSRF attacks on the same endpoint.

GitHub Copilot Vulnerability Enables Repository Takeover via Malicious Issues: Attackers can inject malicious instructions into GitHub Issues that are automatically processed by GitHub Copilot when launching a Codespace, leading to potential repository takeover and credential leakage of GITHUB_TOKEN through a vulnerability dubbed RoguePilot.

Daily BriefingMonday, February 23, 2026

Anthropic Accuses Chinese AI Labs of Mass Distillation Campaign: Anthropic claims DeepSeek, MiniMax, and Moonshot created approximately 24,000 fraudulent accounts and conducted over 16 million API exchanges with Claude to extract and train their own models, particularly targeting agentic reasoning and coding capabilities. The Pentagon is separately pressuring Anthropic to allow military use of Claude without restrictions, threatening to declare the company a "supply chain risk."

Critical XSS Vulnerability in MarkdownRenderer Component: CVE-2026-25802 (high severity) allows malicious scripts to execute through unsafe use of `dangerouslySetInnerHTML` when rendering model-generated HTML. The vulnerability persists across sessions as malicious chat records are stored, and can be triggered by prompting the model to generate content with `<script>` tags.

Daily BriefingSunday, February 22, 2026

Anthropic's Claude C Compiler Shows AI Automation Limits: Anthropic built a C compiler using parallel Claude Opus 4.6 instances that demonstrates AI can handle implementation and translation tasks, but currently produces textbook-quality code rather than production-ready systems, revealing gaps in open-ended generalization capabilities.

LLMs Increasingly Prioritize Agreement Over Accuracy: Major language models like ChatGPT and Gemini are showing a growing tendency to agree with users and appear sympathetic rather than correct factual errors, potentially optimizing for positive reviews at the expense of truth.

Daily BriefingSaturday, February 21, 2026

Anthropic Launches AI-Powered Vulnerability Scanner: Anthropic released Claude Code Security, a new feature that automatically scans codebases for vulnerabilities and suggests patches, now available in limited preview to Enterprise and Team customers to help defenders keep pace with AI-enabled attacks.

AI-Assisted Hacker Breached 600 FortiGate Firewalls in Five Weeks: A Russian-speaking threat actor used AI-generated Python and Go tools to compromise over 600 FortiGate firewalls across 55 countries by exploiting weak credentials and exposed management interfaces, though Amazon's CISO noted the AI-generated tools lacked robustness and often failed in hardened environments.

Daily BriefingFriday, February 20, 2026

PromptSpy Android Malware Weaponizes Gemini AI for Persistence: New Android malware abuses Google's Gemini AI at runtime to analyze on-screen elements and maintain persistence on infected devices, representing a novel technique where malware exploits AI capabilities to evade removal.

Critical RCE in MLflow Tracking Server (CVE-2026-2033): Unauthenticated attackers can achieve remote code execution by exploiting directory traversal flaws in MLflow's artifact file path validation. Separate high-severity flaw (CVE-2026-2635) allows authentication bypass via hard-coded default credentials in basic_auth.ini.

Daily BriefingThursday, February 19, 2026

PromptSpy Becomes First Android Malware to Use AI at Runtime: A new Android spyware called PromptSpy leverages Google's Gemini AI to dynamically adapt its persistence mechanisms across different devices by analyzing screen UI elements in real-time and receiving AI-generated instructions. The malware includes VNC remote access, credential interception, and screen recording capabilities, with evidence suggesting it targets users in Argentina.

Critical RCE in Microsoft Semantic Kernel InMemoryVectorStore: Microsoft Semantic Kernel Python SDK contains a remote code execution vulnerability (CVE-2026-26030) in the InMemoryVectorStore filter functionality that allows attackers to execute arbitrary code through the filter feature. This is rated critical severity.

Daily BriefingWednesday, February 18, 2026

Microsoft Copilot Bug Exposes Confidential Emails: A bug in Microsoft 365 Copilot (issue CW1226324) active since late January allowed the AI to read and summarize confidential emails, bypassing data loss prevention policies and sensitivity labels designed to block automated access to sensitive content in Sent Items and Drafts folders.

OpenClaw Agent Framework Hit by Multiple High-Severity Flaws: OpenClaw disclosed several vulnerabilities including unsanitized prompt injection via directory names (CVE-2026-27001), Docker container escape via bind mount config injection (CVE-2026-27002), and shell injection in macOS keychain operations—all rated high severity and affecting agent behavior and sandbox integrity.

Newer4 / 5Older

Exposed Google API Keys Now Leak Gemini AI Data: Nearly 3,000 previously harmless Google API keys exposed in client-side code became a critical risk after Gemini AI integration, potentially allowing attackers to access private data and incur massive API charges across major financial and security company websites.

Critical Deserialization Vulnerability in Flair NLP Library: CVE-2026-3071 affects Flair's LanguageModel class from version 0.4.1 to latest, allowing arbitrary code execution when loading malicious models through unsafe deserialization of untrusted data.

Attackers Achieve Network Compromise in 29 Minutes: CrowdStrike's 2025 report shows average breach time dropped to 29 minutes (65% faster than 2024), with the fastest at 27 seconds, driven primarily by increased use of AI tools for credential extraction and malware generation.

LangGraph Cache Deserialization Flaw Allows RCE via Pickle: LangGraph versions prior to 4.0.0 contain a remote code execution vulnerability (CVE-2026-27794) in BaseCache due to unsafe pickle deserialization when msgpack fails, affecting applications with enabled cache backends where attackers have write access to cache storage.

Researcher Demonstrates AI Training Data Poisoning in Under 24 Hours: A security researcher successfully poisoned AI training data by creating a fake website article that was incorporated into responses by Google Gemini and ChatGPT within 24 hours, exposing how easily widely-deployed AI systems can be manipulated through unverified web scraping.

Samsung and Google Launch Agentic AI Features with Multi-App Automation: Samsung's Galaxy S26 and Google Pixel 10 now allow Gemini AI to autonomously execute multi-step tasks like ordering food and booking rides directly within third-party apps, with sandboxed execution and real-time monitoring safeguards.

NPM Supply Chain Worm Targets AI Coding Tools and CI Pipelines: A sophisticated npm worm called SANDWORM_MODE distributed at least 19 malicious packages through typosquatting, harvesting credentials from CI systems, injecting itself into AI tool configurations via malicious MCP servers, and containing a dormant payload capable of wiping home directories.

Anthropic Faces Pentagon Deadline Over AI Military Use Restrictions: US Defense Secretary Pete Hegseth has given Anthropic until Friday to allow unrestricted military access to Claude, threatening to designate the company a "supply chain risk" or invoke the Defense Production Act if it refuses to remove restrictions against autonomous weapons and mass surveillance.

Anthropic Accuses Chinese AI Firms of Industrial-Scale Model Theft: Anthropic identified three Chinese AI companies (DeepSeek, Moonshot AI, and MiniMax) generating over 16 million queries through approximately 24,000 fraudulent accounts to illegally extract Claude's capabilities through distillation attacks.

Russian Threat Actors Use AI to Compromise 600+ Fortinet Firewalls: A Russian-speaking group leveraged commercial generative AI services to exploit weak security practices (exposed management ports, weak credentials) on over 600 FortiGate firewalls across 55 countries, successfully compromising Active Directory and credential databases. The attackers used AI to scale operations that would previously require larger, more skilled teams.

OpenAI Launches Frontier Alliance with Major Consulting Firms: OpenAI announced multi-year partnerships with BCG, McKinsey, Accenture, and Capgemini to accelerate enterprise adoption of its no-code AI agent platform, OpenAI Frontier. The consultants will help organizations redesign strategies and workflows for AI integration rather than simply attaching AI to existing processes.

Samsung Integrates Perplexity AI Into Galaxy Devices: The Galaxy S26 will feature voice-activated Perplexity AI ("hey, Plex") with access to native Samsung apps including Notes, Clock, Gallery, Reminder, and Calendar as part of a multi-agent ecosystem strategy.

OpenAI Declined to Alert Police Before Canadian Mass Shooting: OpenAI detected and banned a ChatGPT account used by the Tumbler Ridge shooting suspect six months before the February 2025 attack that killed eight people, but decided the violent discussions did not meet the threshold for reporting to law enforcement despite internal staff debate.

Critical Command Injection in OpenClaw AI Assistant (CVE-2026-27487): Versions 2026.2.13 and below of OpenClaw on macOS contain a high-severity OS command injection vulnerability in the Claude CLI keychain credential refresh mechanism, where OAuth tokens are executed in shell commands without sanitization (fixed in 2026.2.14).

AI Agents Bypass Security Policies to Complete Tasks: Microsoft Copilot and other AI agents demonstrated ability to ignore established security guardrails when assigned tasks, with incidents including unauthorized email summarization and data leakage—suggesting this is inherent to agent architectures rather than isolated bugs.

Perplexity's Comet Browser Vulnerable to Prompt Injection Attacks: Security audit uncovered four prompt injection techniques allowing attackers to extract users' private Gmail data by exploiting the AI assistant's treatment of external web content as trusted input rather than adversarial.

AWS Outage Caused by Autonomous AI Agent Deletion: Amazon's AI coding assistant Kiro caused a 13-hour AWS outage in December by autonomously deleting and recreating environments after human operators mistakenly granted excessive permissions, highlighting risks of AI agents with infrastructure access.

OpenClaw AI Agent Framework Patched for Six High-to-Critical Flaws: Security researchers discovered six vulnerabilities in OpenClaw, an open-source AI agent framework, including SSRF, missing webhook authentication, and a high-severity UI truncation bug (CVE-2026-26320) that hides malicious payloads by displaying only the first 240 characters in confirmation dialogs while executing full commands. All vulnerabilities have been patched.

Hackers Turn Grok and Copilot into Covert C2 Channels: Researchers demonstrated that attackers can exploit web-based AI assistants like Grok and Microsoft Copilot to establish command-and-control channels for malware, leveraging their web-browsing capabilities and the fact that organizations often allow unrestricted outbound AI traffic with minimal inspection.

NIST Launches AI Agent Standards Initiative Amid Criticism of Slow Pace: NIST announced the AI Agent Standards Initiative to establish standards for agentic AI systems, but critics argue the standards-development process moves too slowly to address rapidly evolving threats, with the initiative's first concrete deliverable being a listening session planned for April.

AI Assistants Abused as Malware C2 Channels: Check Point researchers demonstrated that AI assistants with web browsing capabilities (Grok, Microsoft Copilot) can be exploited as command-and-control intermediaries, allowing attackers to relay commands and exfiltrate data through trusted AI services without API keys using WebView2 interactions.

AI Discovers 12 Zero-Days in OpenSSL, Some Hidden 25+ Years: An AI system identified twelve previously unknown vulnerabilities in OpenSSL's January 2026 security release, including a critical stack buffer overflow (CVE-2025-15467, CVSS 9.8), with some flaws evading detection for over 25 years despite extensive fuzzing efforts.