AI Sec Watch: A Security Intelligence Platform for AI Systems

Luu, T.J.

On the Equilibrium Between Feasible Zone and Uncertain Model in Safe Exploration

info, research, Peer-Reviewed

research

Mar 3, 2026

This research addresses how to explore environments safely using reinforcement learning (RL, a type of AI training where a system learns by trial and error) without causing damage or violating safety rules. The paper introduces safe equilibrium exploration (SEE), a method that balances two competing goals: expanding the area where exploration is allowed (the feasible zone) and building a more accurate model of how the environment works. The authors show that the two objectives improve each other and can reach an optimal balance without any safety violations.

IEEE Xplore (Security & AI Journals)

AIRPNet: Adaptive Image Restoration With Privacy Protection in Steganographic Domain

info, research, Peer-Reviewed

research

Outlier-Aware Contrastive Learning

info, research, Peer-Reviewed

research

AI Agent Overload: How to Solve the Workload Identity Crisis

info, news

security

Mar 3, 2026

Organizations are facing challenges managing workload identities (the digital credentials and permissions that allow different software systems and applications to authenticate and communicate with each other), and the problem is becoming harder to handle as systems grow more complex. The source indicates this is a widespread issue but does not provide specific technical details about the nature of the crisis or its consequences.

On Moltbook

info, news

safety, industry

OpenAI changes deal with US military after backlash

info, news

policy, safety

OpenAI amends Pentagon deal as Sam Altman admits it looks ‘sloppy’

info, news

policy, security

AI Agents: The Next Wave Identity Dark Matter - Powerful, Invisible, and Unmanaged

medium, news

security, policy

Fooling AI Agents: Web-Based Indirect Prompt Injection Observed in the Wild

high, news

security

Mar 3, 2026

Web-based indirect prompt injection (IDPI) is an attack where adversaries hide malicious instructions in website content that AI systems later read and unknowingly execute, such as through webpage summarization or content analysis features. Researchers found real-world examples of these attacks being used for ad fraud evasion, phishing promotion, data destruction, unauthorized transactions, and information theft, showing that IDPI is no longer just theoretical but actively weaponized. Unlike direct prompt injection (where attackers directly submit malicious input to an AI), IDPI exploits the normal behavior of AI systems processing benign-looking web content.

Vulnerability in MS-Agent AI Framework Can Allow Full System Compromise

high, news

security

Mar 3, 2026

A vulnerability in the MS-Agent AI Framework allows attackers to compromise an entire system by exploiting the Shell tool through improper input sanitization (failure to clean and validate user input). Attackers can use this flaw to modify system files and steal data.
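The summary does not describe the flaw or its fix in detail, so the following is only a generic sketch of what validating input to a shell-style tool can look like. The allowlist, the forbidden-character set, and the function name `parse_tool_command` are all assumptions made for illustration; none of them come from MS-Agent.

```python
import shlex

# Hypothetical allowlist: the only programs this tool is willing to launch.
ALLOWED_PROGRAMS = {"ls", "cat", "grep"}

# Characters that let one command chain, substitute, or redirect into another.
FORBIDDEN_CHARS = set(";&|`$<>\n")


def parse_tool_command(raw: str) -> list[str]:
    """Validate a requested command and return it as an argument vector.

    Raises ValueError when the request contains shell metacharacters or
    names a program that is not on the allowlist.
    """
    if any(ch in FORBIDDEN_CHARS for ch in raw):
        raise ValueError("shell metacharacters are not allowed")
    argv = shlex.split(raw)  # also raises ValueError on unbalanced quotes
    if not argv or argv[0] not in ALLOWED_PROGRAMS:
        raise ValueError("program is not on the allowlist")
    return argv


# The returned list is meant to be passed to subprocess.run(argv) with the
# default shell=False, so that no shell ever interprets the string.
print(parse_tool_command("ls -la /tmp"))
```

Character filtering alone is rarely sufficient; in this sketch it is paired with an allowlist and with argument-vector execution so that each layer covers gaps in the others.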

Iran war heralds era of AI-powered bombing quicker than ‘speed of thought’

info, news

safety, policy

Das gehört in Ihr Security-Toolset (This Belongs in Your Security Toolset)

info, news

security

Mar 2, 2026

This article describes 13 essential security tools that companies need to protect against cyber threats, including XDR (extended detection and response, an AI-powered system that identifies threats across networks and devices), MFA (multifactor authentication, requiring users to verify their identity multiple ways), NAC (network access control, which checks devices before allowing network access), and DLP (data loss prevention, which monitors for sensitive data being sent outside the company). The article explains why each tool is important but does not discuss any specific fixes, patches, or solutions to existing security problems.

OpenAI's Altman admits defense deal was 'opportunistic and sloppy' amid backlash

info, news

policy

Mar 2, 2026

OpenAI CEO Sam Altman acknowledged that the company rushed into a deal with the U.S. Department of Defense, calling it "opportunistic and sloppy," after public backlash over the timing and terms. The company plans to amend the contract to add safeguards, including language stating that "the AI system shall not be intentionally used for domestic surveillance of U.S. persons and nationals," and will work with the Pentagon on technical protections for their AI tools.

GHSA-6g25-pc82-vfwp: OpenClaw: macOS beta onboarding exposed PKCE verifier via OAuth state

medium, vulnerability

security

Mar 2, 2026

The OpenClaw macOS beta onboarding flow exposed a PKCE code_verifier (a secret that ties an OAuth login request to the later token exchange; OAuth is a standard for delegated authorization) by placing it in the OAuth state parameter, which travels in URLs and can therefore be seen there. The vulnerability affected only the macOS beta app's login process, not other parts of the software.
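As a minimal sketch of the separation PKCE expects, the snippet below generates the `code_verifier` and the `state` value independently and puts only the SHA-256 challenge and the state into the authorization URL. The endpoint `https://auth.example.com/authorize`, the client ID, and the function name `start_login` are hypothetical; this is not OpenClaw's code.

```python
import base64
import hashlib
import secrets
from urllib.parse import parse_qs, urlencode, urlparse

AUTHORIZE_URL = "https://auth.example.com/authorize"  # hypothetical endpoint


def start_login(client_id: str, redirect_uri: str) -> tuple[str, str, str]:
    """Return (authorization_url, state, code_verifier).

    The verifier stays on the client; only its S256 challenge and an
    unrelated random state value appear in the URL.
    """
    code_verifier = secrets.token_urlsafe(64)  # 86 URL-safe characters
    digest = hashlib.sha256(code_verifier.encode("ascii")).digest()
    code_challenge = base64.urlsafe_b64encode(digest).rstrip(b"=").decode("ascii")
    state = secrets.token_urlsafe(32)  # CSRF token, independent of the verifier
    query = urlencode({
        "response_type": "code",
        "client_id": client_id,
        "redirect_uri": redirect_uri,
        "state": state,
        "code_challenge": code_challenge,
        "code_challenge_method": "S256",
    })
    return f"{AUTHORIZE_URL}?{query}", state, code_verifier


url, state, verifier = start_login("demo-client", "http://127.0.0.1:8123/callback")
params = parse_qs(urlparse(url).query)
print(verifier in url)  # the secret itself never appears in the URL
```

The verifier is sent only later, in the back-channel token request, which is why leaking it through a URL parameter undermines the protection PKCE is meant to add.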

GHSA-5847-rm3g-23mw: OpenClaw has hook auth rate limiter bypass via IPv4-mapped IPv6 client key variants

medium, vulnerability

security

Mar 2, 2026

OpenClaw had a security flaw in its hook authentication rate limiter (the system that limits how many times someone can try to log in). The limiter counted a client's plain IPv4 address (such as 1.2.3.4) and its IPv4-mapped IPv6 form (the same address written in IPv6 notation, such as ::ffff:1.2.3.4) as two separate clients. By alternating between both forms, an attacker could double the brute-force budget from 20 to 40 attempts per minute.
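The bypass works because 20 attempts are allowed per key and one client can present two keys, giving 2 × 20 = 40. A common remedy is to canonicalize the address before using it as the limiter key. The sketch below does this with Python's standard `ipaddress` module; the function name `client_key` and the counter are illustrative assumptions, not OpenClaw's implementation.

```python
import ipaddress
from collections import defaultdict


def client_key(remote_addr: str) -> str:
    """Return one canonical rate-limit key per client address."""
    addr = ipaddress.ip_address(remote_addr)
    # An IPv4-mapped IPv6 address (::ffff:a.b.c.d) is collapsed to a.b.c.d.
    if isinstance(addr, ipaddress.IPv6Address) and addr.ipv4_mapped is not None:
        addr = addr.ipv4_mapped
    return str(addr)


# Three spellings of the same client end up in a single bucket.
attempts = defaultdict(int)
for seen in ("1.2.3.4", "::ffff:1.2.3.4", "::ffff:102:304"):
    attempts[client_key(seen)] += 1

print(dict(attempts))
```

Parsing the address rather than comparing strings also folds alternative textual spellings, such as the hexadecimal form `::ffff:102:304`, into the same key.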

CVE-2026-1336: The AI ChatBot with ChatGPT and Content Generator by AYS plugin for WordPress is vulnerable to unauthorized access and m

medium, vulnerability

security

Mar 2, 2026

CVE-2026-1336

A WordPress plugin called 'AI ChatBot with ChatGPT and Content Generator by AYS' has a security flaw in versions up to and including 2.7.5: missing authorization checks (verification that a user has permission to perform an action) allow attackers without accounts to view, modify, or delete the plugin's ChatGPT API key (a secret code needed to use OpenAI's service). The vulnerability was partially fixed in version 2.7.5 and fully fixed in version 2.7.6.
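To illustrate what a missing authorization check means in practice, here is a language-neutral sketch in Python (the plugin itself is a WordPress plugin, and this is not its code). The `User` class, the `settings` store, and `update_api_key` are hypothetical; the capability name `manage_options` is borrowed from WordPress purely as an example of an administrator-level permission.

```python
from dataclasses import dataclass, field
from typing import Optional


class Forbidden(Exception):
    """Raised when the caller lacks permission for the requested action."""


@dataclass
class User:
    name: str
    capabilities: set = field(default_factory=set)


# Hypothetical settings store holding the secret the handler protects.
settings = {"api_key": "sk-placeholder"}


def update_api_key(user: Optional[User], new_key: str) -> None:
    """Change the stored API key, but only for an authorized caller."""
    # The check below is the step a vulnerable handler omits: without it,
    # an anonymous request (user is None) reaches the write on the last line.
    if user is None or "manage_options" not in user.capabilities:
        raise Forbidden("caller may not change the API key")
    settings["api_key"] = new_key


admin = User("admin", {"manage_options"})
update_api_key(admin, "sk-rotated")
print(settings["api_key"])
```

The same guard would be needed on every handler that reads or deletes the key, since the advisory lists viewing, modifying, and deleting as separately reachable actions.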

CyberStrikeAI tool adopted by hackers for AI-powered attacks

high, news

security

Mar 2, 2026

Hackers are using CyberStrikeAI, an open-source AI security testing platform, to automate attacks against network devices like firewalls. The tool combines over 100 security utilities with an AI decision engine (compatible with GPT, Claude, and DeepSeek models) to automatically scan networks, find vulnerabilities, and execute attacks with minimal hacker skill required. Researchers warn this represents a growing threat as adversaries adopt AI-powered orchestration engines (systems that coordinate multiple tools automatically) to target exposed network equipment.

ChatGPT uninstalls surged by 295% after DoD deal

info, news

policy

Mar 2, 2026

ChatGPT's mobile app uninstalls surged 295% after OpenAI announced a partnership with the U.S. Department of Defense. Downloads of competitor Anthropic's Claude app jumped 37-51% after Anthropic publicly declined a similar defense partnership over concerns about AI being used for surveillance and autonomous weapons. The shift in user preference was reflected in app store rankings, with Claude reaching the number one position and ChatGPT receiving a sharp increase in negative reviews.

CVE-2026-22719: Broadcom VMware Aria Operations Command Injection Vulnerability

info, vulnerability

security

Mar 2, 2026

CVE-2026-22719 (Actively Exploited)

CVE-2026-21385: Qualcomm Multiple Chipsets Memory Corruption Vulnerability

info, vulnerability

security

Mar 2, 2026

CVE-2026-21385 (Actively Exploited)
