aisecwatch.com
DashboardVulnerabilitiesNewsResearchArchiveStatsDatasetFor devs
Subscribe
aisecwatch.com

Real-time AI security monitoring. Tracking AI-related vulnerabilities, safety and security incidents, privacy risks, research developments, and policy changes.

Navigation

VulnerabilitiesNewsResearchDigest ArchiveNewsletter ArchiveSubscribeData SourcesStatisticsDatasetAPIIntegrationsWidgetRSS Feed

Maintained by

Truong (Jack) Luu

Information Systems Researcher

AI Sec Watch

The security intelligence platform for AI teams

AI security threats move fast and get buried under hype and noise. Built by an Information Systems Security researcher to help security teams and developers stay ahead of vulnerabilities, privacy incidents, safety research, and policy developments.

Independent research. No sponsors, no paywalls, no conflicts of interest.

[TOTAL_TRACKED]
3,710
[LAST_24H]
1
[LAST_7D]
1
Daily BriefingMonday, May 18, 2026

No new AI/LLM security issues were identified today.

Latest Intel

page 139/371
VIEW ALL
01

GHSA-pf93-j98v-25pv: ha-mcp has XSS via Unescaped HTML in OAuth Consent Form

security
Mar 12, 2026

The ha-mcp OAuth consent form has a cross-site scripting (XSS) vulnerability, where user-controlled data is inserted into HTML without escaping (the process of converting special characters so they display as text rather than execute as code). An attacker could register a malicious application and trick the server operator into visiting a crafted authorization URL, allowing the attacker to run JavaScript in the operator's browser and steal sensitive tokens. This only affects users running the beta OAuth mode, not the standard setup.

Fix: Upgrade to version 7.0.0

GitHub Advisory Database
02

Detecting and analyzing prompt abuse in AI tools

securitysafety
Mar 12, 2026

Prompt abuse occurs when attackers craft inputs to make AI systems perform unintended actions, such as revealing sensitive information or bypassing safety rules. Three main types exist: direct prompt override (forcing an AI to ignore its instructions), extractive abuse (extracting private data the user shouldn't access), and indirect prompt injection (hidden malicious instructions in documents or web pages that the AI interprets as legitimate input). The article emphasizes that detecting prompt abuse is difficult because it uses natural language manipulation that leaves no obvious trace, and without proper logging, attempts to access sensitive information can go unnoticed.

Fix: The source mentions that organizations can use an 'AI assistant prompt abuse detection playbook' and 'Microsoft security tools' to detect, investigate, and respond to prompt abuse by turning logged interactions into actionable insights. However, the source text does not provide specific details about what these tools are, how to implement them, or concrete technical steps for detection and mitigation. The full implementation details are referenced but not included in the provided content.

Microsoft Security Blog
03

Anthropic doesn’t trust the Pentagon, and neither should you

policysecurity
Mar 12, 2026

Anthropic, maker of the AI assistant Claude, is in a legal dispute with the Pentagon after being designated a supply chain risk (a company that poses a security threat to government operations). The core issue involves disagreement over whether the U.S. government can be trusted to follow the law when using AI for surveillance, given a long history of government lawyers interpreting surveillance laws in ways that expand government monitoring far beyond what the plain language of those laws seems to allow.

The Verge (AI)
04

Bespoke AI models are the next big thing in filmmaking

industry
Mar 12, 2026

Current popular AI video models like Sora, Vevo, and Runway aren't very effective for making films and TV shows, despite hype suggesting AI could create entire productions automatically. AI companies are now developing custom models designed specifically for filmmakers' creative needs while trying to avoid copyright issues.

The Verge (AI)
05

Anthropic’s Claude would ‘pollute’ defense supply chain: Pentagon CTO

policysecurity
Mar 12, 2026

The U.S. Department of Defense designated Anthropic's Claude AI as a supply chain risk, citing concerns that the company's built-in policy preferences (established through its constitutional training approach) could compromise military effectiveness and security. The Pentagon requires defense contractors to certify they don't use Claude, though the DOD acknowledged that transitioning away from the technology will take time.

CNBC Technology
06

Adversarial Semantic and Label Perturbation Attack for Pedestrian Attribute Recognition

researchsecurity
Mar 12, 2026

This research paper explores vulnerabilities in Pedestrian Attribute Recognition (PAR), a computer vision task that identifies characteristics of people in images using AI models. The authors developed both adversarial attacks (methods to fool the system with manipulated images) and a defense strategy called semantic offset defense to protect PAR systems, testing their approach on multiple datasets.

Fix: The paper proposes a semantic offset defense strategy to suppress the influence of adversarial attacks on pedestrian attribute recognition systems. Source code is made available at https://github.com/Event-AHU/OpenPAR.

IEEE Xplore (Security & AI Journals)
07

Toward Generalizable Deepfake Detection via Forgery-Aware Audio–Visual Adaptation: A Variational Bayesian Approach

researchsafety
Mar 12, 2026

This research paper presents a new method called FoVB (Forgery-aware Audio-Visual Adaptation with Variational Bayes) to detect deepfakes (AI-generated fake videos that manipulate both audio and video). The method works by analyzing the relationship between audio and video to find mismatches, such as when lip movements don't match the sound, which are telltale signs of deepfakes.

IEEE Xplore (Security & AI Journals)
08

Microsoft’s Copilot Health can connect to your medical records and wearables

safetyprivacy
Mar 12, 2026

Microsoft launched Copilot Health, a feature that lets users ask an AI assistant questions about their medical records, lab results, and data from wearables (devices that track health metrics like heart rate) in a dedicated secure space within Copilot. The feature is rolling out gradually through a waitlist and is designed to help users understand their health data rather than replace doctors or provide medical diagnoses.

The Verge (AI)
09

Google is using old news reports and AI to predict flash floods

researchindustry
Mar 12, 2026

Google developed a flash flood prediction system by using Gemini (an LLM, or large language model) to analyze 5 million news articles and extract data about 2.6 million floods, creating a dataset called Groundsource. This dataset trained a machine learning model (LSTM, a type of neural network) that now provides flood risk forecasts for urban areas in 150 countries on Google's Flood Hub platform, though it has limitations like lower resolution than traditional weather services.

TechCrunch
10

You can now ask Google Maps ‘complex, real-world questions’ — and Gemini will answer

industry
Mar 12, 2026

Google is adding an AI-powered feature called "Ask Maps" to Google Maps that uses Gemini (Google's AI assistant) to answer complex, specific questions about locations. Previously, Google Maps couldn't handle very detailed queries like "where can I charge my phone without waiting in line," but now Gemini can provide personalized, detailed answers to these kinds of questions.

The Verge (AI)
Prev1...137138139140141...371Next