AI Sec Watch: A Security Intelligence Platform for AI Systems

Luu, T.J.

Attackers can turn AI agent guardrails into denial-of-service weapons

infonewsLLM-Specific

securityresearch

Source: CSO OnlineJune 15, 2026

Summary

Researchers discovered that attackers can exploit AI agent guardrails (safety systems that check AI behavior) by inserting malicious content into documents, causing the security mechanisms to enter extended thinking loops that dramatically slow down or crash shared AI systems. This reasoning-extension DoS (denial-of-service, a type of attack that makes systems unavailable) attack targets the safety layer itself rather than trying to jailbreak the AI model, and it works across multiple AI frameworks and different LLM families. Unlike traditional attacks that try to produce unsafe outputs, this technique compromises availability by exhausting computational resources, with some systems experiencing slowdowns of up to 148 times normal speed.