AI Sec Watch: A Security Intelligence Platform for AI Systems

Luu, T.J.

5 runtime signals for catching a compromised AI agent

infonewsLLM-Specific

securitysafety

Source: CSO OnlineJune 15, 2026

Summary

AI agents (software systems that can read data, process external content, and take actions) now commonly have all three dangerous capabilities together, making them vulnerable to prompt injection (tricky instructions hidden in data that trick the AI into doing harmful things). Security experts like Meta recommend the 'Rule of Two,' which limits agents to only two of these three capabilities per session and requires human approval if all three are needed, but this framework has limitations and doesn't fully solve the problem.

Solution / Mitigation

Meta's security team published the 'Rule of Two' framework, which recommends agents satisfy no more than two of the three trifecta properties (access to private data, exposure to untrusted content, ability to communicate externally) in a single session, with human-in-the-loop approval required if all three are necessary. Simon Willison endorsed this framework as 'the best practical advice for building secure LLM-powered agent systems today.'