AI Sec Watch: A Security Intelligence Platform for AI Systems

Luu, T.J.

Jules Zombie Agent: From Prompt Injection to Remote Control

securitysafety

Aug 14, 2025

Jules, a coding agent, is vulnerable to prompt injection (tricking an AI by hiding malicious instructions in its input) attacks that can lead to remote command and control compromise. An attacker can embed malicious instructions in GitHub issues to trick Jules into downloading and executing malware, giving attackers full control of the system. The attack works because Jules has unrestricted internet access and automatically approves plans after a time delay without requiring human confirmation.

Fix: The source explicitly recommends four mitigations: (1) 'Be careful when directly tasking Jules to work with untrusted data (e.g. GitHub issues that are not from trusted sources, or websites with documentation that does not belong to the organization, etc.)'; (2) 'do not have Jules work on private, important, source code or give it access to production-level secrets, or anything that could enable an adversary to perform lateral movement'; (3) deploy 'monitoring and detection tools on these systems' to 'enable security teams to monitor and understand potentially malicious behavior'; and (4) 'do not allow arbitrary Internet access by default. Instead, allow the configuration to be enabled when needed.'

Embrace The Red

AI Sec Watch

Latest Intel

CVE-2025-55284: Claude Code is an agentic coding tool. Prior to version 1.0.4, it's possible to bypass the Claude Code confirmation prom

Automated Red Teaming Scans of Dataiku Agents Using Protect AI Recon

Google Jules is Vulnerable To Invisible Prompt Injection

Jules Zombie Agent: From Prompt Injection to Remote Control

Google Jules: Vulnerable to Multiple Data Exfiltration Issues

CVE-2025-23298: NVIDIA Merlin Transformers4Rec for all platforms contains a vulnerability in a python dependency, where an attacker coul

GitHub Copilot: Remote Code Execution via Prompt Injection (CVE-2025-53773)

CVE-2025-53773: Improper neutralization of special elements used in a command ('command injection') in GitHub Copilot and Visual Studio

AI Safety Newsletter #61: OpenAI Releases GPT-5

CVE-2025-55012: Zed is a multiplayer code editor. Prior to version 0.197.3, in the Zed Agent Panel allowed for an AI agent to achieve Re