AI Sec Watch: A Security Intelligence Platform for AI Systems

Luu, T.J.

Breaking Opus 4.7 with ChatGPT (Hacking Claude's Memory)

infonewsLLM-Specific

securitysafety

Source: Embrace The RedApril 17, 2026

Summary

A researcher discovered that Claude Opus 4.7 can be tricked using an adversarial image (a specially crafted image designed to fool AI systems) generated by ChatGPT to misuse the memory tool and store false information for future conversations. While Claude Opus 4.6+ is harder to attack than earlier versions because it reasons through requests before acting, it remains vulnerable to this type of indirect prompt injection (embedding hidden malicious instructions in images rather than text).