Breaking Opus 4.7 with ChatGPT (Hacking Claude's Memory)
Summary
A researcher discovered that Claude Opus 4.7 can be tricked using an adversarial image (a specially crafted image designed to fool AI systems) generated by ChatGPT to misuse the memory tool and store false information for future conversations. While Claude Opus 4.6+ is harder to attack than earlier versions because it reasons through requests before acting, it remains vulnerable to this type of indirect prompt injection (embedding hidden malicious instructions in images rather than text).
Classification
Affected Vendors
Related Issues
Original source: https://embracethered.com/blog/posts/2026/breaking-opus-4.7-with-chatgpt/
First tracked: April 18, 2026 at 02:00 AM
Classified by LLM (prompt v3) · confidence: 85%