AI Sec Watch: A Security Intelligence Platform for AI Systems

Luu, T.J.

What happened after 2,000 people tried to hack my AI assistant

infonewsLLM-Specific

securitysafety

Source: Simon Willison's WeblogJune 26, 2026

Summary

A researcher ran a public challenge where 2,000 people attempted to hack an AI assistant by sending emails containing prompt injection attacks (tricks to make an AI ignore its safety rules and reveal secrets). After 6,000 total attempts, nobody successfully leaked the system's secrets, suggesting that modern AI models are becoming more resistant to these attacks through better training.