AI Sec Watch: A Security Intelligence Platform for AI Systems

Luu, T.J.

OpenAI talks about not talking about goblins

infonewsLLM-Specific

safety

Source: The Verge (AI)April 30, 2026

Summary

OpenAI discovered that its AI models were unexpectedly inserting references to goblins and other creatures into their responses, a behavior that started appearing in the GPT-5.1 model, particularly when using the "Nerdy" personality option. The company traced this quirk to patterns in the training data and added instructions to prevent the models from discussing these creatures.