Detecting backdoored language models at scale | AI Sec Watch