AI Sec Watch: A Security Intelligence Platform for AI Systems

Luu, T.J.

Trigger Without Trace: Toward Stealthy Backdoor Attack on Text-to-Image Diffusion Models

inforesearchPeer-Reviewed

securityresearch

Source: IEEE Xplore (Security & AI Journals)May 20, 2026

Summary

Researchers have developed a new backdoor attack method called Trigger without Trace (TwT) that can secretly compromise text-to-image diffusion models (AI systems that generate images from text descriptions) while avoiding detection. The method works by using syntactic structures (grammar patterns) as hidden triggers and employing a mathematical technique called Kernel Maximum Mean Discrepancy (KMMD, a way to match statistical distributions) to make malicious samples look identical to legitimate ones, achieving a 97.5% success rate while bypassing three existing defense detection systems.