AI Sec Watch: A Security Intelligence Platform for AI Systems

Luu, T.J.

ShadowCoT: Cognitive Hijacking for Stealthy Reasoning Backdoors in LLMs

inforesearchPeer-ReviewedLLM-Specific

securityresearch

Source: IEEE Xplore (Security & AI Journals)April 27, 2026

Summary

ShadowCoT is a backdoor attack (a hidden vulnerability inserted into an AI model that causes it to misbehave when triggered) that targets Chain-of-Thought reasoning, which is a technique where LLMs show their step-by-step thinking to solve complex problems. Unlike simpler attacks, ShadowCoT hijacks the model's internal reasoning process by subtly rewiring how attention flows through the model and changing intermediate representations (internal data the model creates while processing), allowing it to produce logical-sounding but harmful outputs while avoiding detection.