AI Sec Watch: A Security Intelligence Platform for AI Systems

Luu, T.J.

Enhancing Targeted Adversarial Attacks on Large Vision-Language Models via Intermediate Projector

inforesearchPeer-Reviewed

securityresearch

Source: IEEE Xplore (Security & AI Journals)June 10, 2026

Summary

Researchers developed new methods to perform targeted adversarial attacks (carefully crafted inputs designed to trick AI systems into producing specific harmful outputs) on Large Vision-Language Models, which are AI systems that process both images and text. The attack methods exploit a component called the projector (a part of the model that helps align visual and text information) to make attacks more precise and effective, allowing attackers to modify specific parts of an image while leaving other parts unchanged, and these attacks were shown to work against commercial AI systems like Google Gemini and OpenAI GPT.