AI-Shielder: Exploiting Backdoors to Defend Against Adversarial Attacks
Summary
Deep neural networks (DNNs, machine learning models with many layers that learn patterns from data) are vulnerable to adversarial attacks, in which small, carefully crafted changes to input data trick the model into making wrong predictions, a serious risk in safety-critical applications such as self-driving cars. This paper presents AI-Shielder, a method that intentionally embeds backdoors (hidden pathways that alter how the model behaves) into neural networks to detect and block adversarial attacks while keeping the model's normal performance intact. Testing shows AI-Shielder reduces the attack success rate from 91.8% to 3.8% with only a minor slowdown.
Solution / Mitigation
AI-Shielder is the mitigation proposed in the paper. According to the results, it 'reduces the attack success rate from 91.8% to 3.8%, which outperforms the state-of-the-art works by 37.2%, with only a 0.6% decline in the clean data accuracy' and 'introduces only 1.43% overhead to the model prediction time, almost negligible in most cases.' The approach works by leveraging intentionally embedded backdoors to trap adversarial perturbations, causing attacks to fail while the model's performance on its original task is preserved.
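The summary above does not include implementation details. As a rough illustration of the general trapdoor-style idea it describes, the sketch below flags an input as adversarial when its internal activation pattern closely matches the signature of an intentionally embedded backdoor; the function names, the cosine-similarity test, and the threshold are all hypothetical assumptions, not the paper's actual method.

```python
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine similarity between two activation vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def is_adversarial(activation: np.ndarray,
                   trapdoor_signature: np.ndarray,
                   threshold: float = 0.8) -> bool:
    """Flag an input whose activations strongly resemble the embedded
    backdoor's signature. The intuition (assumed here): gradient-based
    attacks are drawn toward the intentionally planted backdoor, so
    perturbed inputs light up the trapdoor signature while clean
    inputs do not."""
    return cosine_similarity(activation, trapdoor_signature) >= threshold

# Toy demonstration with random activation vectors.
rng = np.random.default_rng(0)
signature = rng.normal(size=64)          # signature recorded at training time
clean = rng.normal(size=64)              # unrelated benign activation
attacked = signature + 0.1 * rng.normal(size=64)  # activation pulled toward the trapdoor

print(is_adversarial(clean, signature))     # expected to be False
print(is_adversarial(attacked, signature))  # expected to be True
```

In a real deployment the signature would be the mean activation of trigger-stamped training inputs at some hidden layer, and the threshold would be calibrated on clean validation data; both choices are assumptions here.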
Classification
Related Issues
Original source: http://ieeexplore.ieee.org/document/11184428
First tracked: February 12, 2026 at 02:22 PM
Classified by LLM (prompt v3) · confidence: 85%