AI Sec Watch: A Security Intelligence Platform for AI Systems

Luu, T.J.

Cert-SSBD: Certified Backdoor Defense With Sample-Specific Smoothing Noises

securityresearch

Feb 24, 2026

Deep neural networks can be attacked through backdoors, where attackers secretly poison training data to make the model misclassify certain inputs while appearing normal otherwise. This paper proposes Cert-SSBD, a defense method that uses randomized smoothing (adding random noise to samples) with sample-specific noise levels, optimized per sample using stochastic gradient ascent, combined with a new certification approach to make models more resistant to these attacks.

Fix: The proposed Cert-SSBD method addresses the issue by employing stochastic gradient ascent to optimize the noise magnitude for each sample, applying this sample-specific noise to multiple poisoned training sets to retrain smoothed models, aggregating predictions from multiple smoothed models, and introducing a storage-update-based certification method that dynamically adjusts each sample's certification region to improve certification performance.

IEEE Xplore (Security & AI Journals)

AI Sec Watch

Latest Intel

New Relic launches new AI agent platform and OpenTelemetry tools

This Chainsmokers-approved AI music producer is joining Google

New ‘Sandworm_Mode’ Supply Chain Attack Hits NPM

Cert-SSBD: Certified Backdoor Defense With Sample-Specific Smoothing Noises

Risk-Aware Privacy Preservation for LLM Inference

A Novel Perspective on Gradient Defense: Layer-Specific Protection Against Privacy Leakage

Nimble raises $47M to give AI agents access to real-time web data

GitHub Issues Abused in Copilot Attack Leading to Repository Takeover

Anthropic joins OpenAI in flagging 'industrial-scale' distillation campaigns by Chinese AI firms

Is AI Good for Democracy?