AI Sec Watch: A Security Intelligence Platform for AI Systems

Luu, T.J.

Really Unlearned? Verifying Machine Unlearning via Influential Sample Pairs

security, research

Oct 13, 2025

Machine unlearning lets an AI model forget the effects of specific training samples, but verifying that this actually happened is difficult. Existing checks rely on backdoor attacks or membership inference attacks, which test whether a model still remembers data by trying to extract or manipulate it. A dishonest model provider can fool these checks by simply retraining the model to pass the test rather than truly unlearning. This paper proposes IndirectVerify, a formal verification method that uses pairs of connected samples: trigger samples, which are unlearned, and reaction samples, which should be affected by that unlearning. Intentional perturbations (small changes to the training data) are applied to these pairs to create indirect evidence that unlearning actually occurred, which makes it harder to fake.
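The trigger/reaction idea can be illustrated with a toy sketch. This is not the paper's IndirectVerify algorithm: it assumes a 1-nearest-neighbour classifier, treats exact retraining without the trigger as "unlearning", and every name in it (predict_1nn, verify_unlearning, the sample coordinates) is hypothetical. It only shows how a check on a reaction sample can give indirect evidence that a trigger sample was removed.

```python
from math import dist

def predict_1nn(train, x):
    """Return the label of the training point closest to x (1-nearest neighbour)."""
    return min(train, key=lambda item: dist(item[0], x))[1]

def verify_unlearning(before, after, reaction, influenced_label):
    """Hypothetical check: the reaction sample should show the trigger's
    influence before unlearning and lose it afterwards."""
    return (predict_1nn(before, reaction) == influenced_label
            and predict_1nn(after, reaction) != influenced_label)

# Toy training set of (features, label) pairs forming two clusters.
clean = [((0.0, 0.0), 0), ((0.2, 0.1), 0), ((1.0, 1.0), 1), ((0.9, 1.1), 1)]

# Trigger: a deliberately perturbed sample, placed inside cluster 0 but labelled 1.
trigger = ((0.1, 0.3), 1)
# Reaction: a query point whose prediction depends on whether the trigger is present.
reaction = (0.1, 0.35)

trained = clean + [trigger]
honest = [s for s in trained if s != trigger]  # retrained without the trigger
dishonest = list(trained)                      # provider claims unlearning, changes nothing

print(verify_unlearning(trained, honest, reaction, influenced_label=1))     # True
print(verify_unlearning(trained, dishonest, reaction, influenced_label=1))  # False
```

In this sketch the verifier never queries the trigger itself. It only observes the reaction sample, which is the sense in which the evidence is indirect.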

IEEE Xplore (Security & AI Journals)

