aisecwatch.com
DashboardVulnerabilitiesNewsResearchArchiveStatsDatasetFor devs
Subscribe
aisecwatch.com

Real-time AI security monitoring. Tracking AI-related vulnerabilities, safety and security incidents, privacy risks, research developments, and policy changes.

Navigation

VulnerabilitiesNewsResearchDigest ArchiveNewsletter ArchiveSubscribeData SourcesStatisticsDatasetAPIIntegrationsWidgetRSS Feed

Maintained by

Truong (Jack) Luu

Information Systems Researcher

Research

Academic papers, new techniques, benchmarks, and theoretical findings in AI/LLM security.

to
Export CSV
690 items

Practical and secure history-independent indexing for queryable-encrypted databases

inforesearchPeer-Reviewed
security
Mar 19, 2026

This research paper, published in June 2026, presents a method for creating indexes in queryable-encrypted databases (databases where data stays encrypted even when being searched) that don't leak information about access patterns or query history. The approach aims to improve security by preventing attackers from inferring sensitive information about which data is being accessed based on observable patterns of database queries.

Elsevier Security Journals

Privacy-Preserving Spatio-Temporal Keyword Query with Verifiability for Location-Based Services

inforesearchPeer-Reviewed
security

N Truths and a Lie: Consistency-Based Backdoor Defense for Vertical Federated Learning

inforesearchPeer-Reviewed
security

A Dual-Purpose Framework for Backdoor Defense and Backdoor Amplification in Diffusion Models

inforesearchPeer-Reviewed
security

OWASP GenAI Security Project Expands AI Security Frameworks Ahead of RSA 2026, Celebrates Continued Sponsor Support

inforesearchIndustry
security

Survey on Learning-based Dynamic Fault Localization: From Traditional Machine Learning to Large Language Models

inforesearchPeer-Reviewed
research

Boosting Active Defense Persistence: A Two-Stage Defense Framework Combining Interruption and Poisoning Against Deepfake

inforesearchPeer-Reviewed
security

FORCE: Byzantine-Resilient Decentralized Federated Learning via Game-Theoretic Contribution Aggregation

inforesearchPeer-Reviewed
security

A novel android malware detection method based on CWInFs and MPTACF optimization

inforesearchPeer-Reviewed
research

Modeling of physical unclonable functions (PUF): A systematic literature review

inforesearchPeer-Reviewed
security

Alignment of Diffusion Models: Fundamentals, Challenges, and Future

inforesearchPeer-Reviewed
research

Machine Learning for Cybersecurity: A Comprehensive Literature Review

inforesearchPeer-Reviewed
research

Selective Forgetting in Machine Learning and Beyond: A Survey

inforesearchPeer-Reviewed
research

A Systematic Review on Human Roles, Solutions, and Methodological Approaches to Address Bias in AI

inforesearchPeer-Reviewed
research

Responsible AI Question Bank for Risk Assessment

inforesearchPeer-Reviewed
safety

Building Trust in Artificial Intelligence: A Systematic Review through the Lens of Trust Theory

inforesearchPeer-Reviewed
research

Detecting Training Data For Large Language Models: A Survey

inforesearchPeer-Reviewed
security

Bias-Free? An Empirical Study on Ethnicity, Gender, and Age Fairness in Deepfake Detection

inforesearchPeer-Reviewed
research

Adaptive Real-Time Financial Fraud Detection with Explainable AI Tools

inforesearchPeer-Reviewed
research

Enhancing Digital Security: A Novel Dual-Paradigm Approach for Robust Deepfake Detection Using Pre and Post Quantum-Trained Neural Networks

inforesearchPeer-Reviewed
research
Previous16 / 35Next
Mar 18, 2026

This research paper presents a method for searching location-based services (apps that use your geographic position, like finding nearby restaurants) while protecting user privacy and ensuring the results are trustworthy. The approach combines spatio-temporal (location and time-based) keyword searching with verifiability (a way to prove the results are correct), allowing users to query location services without exposing their exact location or search patterns to the service provider.

Elsevier Security Journals
research
Mar 18, 2026

This paper addresses backdoor attacks (where attackers secretly poison AI models to make them behave maliciously) in vertical federated learning (VFL, a setup where different organizations train an AI together on their own private data). The researchers propose a defense using a latent masked autoencoder (LMAE, a type of neural network that detects patterns and missing information) to identify when one participant is submitting suspicious, inconsistent data compared to honest participants, allowing the system to reject malicious contributions.

Fix: The paper proposes a novel defense mechanism using a latent masked autoencoder (LMAE) to assess the semantic consistency of embeddings (learned data representations) from different participants. The authors developed an algorithm based on the LMAE that identifies attackers and enables backdoor-resistant predictions. The defense was tested on multiple datasets and backdoor attack types and demonstrated effectiveness at identifying attackers while maintaining high prediction accuracy.

IEEE Xplore (Security & AI Journals)
research
Mar 18, 2026

Diffusion models (AI systems that generate images and other content by gradually removing noise from random data) are vulnerable to backdoor attacks, where hidden triggers cause the model to produce harmful outputs. Researchers created PureDiffusion, a framework that can both defend against these attacks by detecting and inverting the hidden triggers, and amplify attacks by making existing backdoors more effective.

IEEE Xplore (Security & AI Journals)
policy
Mar 18, 2026

The OWASP GenAI Security Project, an open-source community focused on AI security, announced expansion of its resources and frameworks with over 25,000 members contributing practical guidance and tools. The project is being highlighted at the RSA 2026 conference, indicating growing industry adoption of AI security best practices.

OWASP GenAI Security
Mar 18, 2026

This survey examines methods for automatically finding bugs in software code by using machine learning and AI models, tracing the evolution from traditional machine learning techniques to modern large language models (LLMs, which are AI systems trained on vast amounts of text data). The research covers how these AI-based approaches learn patterns to pinpoint where faults occur in code, making debugging faster and more efficient than manual inspection.

ACM Digital Library (TOPS, DTRAP, CSUR)
research
Mar 17, 2026

This research addresses a weakness in active defense systems against deepfakes (AI-generated fake videos or images): these defenses often fail when attackers retrain their models on protected samples. The authors propose a Two-Stage Defense Framework (TSDF) that uses dual-function adversarial perturbations (carefully designed noise patterns that disrupt both the deepfake output and the attacker's retraining process) to make defenses more persistent by poisoning the data (corrupting the training information) that attackers would use to adapt their models.

Fix: The source describes the proposed defense framework (TSDF) as the solution but does not mention an existing patch, update, or mitigation for current systems. The paper presents the framework as a research contribution rather than a fix for deployed software. N/A -- no mitigation for existing systems discussed in source.

IEEE Xplore (Security & AI Journals)
research
Mar 17, 2026

Decentralized Federated Learning (DFL, a way for multiple computers to train AI models together without a central server) is vulnerable to Byzantine attacks (when malicious participants send bad data to sabotage the learning process). The paper proposes FORCE, a new method that uses game theory concepts (mathematical models of strategy and fairness) to identify and exclude malicious clients by checking their model loss (how well their models perform) instead of checking gradients (the direction to improve), making DFL more resistant to these attacks.

IEEE Xplore (Security & AI Journals)
Mar 17, 2026

Android malware is a major security threat because the Android operating system's open app ecosystem allows unverified applications to be installed, making it easier for malicious software to spread and steal data, perform unauthorized financial transactions, or remotely control devices. Researchers are using machine learning (algorithms that learn patterns from data) to detect malware by analyzing features of Android application packages (APK files, the file format for Android apps), with recent research focusing on three main approaches: selecting the most important features to analyze, combining multiple detection models together, and handling datasets where malicious apps are much rarer than legitimate ones.

Elsevier Security Journals
Mar 17, 2026

This academic paper is a systematic literature review (a comprehensive analysis of existing research) about physical unclonable functions, or PUFs, which are hardware-based security features that create unique, unchangeable identifiers for devices based on their physical properties. Published in July 2026, the review examines how PUFs are modeled and studied across different research papers. The paper does not describe a security problem or vulnerability, but rather surveys current knowledge about how these security devices work.

Elsevier Security Journals
safety
Mar 16, 2026

This is an academic survey paper published in ACM Computing Surveys that examines alignment of diffusion models (AI systems trained to generate images or other content by gradually removing noise from random data). The paper covers fundamental concepts, current challenges in making these models behave as intended, and directions for future research in this area.

ACM Digital Library (TOPS, DTRAP, CSUR)
Mar 16, 2026

This is a literature review article published in an academic journal that surveys how machine learning (algorithms that learn patterns from data to make predictions) is being applied to cybersecurity problems. The article covers research across the field but does not describe a specific security vulnerability or incident requiring a fix.

ACM Digital Library (TOPS, DTRAP, CSUR)
safety
Mar 16, 2026

This is a survey article that reviews research on selective forgetting in machine learning, which is the ability to remove or reduce specific information from a trained AI model without completely retraining it from scratch. The article covers methods and applications of this technique across various AI systems and domains. The survey appears to be an academic overview of current knowledge in this area rather than describing a specific problem or vulnerability.

ACM Digital Library (TOPS, DTRAP, CSUR)
safety
Mar 16, 2026

This academic review examines how bias (systematic unfairness in AI decision-making) occurs in AI systems and explores the human roles, solutions, and research methods used to identify and reduce it. The paper surveys existing approaches to addressing bias rather than proposing a single new solution.

ACM Digital Library (TOPS, DTRAP, CSUR)
research
Mar 16, 2026

This is an academic survey article published in ACM Computing Surveys that discusses a question bank designed to help assess risks in AI systems responsibly. The article appears to be a comprehensive review of how organizations can evaluate potential harms and safety concerns when developing or deploying AI, rather than describing a specific vulnerability or problem.

ACM Digital Library (TOPS, DTRAP, CSUR)
safety
Mar 16, 2026

This academic paper is a systematic review published in ACM Computing Surveys that examines how trust works in artificial intelligence systems using established trust theory frameworks. The article analyzes trust in AI through theoretical lenses rather than addressing a specific technical vulnerability or problem.

ACM Digital Library (TOPS, DTRAP, CSUR)
research
Mar 16, 2026

This survey article reviews methods for detecting training data used to build large language models (LLMs, which are AI systems trained on massive amounts of text to generate human-like responses). The paper examines various techniques that researchers have developed to identify and extract information about what data was used to train these models, which is important for understanding model behavior and potential privacy concerns.

ACM Digital Library (TOPS, DTRAP, CSUR)
safety
Mar 16, 2026

This research paper studies whether deepfake detection systems (AI tools that identify fake videos made to look real) are fair across different groups of people based on ethnicity, gender, and age. The study found that these detection systems often perform differently depending on the person's background, meaning they work better for some groups than others. The paper highlights that bias in deepfake detection is an important fairness problem that needs attention.

ACM Digital Library (TOPS, DTRAP, CSUR)
security
Mar 16, 2026

This academic paper discusses using explainable AI (AI systems that can show their reasoning for decisions) to detect financial fraud as it happens in real time. The research focuses on making fraud detection systems that adapt to new fraud patterns while also being transparent about why they flag transactions as suspicious.

ACM Digital Library (TOPS, DTRAP, CSUR)
security
Mar 16, 2026

This research paper proposes a new method for detecting deepfakes (AI-generated fake videos or images) by using neural networks (computer systems loosely modeled on how brains learn) trained with both current and quantum computing approaches. The dual approach aims to make deepfake detection more reliable and harder for attackers to bypass.

ACM Digital Library (TOPS, DTRAP, CSUR)