Benchmarking Knowledge-Extraction Attack and Defense on Retrieval-Augmented Generation
securityprivacyresearch
Source: Arxiv (cs.CR + cs.AI)February 10, 2026Summary
This paper introduces the first systematic benchmark for evaluating knowledge-extraction attacks on Retrieval-Augmented Generation (RAG) systems, which can be exploited through maliciously crafted queries to recover sensitive knowledge-base content. The benchmark consolidates fragmented research by providing a unified experimental framework covering various attack and defense strategies, retrieval embedding models, and both open- and closed-source generators across standardized datasets.
Original source: https://arxiv.org/abs/2602.09319v1
First tracked: February 11, 2026 at 06:00 PM