
Poisoning the Well: RAG System Vulnerabilities
A new efficient attack method threatens retrieval-based AI systems
This research introduces DIGA (Deterministic Influential tokensGuided Attack), a novel black-box attack that efficiently poisons corpora used by retrieval-augmented generation systems.
- Creates adversarial passages that manipulate retrieval results with minimal computation
- Achieves 93.5% attack success rate while being 7.8× faster than existing methods
- Requires no training or gradient computation, making it highly accessible to attackers
- Exploits the vulnerability of token-level influence in dense retrievers
This research highlights critical security vulnerabilities in RAG systems, which are increasingly deployed in enterprise environments. Organizations implementing retrieval-augmented AI must address these threats through robust defense mechanisms and content validation.
Tricking Retrievers with Influential Tokens: An Efficient Black-Box Corpus Poisoning Attack