
Detecting Vulnerabilities Without Labeled Data
LLM-based anomaly detection outperforms direct LLM vulnerability prediction
ANVIL reframes software vulnerability detection as an anomaly detection problem, eliminating the need for labeled training data while achieving superior results to direct LLM-based approaches.
- Uses unsupervised learning to identify code vulnerabilities
- Achieves higher accuracy than direct LLM vulnerability predictions
- Reduces false positives compared to traditional methods
- Provides a practical solution for the labeled data shortage in security applications
This research matters for security teams who struggle with insufficient vulnerability training data and the high costs of false positives. ANVIL's approach offers a more scalable and reliable method for identifying potential security vulnerabilities in software code.
ANVIL: Anomaly-based Vulnerability Identification without Labelled Training Data