Benchmarking LLMs and LLM-based Agents in Practical Vulnerab...

Abstract:

Large Language Models (LLMs) have shown promise in software vulnerability detection, particularly on function-level benchmarks like Devign and BigVul. However, real-world detection requires interprocedural analysis, as vulnerabilities often emerge through multi-hop function calls rather than isolate...

Key points:

Research on large language models
Security application

Source: Benchmarking LLMs and LLM-based Agents in Practical Vulnerability Detection for Code Repositories