Benchmarking LLMs and LLM-based Agents in Practical Vulnerability Detection for Code Repositories

By Alperen Yildiz, Sin G. Teo...

Abstract:

Large Language Models (LLMs) have shown promise in software vulnerability detection, particularly on function-level benchmarks like Devign and BigVul. However, real-world detection requires interprocedural analysis, as vulnerabilities often emerge through multi-hop function calls rather than isolate...

Key points:

  • Research on large language models
  • Security application

Source: Benchmarking LLMs and LLM-based Agents in Practical Vulnerability Detection for Code Repositories
