Defending Against AI Video Deception

Defending Against AI Video Deception

Using Large Vision Language Models to Detect AI-Generated Videos

LAVID is a novel agentic framework that leverages Large Vision Language Models to detect videos generated by diffusion models with high accuracy.

  • Addresses a critical security gap in AI-generated content detection for videos
  • Implements a step-by-step reasoning approach that mimics how humans identify fake videos
  • Achieves superior detection performance across various video generation models
  • Introduces a comprehensive benchmark for evaluating AI-generated video detection systems

Security Implications: As AI video generation becomes more sophisticated, reliable detection systems are crucial for preserving digital integrity, preventing misinformation, and protecting against privacy violations.

LAVID: An Agentic LVLM Framework for Diffusion-Generated Video Detection

20 | 56