LLMs in Public Health: Performance Evaluation

LLMs in Public Health: Performance Evaluation

Comprehensive assessment of AI models for health classification and data extraction

This research evaluates how effectively Large Language Models can support public health professionals in analyzing and classifying health-related text data.

  • Combines 13 datasets (6 external, 7 new) to test LLM performance across public health tasks
  • Assesses capabilities for identifying health burdens, epidemiological risk factors, and public health interventions
  • Provides systematic evaluation framework for determining LLM suitability in health applications

Practical applications include enhanced disease surveillance, more efficient health data processing, and supporting evidence-based public health decision-making—potentially reducing expert workload while maintaining accuracy.

Evaluating Large Language Models for Public Health Classification and Extraction Tasks

9 | 85