Beyond Academia: Testing LLMs on Real Professional Exams

IndoCareer introduces a novel dataset of 8,834 multiple-choice questions from actual professional certification exams across diverse fields in Indonesia.

Shifts LLM evaluation from academic to real-world professional knowledge
Covers six key sectors including healthcare and law
Provides culturally-specific contexts often missing in global evaluations
Reveals performance gaps in specialized professional domains

For healthcare professionals, this research highlights how LLMs perform when handling medical certification questions with local context—an essential insight for developing AI assistants for medical education and practice in diverse healthcare systems.

Cracking the Code: Multi-domain LLM Evaluation on Real-World Professional Exams in Indonesia