Smart Classification on a Budget

Smart Classification on a Budget

Optimizing the cost-accuracy tradeoff in machine learning inference

OCCAM introduces a novel framework for cost-efficient classification that intelligently routes queries to appropriate models based on difficulty and requirements.

  • Achieves up to 5x cost reduction while maintaining accuracy targets
  • Uses a cascade architecture with multiple classifiers of varying capacities
  • Employs intelligent routing to direct queries to the most suitable classifier
  • Implements dynamic accuracy-cost management with probabilistic guarantees

For healthcare applications, OCCAM enables optimized resource allocation, allowing high-capacity models to focus on complex medical cases while simpler models handle routine classification, reducing operational costs without compromising diagnostic accuracy.

OCCAM: Towards Cost-Efficient and Accuracy-Aware Classification Inference

12 | 116