Combating Hate Speech with Multi-Task Learning

Combating Hate Speech with Multi-Task Learning

Improving generalization for detecting harmful content targeting political figures

This research introduces a Multi-task Learning (MTL) framework that significantly improves hate speech detection across different domains and datasets.

  • Creates a unified approach that overcomes inconsistent definitions and labeling criteria
  • Demonstrates improved performance when generalizing to new domains
  • Shows particular effectiveness for detecting harmful content targeting political figures
  • Addresses the real-world challenge of cross-dataset performance

From a security perspective, this work provides essential tools for platforms to better protect vulnerable individuals and communities from online harm, helping maintain safer digital spaces while addressing the technical challenges of content moderation at scale.

Generalizing Hate Speech Detection Using Multi-Task Learning: A Case Study of Political Public Figures

2 | 104