Publication | Closed Access
Unintended Bias in Misogyny Detection
Citations: 72
References: 13
Year: 2019
Abuse Detection, Engineering, Machine Learning, Misogyny Detection Task, Corpus Linguistics, Journalism, Text Mining, Natural Language Processing, Data Science, Gender Studies, Bias, Computational Linguistics, Misogyny Detection, Hate Speech, Misogyny Detection Models, Algorithmic Bias, Sexism, Bias Detection, Gender Stereotype, Misogyny Detection Model, Arts
In recent years, hate against women has increased exponentially, especially in online environments such as microblogs. Although this alarming phenomenon has triggered many studies from both computational linguistics and machine learning perspectives, less effort has been spent on analyzing whether misogyny detection models are affected by unintended bias. Such bias can lead a model to assign an unreasonably high misogyny score to a non-misogynous text only because it contains certain terms, called identity terms. This work is the first attempt to address the problem of measuring and mitigating unintended bias in machine learning models trained for the misogyny detection task. We propose a novel synthetic test set, which can serve as an evaluation framework for measuring unintended bias, together with different mitigation strategies specific to this task. Moreover, we provide a misogyny detection model that is shown to achieve the best classification performance among state-of-the-art approaches. Experimental results on recently introduced bias metrics confirm that the bias mitigation treatment reduces the unintended bias of the proposed misogyny detection model.
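The idea of a synthetic test set built around identity terms can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the identity terms, the templates, the `toy_score` stand-in for a classifier, and the per-term false positive rate are hypothetical and are not the test set, the model, or the bias metrics used in the paper.

```python
from itertools import product

# Hypothetical identity terms and labeled templates (not the paper's lists).
IDENTITY_TERMS = ["woman", "girl", "wife"]
TEMPLATES = [
    ("I met a {term} at the library today", 0),  # 0 = non-misogynous
    ("every {term} deserves respect", 0),
    ("every {term} is worthless", 1),            # 1 = misogynous
]


def build_synthetic_set(terms, templates):
    """Fill every template with every identity term."""
    return [
        {"text": tpl.format(term=term), "term": term, "label": label}
        for term, (tpl, label) in product(terms, templates)
    ]


def false_positive_rate_by_term(examples, score, threshold=0.5):
    """Per term, the share of non-misogynous examples scored at or above the threshold."""
    rates = {}
    for term in sorted({ex["term"] for ex in examples}):
        negatives = [
            ex for ex in examples if ex["term"] == term and ex["label"] == 0
        ]
        flagged = sum(score(ex["text"]) >= threshold for ex in negatives)
        rates[term] = flagged / len(negatives)
    return rates


def toy_score(text):
    """Deliberately biased stand-in for a model: reacts to the word 'girl' alone."""
    return 0.9 if "girl" in text or "worthless" in text else 0.1


examples = build_synthetic_set(IDENTITY_TERMS, TEMPLATES)
rates = false_positive_rate_by_term(examples, toy_score)
print(rates)
```

In this toy setup, a term whose false positive rate is much higher than the others signals that the scorer reacts to the term itself rather than to the content of the sentence, which is the kind of unintended bias the abstract describes.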