Publication | Open Access
A New Hate Speech Detection System based on Textual and Psychological Features
19
Citations
40
References
2022
Year
Abuse DetectionEngineeringMachine LearningMultimodal Sentiment AnalysisCorpus LinguisticsText MiningSpeech RecognitionNatural Language ProcessingSocial MediaData ScienceComputational LinguisticsLanguage StudiesContent AnalysisSocial Medium MiningHate SpeechSpeech AnalysisSocial Medium DataPsychological FeaturesLinguisticsRandom Forest
Hate speech often spreads on social media and harms individuals and the community. Machine learning models have been proposed to detect hate speech in social media; however, several issues presently limit the performance of current approaches. One challenge is the issue of having diverse comprehensions of hate speech constructs which will lead to many speech categories and different interpretations. In addition, certain language-specific features, and short text issues, such as Twitter, exacerbate the problem. Moreover, current machine learning approaches lack universality due to small datasets and the adoption of a few features of hateful speech. This paper develops and builds new feature sets based on frequencies of textual tokens and psychological characteristics. Then, the study evaluates several machine learning methods over a large dataset. Results showed that the Random Forest and BERT methods are the most valuable for detecting hate speech content. Furthermore, the most dominant features that are helpful for hate speech detection methods combine psychological features and Term-Frequency Inverse Document-Frequency (TFIDF) features. Therefore, the proposed approach could identify hate speech on social media platforms like Twitter.
| Year | Citations | |
|---|---|---|
Page 1
Page 1