Publication | Closed Access
Identification and Classification of Cybercrimes using Text Mining Technique
20
Citations
23
References
2019
Year
Unknown Venue
Abuse DetectionEngineeringInformation SecurityHidden IdentityInformation ForensicsCyber CrimeText Mining TechniqueText MiningSpam FilteringComputational Social ScienceProfiling TechniqueSocial MediaData ScienceData MiningPattern RecognitionTextual FeaturesContent AnalysisSocial Medium MiningCybercrimeThreat DetectionKnowledge DiscoveryComputer ScienceCyber Crime InvestigationCyberbullyingOnline HarassmentSocial ComputingDemographic FeaturesArts
Cyber-crimes involve all the crimes where internet is used as an access medium and committed through some electronic device such as computers and mobile phones. Unavailability of datasets, hidden identity of predators and the privacy of the victims are the main factors for limiting the past research in cyberbullying detection. Considering these factors, an effective text mining approach using machine learning algorithms is proposed to proactively detect bullying text. The dataset collected from myspace.com and Preverted-Justice.com has been used to evaluate the system's performance. Three types of feature namely textual, behavioral and demographic features are extracted from the dataset as compared to earlier study over the same dataset where only textual features were considered. Textual features include certain bullying words that if exists within the text may lead to a true outcome for cyberbullying. Personality trait features are extracted for the user if it is involving once in bullying may bully in future too. While demographic features extracted from dataset include age, gender and location. The system is evaluated through different performance measures for both classifiers used and the performance of Support Vector Machine classifier is found better than the Bernoulli NB with an overall 87.14% of classification accuracy.
| Year | Citations | |
|---|---|---|
Page 1
Page 1