Publication | Closed Access
A New Instance-weighting Naive Bayes Text Classifiers
Citations: 11
References: 16
Year: 2018
Unknown Venue
Engineering, Machine Learning, Conditional Independence, Training Instance, Corpus Linguistics, Text Mining, Natural Language Processing, Classification Method, Information Retrieval, Data Science, Data Mining, Pattern Recognition, Document Classification, Instance-based Learning, Automatic Classification, Knowledge Discovery, Intelligent Classification, Computer Science, New Instance-weighting Approach
Recent research shows that naive Bayes text classifiers achieve noticeable classification performance despite their strong assumption of conditional independence among features. To weaken this unrealistic assumption and improve classification accuracy, there are generally three approaches: structure manipulation, feature manipulation, and instance manipulation. Instance manipulation can be further divided into instance weighting and instance selection. In this paper, we propose a new instance-weighting approach to the naive Bayes text classifier. In this approach, the training dataset is first divided into several subsets according to class value. Then every training instance in a subset is weighted according to the distance between it and the mean of that subset. Experimental results on 15 text document datasets show that, in terms of classification accuracy, our method performs better than three existing naive Bayes text classifiers.
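The two steps described in the abstract (split the training set by class, then weight each instance by its distance to the class mean) can be sketched roughly as follows. This is only an illustration, not the paper's method: the abstract does not say which distance is used or how distance maps to a weight, so Euclidean distance and the mapping `1 / (1 + distance)` are assumptions here. The function names `instance_weights`, `train_weighted_mnb`, and `predict` are hypothetical, and the classifier is a plain multinomial naive Bayes with Laplace smoothing that uses the weights as fractional counts.

```python
import math
from collections import defaultdict


def instance_weights(X, y):
    """Weight each training instance by its distance to its class mean.

    X: list of term-frequency vectors, y: list of class labels.
    Assumption: Euclidean distance and weight = 1 / (1 + distance);
    the abstract specifies neither choice.
    """
    # Step 1: split instance indices into subsets by class value.
    by_class = defaultdict(list)
    for i, label in enumerate(y):
        by_class[label].append(i)

    weights = [0.0] * len(X)
    for idx in by_class.values():
        n_features = len(X[idx[0]])
        # Step 2: mean vector of this class subset.
        mean = [sum(X[i][j] for i in idx) / len(idx) for j in range(n_features)]
        # Step 3: instances closer to the mean get a larger weight.
        for i in idx:
            d = math.sqrt(sum((X[i][j] - mean[j]) ** 2 for j in range(n_features)))
            weights[i] = 1.0 / (1.0 + d)
    return weights


def train_weighted_mnb(X, y, weights):
    """Multinomial naive Bayes using instance weights as fractional counts."""
    classes = sorted(set(y))
    n_features = len(X[0])
    total_w = sum(weights)
    log_prior, log_cond = {}, {}
    for c in classes:
        idx = [i for i, label in enumerate(y) if label == c]
        class_w = sum(weights[i] for i in idx)
        # Laplace-smoothed weighted class prior.
        log_prior[c] = math.log((class_w + 1.0) / (total_w + len(classes)))
        # Laplace-smoothed weighted term counts per class.
        counts = [sum(weights[i] * X[i][j] for i in idx) for j in range(n_features)]
        denom = sum(counts) + n_features
        log_cond[c] = [math.log((cnt + 1.0) / denom) for cnt in counts]
    return log_prior, log_cond


def predict(model, x):
    """Return the class with the highest log posterior score for vector x."""
    log_prior, log_cond = model
    scores = {
        c: log_prior[c] + sum(f * lp for f, lp in zip(x, log_cond[c]))
        for c in log_prior
    }
    return max(scores, key=scores.get)


# Toy usage with made-up term-frequency vectors (not data from the paper).
X = [[3, 0, 1], [2, 0, 0], [0, 3, 1], [0, 2, 2]]
y = ["a", "a", "b", "b"]
model = train_weighted_mnb(X, y, instance_weights(X, y))
```

Under this assumed mapping, an instance that coincides with its class mean receives weight 1, and the weight decays toward 0 as the instance moves away, so outlying documents contribute less to the class-conditional term estimates.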