Publication | Closed Access
A novel refinement approach for text categorization
106
Citations
8
References
2005
Year
Unknown Venue
EngineeringBase ClassifierCorpus LinguisticsRefined Centroid ClassifierText MiningCentroid ClassifierNatural Language ProcessingSupport Vector MachineClassification MethodInformation RetrievalData ScienceData MiningPattern RecognitionComputational LinguisticsDocument ClassificationLanguage StudiesMachine TranslationAutomatic ClassificationKnowledge DiscoveryIntelligent ClassificationNovel Refinement ApproachLinguistics
In this paper we present a novel strategy, DragPushing, for improving the performance of text classifiers. The strategy is generic and takes advantage of training errors to successively refine the classification model of a base classifier. We describe how it is applied to generate two new classification algorithms; a Refined Centroid Classifier and a Refined Naive Bayes Classifier. We present an extensive experimental evaluation of both algorithms on three English collections and one Chinese corpus. The results indicate that in each case, the refined classifiers achieve significant performance improvement over the base classifiers used. Furthermore, the performance of the Refined Centroid Classifier implemented is comparable, if not better, to that of state-of-the-art support vector machine (SVM)-based classifier, but offers a much lower computational cost.
| Year | Citations | |
|---|---|---|
Page 1
Page 1