Publication | Closed Access
Text Classification by Boosting Weak Learners based on Terms and Concepts
81
Citations
6
References
2005
Year
Unknown Venue
Boosting Weak LearnersEngineeringDocument RepresentationsClassical Document RepresentationCorpus LinguisticsText MiningNatural Language ProcessingClassification MethodInformation RetrievalData ScienceData MiningPattern RecognitionComputational LinguisticsDocument ClassificationText ClassificationLanguage StudiesMachine TranslationAutomatic ClassificationKnowledge DiscoveryTerminology ExtractionIntelligent ClassificationVector Space ModelKeyword ExtractionLinguistics
Document representations for text classification are typically based on the classical bag-of-words paradigm. This approach comes with deficiencies that motivate the integration of features on a higher semantic level than single words. In this paper we propose an enhancement of the classical document representation through concepts extracted from background knowledge. Boosting is used for actual classification. Experimental evaluations on two well known text corpora support our approach through consistent improvement of the results.
| Year | Citations | |
|---|---|---|
Page 1
Page 1