Publication | Closed Access
An improved TF-IDF approach for text classification
45
Citations
5
References
2005
Year
EngineeringImproved Tf-idf ApproachCorpus LinguisticsText MiningNatural Language ProcessingInformation RetrievalData ScienceComputational LinguisticsDocument ClassificationText ClassificationLanguage StudiesContent AnalysisAutomatic ClassificationKnowledge DiscoveryTerminology ExtractionIntelligent ClassificationConventional Tf-idf ApproachControlled VocabularyLexical ResourceKeyword ExtractionLinguisticsNew Tf-idf Approach
This paper presents a new improved term frequency/inverse document frequency (TF-IDF) approach which uses confidence, support and characteristic words to enhance the recall and precision of text classification. Synonyms defined by a lexicon are processed in the improved TF-IDF approach. We detailedly discuss and analyze the relationship among confidence, recall and precision. The experiments based on science and technology gave promising results that the new TF-IDF approach improves the precision and recall of text classification compared with the conventional TF-IDF approach.
| Year | Citations | |
|---|---|---|
Page 1
Page 1