Publication | Closed Access
Improvement and Application of TFIDF Method Based on Text Classification
33
Citations
0
References
2006
Year
Data ClassificationClassification MethodEngineeringInformation RetrievalData ScienceData MiningPattern RecognitionImproved TfidfTfidf MethodAutomatic ClassificationImproved Tfidf MethodDocument ClassificationGenetic AlgorithmBiostatisticsClassificationIntelligent ClassificationCorpus LinguisticsText Mining
TFIDF is a kind of common methods used to measure the terms in a document.The method is easy but it undervalues these terms that frequently appear in the documents belonging to the same class,while those terms can represent the characteristic of the documents of this class,so higher weight is entrusted to them.The expression of IDF in TFIDF is modified to increase the weight of those terms mentioned,then is applied to the experiment to validate it.In the experiment,the improved TFIDF is used to select feature and genetic algorithm is used to train the classifier.The method is better than others and proves that the improved TFIDF method is feasible.