Concepedia

Publication | Closed Access

Feature selection for text classification based on part of speech filter and synonym merge

13

Citations

4

References

2015

Year

Abstract

In recent years, text categorization based on machine learning is a widely used technology in the field of natural language processing and text mining and has gained many advances. Feature selection is one of the key problems in text categorization. The chief obstacles to feature selection are noise and sparseness. In this paper, we propose an approach of Chinese text feature selection based on CV (contribution value), POS (part of speech) filter and synonym merge. We carry out experiments over corpus-TanCorpV1.0 and find that the proposed method performs better than traditional ones.

References

YearCitations

Page 1