Publication | Closed Access
An Empirical Comparison of Four Text Mining Methods
121
Citations
40
References
2010
Year
Unknown Venue
EngineeringTextual DataMining MethodsCorpus LinguisticsText MiningNatural Language ProcessingEmpirical ComparisonInformation RetrievalData ScienceData MiningDocument ClassificationContent AnalysisStatisticsDocument ClusteringKnowledge DiscoveryLatent Semantic AnalysisWeb MiningText Mining MethodsTopic ModelKeyword ExtractionText Processing
The amount of textual data that is available for researchers and businesses to analyze is increasing at a dramatic rate. This reality has led IS researchers to investigate various text mining techniques. This essay examines four text mining methods that are frequently used in order to identify their advantages and limitations. The four methods that we examine are (1) latent semantic analysis, (2) probabilistic latent semantic analysis, (3) latent Dirichlet allocation, and (4) the correlated topic model. We compare these four methods and highlight the optimal conditions under which to apply the various methods. Our paper sheds light on the theory that underlies text mining methods and provides guidance for researchers who seek to apply these methods.
| Year | Citations | |
|---|---|---|
Page 1
Page 1