Publication | Open Access
Machine Reading Tea Leaves: Automatically Evaluating Topic Coherence and Topic Model Quality
563
Citations
23
References
2014
Year
Unknown Venue
EngineeringCorpus LinguisticsMachine Reading TeaText MiningAutomatic SummarizationNatural Language ProcessingInformation RetrievalData ScienceComputational LinguisticsEvaluating Topic CoherenceLanguage StudiesContent AnalysisMachine TranslationDocument ClusteringNlp TaskKnowledge DiscoveryTerminology ExtractionTopic Model QualityDistributional SemanticsVector Space ModelTopic ModelTopic ModelsKeyword ExtractionText ProcessingTrend AnalysisLinguistics
Topic models based on latent Dirichlet allocation and related methods are used in a range of user-focused tasks including document navigation and trend analysis, but evaluation of the intrinsic quality of the topic model and topics remains an open research area. In this work, we explore the two tasks of automatic evaluation of single topics and automatic evaluation of whole topic models, and provide recommendations on the best strategy for performing the two tasks, in addition to providing an open-source toolkit for topic and topic model evaluation.
| Year | Citations | |
|---|---|---|
Page 1
Page 1