Publication | Closed Access
Termite
407
Citations
22
References
2012
Year
Unknown Venue
Natural Language ProcessingDocument ClusteringLatent TopicsEngineeringVector Space ModelData SciencePresent TermiteTopic ModelComputational LinguisticsKnowledge DiscoveryTopic ModelsLanguage StudiesLinguisticsText Mining
Topic models aid analysis of text corpora by identifying latent topics based on co-occurring words. Real-world deployments of topic models, however, often require intensive expert verification and model refinement. In this paper we present Termite, a visual analysis tool for assessing topic model quality. Termite uses a tabular layout to promote comparison of terms both within and across latent topics. We contribute a novel saliency measure for selecting relevant terms and a seriation algorithm that both reveals clustering structure and promotes the legibility of related terms. In a series of examples, we demonstrate how Termite allows analysts to identify coherent and significant themes.
| Year | Citations | |
|---|---|---|
Page 1
Page 1