Concepedia

Publication | Closed Access

Topic Model Diagnostics: Assessing Domain Relevance via Topical Alignment

97

Citations

25

References

2013

Year

Abstract

The use of topic models to analyze domainspecific texts often requires manual validation of the latent topics to ensure that they are meaningful. We introduce a framework to support such a large-scale assessment of topical relevance. We measure the correspondence between a set of latent topics and a set of reference concepts to quantify four types of topical misalignment: junk, fused, missing, and repeated topics. Our analysis compares 10,000 topic model variants to 200 expertprovided domain concepts, and demonstrates how our framework can inform choices of model parameters, inference algorithms, and intrinsic measures of topical quality. 1.

References

YearCitations

Page 1