Publication | Closed Access
Dynamic topic models
Citations: 2.3K
References: 13
Year: 2006
Unknown Venue
Natural Language Processing, Document Clustering, Latent Modeling, Engineering, Information Retrieval, Data Science, Vector Space Model, Topic Model, Computational Linguistics, Knowledge Discovery, Document Classification, Dynamic Topic Models, Time Evolution, OCR'ed Archives, Statistics, Corpus Linguistics, Text Mining, Nonparametric Wavelet Regression
A family of probabilistic time series models is developed to analyze the time evolution of topics in large document collections. The approach is to use state space models on the natural parameters of the multinomial distributions that represent the topics. Variational approximations based on Kalman filters and nonparametric wavelet regression are developed to carry out approximate posterior inference over the latent topics. In addition to giving quantitative, predictive models of a sequential corpus, dynamic topic models provide a qualitative window into the contents of a large document collection. The models are demonstrated by analyzing the OCR'ed archives of the journal Science from 1880 through 2000.
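The core idea in the abstract, a state space model on the natural parameters of a topic's multinomial distribution, can be illustrated with a minimal sketch. It assumes a Gaussian random walk as the state transition and a softmax mapping from natural parameters to word probabilities; the abstract does not spell out either choice. The function names, the vocabulary size, the number of time slices, and the drift variance `sigma` are all hypothetical values chosen for illustration, and the sketch covers only the forward simulation of one topic, not the variational inference.

```python
import numpy as np


def softmax(x):
    """Map natural parameters to a point on the probability simplex."""
    z = x - np.max(x)  # subtract the max for numerical stability
    e = np.exp(z)
    return e / e.sum()


def simulate_topic_drift(vocab_size=5, num_steps=4, sigma=0.5, seed=0):
    """Random-walk the natural parameters of a single topic over time.

    Returns an array of shape (num_steps, vocab_size); each row is the
    topic's word distribution for one time slice.
    """
    rng = np.random.default_rng(seed)
    # Initial natural parameters (an assumed standard-normal starting point).
    beta = rng.normal(0.0, 1.0, size=vocab_size)
    topics = []
    for _ in range(num_steps):
        topics.append(softmax(beta))
        # Assumed Gaussian state transition: beta_t = beta_{t-1} + noise.
        beta = beta + rng.normal(0.0, sigma, size=vocab_size)
    return np.array(topics)


topics = simulate_topic_drift()
```

Each row of `topics` is a valid word distribution, and consecutive rows differ only by the accumulated drift, which is what lets a topic change gradually across time slices. A smaller `sigma` gives smoother evolution.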
| Year | Citations |
|---|---|
| 2000 | 33.7K |
| 1960 | 30.4K |
| 2003 | 26.9K |
| 2004 | 5.9K |
| 2005 | 3.6K |
| 1982 | 3.1K |
| 2005 | 1.3K |
| 2004 | 1.3K |
| 2005 | 981 |
| 2005 | 929 |