Publication | Closed Access
Latent dirichlet allocation for tag recommendation
494
Citations
28
References
2009
Year
Unknown Venue
Latent Dirichlet AllocationEngineeringIntelligent Information RetrievalSemantic WebCorpus LinguisticsText MiningNatural Language ProcessingInformation RetrievalData ScienceData MiningComputational LinguisticsRelevance FeedbackMultimedia ContentLanguage StudiesContent AnalysisAnnotation GuidelinesKnowledge DiscoverySocial Multimedia TaggingSemantic TaggingTopic ModelCollaborative Filtering
Tagging systems have become major infrastructures on the Web. They allow users to create tags that annotate and categorize content and share them with other users, very helpful in particular for searching multimedia content. However, as tagging is not constrained by a controlled vocabulary and annotation guidelines, tags tend to be noisy and sparse. Especially new resources annotated by only a few users have often rather idiosyncratic tags that do not reflect a common perspective useful for search. In this paper we introduce an approach based on Latent Dirichlet Allocation (LDA) for recommending tags of resources in order to improve search. Resources annotated by many users and thus equipped with a fairly stable and complete tag set are used to elicit latent topics to which new resources with only a few tags are mapped. Based on this, other tags belonging to a topic can be recommended for the new resource. Our evaluation shows that the approach achieves significantly better precision and recall than the use of association rules, suggested in previous work, and also recommends more specific tags. Moreover, extending resources with these recommended tags significantly improves search for new resources.
| Year | Citations | |
|---|---|---|
2003 | 26.9K | |
1993 | 14.7K | |
2009 | 12.5K | |
2004 | 5.9K | |
1993 | 4.4K | |
2006 | 1.8K | |
2006 | 1K | |
2008 | 928 | |
2006 | 876 | |
2007 | 545 |
Page 1
Page 1