Publication | Closed Access
Unsupervised query segmentation using generative language models and wikipedia
158
Citations
25
References
2008
Year
Unknown Venue
EngineeringQuery ModelCorpus LinguisticsQuery SuggestionText MiningNatural Language ProcessingInformation RetrievalData ScienceData MiningComputational LinguisticsQuery ExpansionLanguage StudiesUnsupervised LearningQuery SegmentationMachine TranslationSearch TechnologyKnowledge DiscoveryQuery AnalysisNovel Unsupervised ApproachLinguistics
In this paper, we propose a novel unsupervised approach to query segmentation, an important task in Web search. We use a generative query model to recover a query's underlying concepts that compose its original segmented form. The model's parameters are estimated using an expectation-maximization (EM) algorithm, optimizing the minimum description length objective function on a partial corpus that is specific to the query. To augment this unsupervised learning, we incorporate evidence from Wikipedia.
| Year | Citations | |
|---|---|---|
Page 1
Page 1