Concepedia

Publication | Closed Access

Unsupervised query segmentation using generative language models and wikipedia

158

Citations

25

References

2008

Year

Bin Tan, Fuchun Peng

Unknown Venue

Abstract

In this paper, we propose a novel unsupervised approach to query segmentation, an important task in Web search. We use a generative query model to recover a query's underlying concepts that compose its original segmented form. The model's parameters are estimated using an expectation-maximization (EM) algorithm, optimizing the minimum description length objective function on a partial corpus that is specific to the query. To augment this unsupervised learning, we incorporate evidence from Wikipedia.

References

YearCitations

Page 1