Publication | Closed Access
UNSUPERVISED WORD INDUCTION USING MDL CRITERION
17
Citations
6
References
2000
Year
Unknown Venue
Unsupervised learning of units (phonemes, words, phrases, etc.) is important to the design of statistical speech and NLP systems. This paper presents a general source-coding framework for induc-ing words from natural language text without word boundaries. An efficient search algorithm is developed to optimize the mini-mum description length (MDL) induction criterion. Despite some seemingly over-simplified modeling assumption, we achieved good results on several word induction problems.
| Year | Citations | |
|---|---|---|
Page 1
Page 1