Concepedia

Publication | Closed Access

Morphology-based language modeling for arabic speech recognition

101

Citations

10

References

2004

Year

Abstract

Abstract : Language modeling is a difficult problem for languages with rich morphology. In this paper we investigate the use of morphology-based language models at different stages in a speech recognition system for conversational Arabic. Class-based and single-stream factored language models using morphological word representations are applied within an N-best list rescoring framework. In addition, we explore the use of factored language models in first-pass recognition, which is facilitated by two novel procedures: the data-driven optimization of a multi-stream language model structure, and the conversion of a factored language model to a standard word-based model. We evaluate these techniques on a large-vocabulary recognition task and demonstrate that they lead to perplexity and word error rate reductions.

References

YearCitations

1990

618

2003

282

2000

267

2002

173

2002

71

2002

64

2001

60

2002

58

2000

30

2003

13

Page 1