Publication | Closed Access
A morpho-graphemic approach for the recognition of spontaneous speech in agglutinative languages - like Hungarian
25
Citations
14
References
2007
Year
Unknown Venue
Pronunciation DictionariesLanguage-modeling ApproachSpeech CorpusSpoken Language ProcessingPhonologyMorpho-graphemic ApproachCorpus LinguisticsSpeech RecognitionApplied LinguisticsPhoneticsComputational LinguisticsRobust Speech RecognitionLanguage StudiesHealth SciencesAgglutinative LanguagesMorphologySpontaneous SpeechSpeech CommunicationSpeech TechnologySpeech AnalysisPhonology MorphologyAcoustic ModelsLanguage RecognitionSpeech ProcessingSpeech InputSpeech PerceptionLinguistics
A coupled acoustic- and language-modeling approach is presented for the recognition of spontaneous speech primarily in agglutinative languages. The effectiveness of the approach in large vocabulary spontaneous speech recognition is demonstrated on the Hungarian MALACH corpus. The derivation of morphs from word forms is based on a statistical morphological segmentation tool while the mapping of morphs into graphemes is obtained trivially by splitting each morph into individual letters. Using morphs instead of words in language modeling gives significant WER reductions in case of both phoneme- and grapheme-based acoustic modeling. The improvements are larger after speaker adaptation of the acoustic models. In conclusion, morphophonemic and the proposed morpho-graphemic ASR approaches yield the same best WERs, which are significantly lower than the word-based baselines but essentially without language dependent rules or pronunciation dictionaries in the latter case. Index Terms: spontaneous speech recognition, morphology.
| Year | Citations | |
|---|---|---|
Page 1
Page 1