Publication | Closed Access
A hybrid approach to compounds in LVCSR
11
Citations
3
References
2002
Year
Unknown Venue
Combinatorial ChemistryEngineeringSpeech CorpusOrganic ChemistryWord Error RateSpoken Language ProcessingChemistryChemical DerivativeCorpus LinguisticsText MiningSpeech RecognitionNatural Language ProcessingLanguage DocumentationData ScienceComputational LinguisticsLanguage EngineeringLanguage StudiesMachine TranslationAccurate Compound ModuleInorganic ChemistryHybrid ApproachTerminology ExtractionLanguage RecognitionSpeech ProcessingCompound ConstituentsDerivative (Chemistry)Linguistics
In several languages compound words form orthographic units, which complicates the task of ensuring good lexical coverage for large vocabulary continuous speech recognition (LVCSR). A common approach to the problem consists of first recognizing the compound constituents, followed by an automatic recompounding process. We describe an accurate compound module, which combines a rule-based approach with statistical pruning. The module is incorporated in a broadcast news recognition task for Dutch and yields an 11% relative decrease in word error rate (WER).
| Year | Citations | |
|---|---|---|
Page 1
Page 1