Publication | Closed Access
Cross-language bootstrapping for unsupervised acoustic model training: rapid development of a Polish speech recognition system
55
Citations
13
References
2009
Year
Unknown Venue
EngineeringMachine LearningSpoken Language ProcessingRapid DevelopmentSpeech RecognitionNatural Language ProcessingRobust Speech RecognitionVoice RecognitionCross-language BootstrappingMachine TranslationHealth SciencesUnsupervised Model Training.confidenceEuropean ParliamentSpeech CommunicationLanguage RecognitionSpeech ProcessingSpeech InputSpeech PerceptionLinguisticsSpeaker Recognition
This paper describes the rapid development of a Polish language speech recognition system.The system development was performed without access to any transcribed acoustic training data.This was achieved through the combined use of cross-language bootstrapping and confidence based unsupervised acoustic model training.A Spanish acoustic model was ported to Polish, through the use of a manually constructed phoneme mapping.This initial model was refined through iterative recognition and retraining of the untranscribed audio data.The system was trained and evaluated on recordings from the European Parliament, and included several state-of-the-art speech recognition techniques in addition to the use of unsupervised model training.Confidence based speaker adaptive training using features space transform adaptation, as well as vocal tract length normalization and maximum likelihood linear regression, was used to refine the acoustic model.Through the combination of the different techniques, good performance was achieved on the domain of parliamentary speeches.
| Year | Citations | |
|---|---|---|
Page 1
Page 1