Publication | Closed Access
Deep neural network features and semi-supervised training for low resource speech recognition
137
Citations
34
References
2013
Year
Unknown Venue
Semi-supervised TrainingEngineeringMachine LearningLow-resource Language ProcessingSpeech RecognitionNatural Language ProcessingData ScienceRobust Speech RecognitionVoice RecognitionNew TechniqueHealth SciencesComputer ScienceDeep LearningDistant Speech RecognitionSpeech CommunicationDeep Neural NetworksMulti-speaker Speech RecognitionSpeech ProcessingSpeech InputSpeech Perception
We propose a new technique for training deep neural networks (DNNs) as data-driven feature front-ends for large vocabulary continuous speech recognition (LVCSR) in low resource settings. To circumvent the lack of sufficient training data for acoustic modeling in these scenarios, we use transcribed multilingual data and semi-supervised training to build the proposed feature front-ends. In our experiments, the proposed features provide an absolute improvement of 16% in a low-resource LVCSR setting with only one hour of in-domain training data. While close to three-fourths of these gains come from DNN-based features, the remaining are from semi-supervised training.
| Year | Citations | |
|---|---|---|
Page 1
Page 1