Publication | Closed Access
Discriminative importance weighting of augmented training data for acoustic model training
12
Citations
28
References
2017
Year
Unknown Venue
EngineeringMachine LearningSpeech EnhancementAcoustic ModelingAcoustic Model TrainingSpeech RecognitionData SciencePattern RecognitionDiscriminative Importance WeightingAudio AnalysisRobust Speech RecognitionSupervised LearningHealth SciencesAugmentation TechniquesData AugmentationAugmented Training DataAudio RetrievalComputer ScienceDeep LearningDistant Speech RecognitionAcoustic ModelsSpeech ProcessingSpeech Input
DNN based acoustic models require a large amount of training data. Parametric data augmentation techniques such as adding noise, reverberation, or changing the speech rate, are often employed to boost the dataset size and the ASR performance. The choice of augmentation techniques and the associated parameters has been handled heuristically so far. In this work we propose an algorithm to automatically weight data perturbed using a variety of augmentation techniques and/or parameters. The weights are learned in a discriminative fashion so as to minimize the frame error rate using the standard gradient descent algorithm in an iterative manner. Experiments were performed using the CHiME-3 dataset. Data augmentation was done by adding noise at different SNRs. A relative WER improvement of 15% was obtained with the proposed data weighting algorithm compared to the unweighted augmented dataset. Interestingly, the resulting distribution of SNRs in the weighted training set differs significantly from that of the test set.
| Year | Citations | |
|---|---|---|
Page 1
Page 1