Publication | Closed Access
Data augmentation for deep convolutional neural network acoustic modeling
77
Citations
14
References
2015
Year
Unknown Venue
Data AugmentationSpeech PerceptionEngineeringMachine LearningData ScienceHealth SciencesHaitian CreoleRobust Speech RecognitionSpeech ProcessingComputer ScienceVoice RecognitionMultilingual PretrainingSpeech InputDeep LearningLinguisticsLimited Language PackMachine TranslationSpeech Recognition
This paper investigates data augmentation based on label-preserving transformations for deep convolutional neural network (CNN) acoustic modeling to deal with limited training data. We show how stochastic feature mapping (SFM) can be carried out when training CNN models with log-Mel features as input and compare it with vocal tract length perturbation (VTLP). Furthermore, a two-stage data augmentation scheme with a stacked architecture is proposed to combine VTLP and SFM as complementary approaches. Improved performance has been observed in experiments conducted on the limited language pack (LLP) of Haitian Creole in the IARPA Babel program.
| Year | Citations | |
|---|---|---|
Page 1
Page 1