Publication | Closed Access
Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition: A comparison of current training strategies
26
Citations
29
References
2020
Year
EngineeringMachine LearningDistributed AlgorithmsDistributed Ai SystemAsr Acoustic ModelingSpeech RecognitionData ScienceRobust Speech RecognitionParallel ComputingComputer ScienceDistributed LearningDeep LearningDeep Neural NetworkCurrent Training StrategiesDistant Speech RecognitionSpeech CommunicationAutomatic Speech RecognitionParallel LearningSpeech ProcessingParallel ProgrammingSpeech Input
The past decade has witnessed great progress in automatic speech recognition (ASR) due to advances in deep learning. The improvements in performance can be attributed to both improved models and large-scale training data. The key to training such models is the employment of efficient distributed learning techniques. In this article, we provide an overview of distributed training techniques for deep neural network (DNN) acoustic models used for ASR. Starting with the fundamentals of data parallel stochastic gradient descent (SGD) and ASR acoustic modeling, we investigate various distributed training strategies and their realizations in high-performance computing (HPC) environments with an emphasis on striking a balance between communication and computation. Experiments are carried out on a popular public benchmark to study the convergence, speedup, and recognition performance of the investigated strategies.
| Year | Citations | |
|---|---|---|
Page 1
Page 1