Publication | Closed Access
A time delay neural network architecture for efficient modeling of long temporal contexts
994
Citations
29
References
2015
Year
Unknown Venue
EngineeringMachine LearningLearning AlgorithmSpoken Language ProcessingWider Temporal DependenciesRecurrent Neural NetworkSpeech RecognitionTdnn ArchitectureData ScienceReal-time LanguageSequence ModellingComputer EngineeringLong Temporal ContextsTemporal Pattern RecognitionComputer ScienceDeep LearningNeural Architecture SearchEfficient ModelingSpeech ProcessingTemporal Network
Recurrent neural network architectures have been shown to efficiently model long term temporal dependencies between acoustic events. However the training time of recurrent networks is higher than feedforward networks due to the sequential nature of the learning algorithm. In this paper we propose a time delay neural network architecture which models long term temporal dependencies with training times comparable to standard feed-forward DNNs. The network uses sub-sampling to reduce computation during training. On the Switchboard task we show a relative improvement of 6% over the baseline DNN model. We present results on several LVCSR tasks with training data ranging from 3 to 1800 hours to show the effectiveness of the TDNN architecture in learning wider temporal dependencies in both small and large data scenarios.
| Year | Citations | |
|---|---|---|
Page 1
Page 1