Publication | Closed Access
Phone sequence modeling with recurrent neural networks
13
Citations
17
References
2014
Year
Unknown Venue
Phone AccuracyEngineeringSpoken Language ProcessingRecurrent Neural NetworkSpeech RecognitionNatural Language ProcessingComputational LinguisticsPhoneticsRecurrent Neural NetworksRobust Speech RecognitionVoice RecognitionLanguage StudiesReal-time LanguageMachine TranslationSequence ModellingComputer ScienceDeep LearningSpeech CommunicationPhone SequenceSpeech ProcessingSpeech InputSpeech PerceptionLinguistics
In this paper, we investigate phone sequence modeling with recurrent neural networks in the context of speech recognition. We introduce a hybrid architecture that combines a phonetic model with an arbitrary frame-level acoustic model and we propose efficient algorithms for training, decoding and sequence alignment. We evaluate the advantage of our phonetic model on the TIMIT and Switchboard-mini datasets in complementarity to a powerful context-dependent deep neural network (DNN) acoustic classifier and a higher-level 3-gram language model. Consistent improvements of 2-10% in phone accuracy and 3% in word error rate suggest that our approach can readily replace HMMs in current state-of-the-art systems.
| Year | Citations | |
|---|---|---|
Page 1
Page 1