Publication | Open Access
End-to-end Continuous Speech Recognition using Attention-based Recurrent NN: First Results
415
Citations
21
References
2014
Year
EngineeringMachine LearningSpoken Language ProcessingRecurrent Neural NetworkSpeech RecognitionNatural Language ProcessingHidden Markov ModelAttention MechanismVoice RecognitionLanguage StudiesReal-time LanguageTimit DatasetComputer ScienceDeep LearningAttention-based Recurrent NnSpeech CommunicationMulti-speaker Speech RecognitionSpeech ProcessingSpeech InputLinguistics
We replace the Hidden Markov Model (HMM) which is traditionally used in in continuous speech recognition with a bi-directional recurrent neural network encoder coupled to a recurrent neural network decoder that directly emits a stream of phonemes. The alignment between the input and output sequences is established using an attention mechanism: the decoder emits each symbol based on a context created with a subset of input symbols elected by the attention mechanism. We report initial results demonstrating that this new approach achieves phoneme error rates that are comparable to the state-of-the-art HMM-based decoders, on the TIMIT dataset.
| Year | Citations | |
|---|---|---|
1998 | 56.5K | |
2014 | 14.6K | |
2012 | 10.2K | |
1997 | 9.6K | |
2013 | 8.7K | |
2012 | 6.6K | |
2012 | 5.5K | |
2006 | 5.3K | |
2024 | 4.9K | |
2012 | 3.8K |
Page 1
Page 1