Concepedia

Publication | Open Access

End-to-end Continuous Speech Recognition using Attention-based Recurrent NN: First Results

415

Citations

21

References

2014

Year

Abstract

We replace the Hidden Markov Model (HMM) which is traditionally used in in continuous speech recognition with a bi-directional recurrent neural network encoder coupled to a recurrent neural network decoder that directly emits a stream of phonemes. The alignment between the input and output sequences is established using an attention mechanism: the decoder emits each symbol based on a context created with a subset of input symbols elected by the attention mechanism. We report initial results demonstrating that this new approach achieves phoneme error rates that are comparable to the state-of-the-art HMM-based decoders, on the TIMIT dataset.

References

YearCitations

1998

56.5K

2014

14.6K

2012

10.2K

1997

9.6K

2013

8.7K

2012

6.6K

2012

5.5K

2006

5.3K

2024

4.9K

2012

3.8K

Page 1