Publication | Open Access
Acoustic Modeling With Hierarchical Reservoirs
69
Citations
51
References
2013
Year
AeroacousticsEngineeringMachine LearningAcoustic ModelAccurate Acoustic ModelingRecurrent Neural NetworkAcoustic ModelingSpeech RecognitionData SciencePhoneticsRobust Speech RecognitionHierarchical ReservoirsHealth SciencesComputer ScienceDeep LearningDistant Speech RecognitionSpeech CommunicationMulti-speaker Speech RecognitionSpeech AcousticsSpeech ProcessingSpeech InputMultiscale Modeling
Accurate acoustic modeling is an essential requirement of a state-of-the-art continuous speech recognizer. The Acoustic Model (AM) describes the relation between the observed speech signal and the non-observable sequence of phonetic units uttered by the speaker. Nowadays, most recognizers use Hidden Markov Models (HMMs) in combination with Gaussian Mixture Models (GMMs) to model the acoustics, but neural-based architectures are on the rise again. In this work, the recently introduced Reservoir Computing (RC) paradigm is used for acoustic modeling. A reservoir is a fixed - and thus non-trained - Recurrent Neural Network (RNN) that is combined with a trained linear model. This approach combines the ability of an RNN to model the recent past of the input sequence with a simple and reliable training procedure. It is shown here that simple reservoir-based AMs achieve reasonable phone recognition and that deep hierarchical and bi-directional reservoir architectures lead to a very competitive Phone Error Rate (PER) of 23.1% on the well-known TIMIT task.
| Year | Citations | |
|---|---|---|
Page 1
Page 1