Publication | Closed Access
Convolutional Recurrent Neural Networks for Small-Footprint Keyword Spotting
145
Citations
15
References
2017
Year
Unknown Venue
EngineeringMachine LearningSpeech RecognitionNatural Language ProcessingData ScienceRobust Speech RecognitionVoice RecognitionKeyword SpottingReal-time LanguageRecurrent LayersMachine VisionComputer EngineeringComputer ScienceDeep LearningDistant Speech RecognitionSpeech CommunicationSmall-footprint Keyword SpottingSpeech ProcessingSpeech InputConvolutional Layers
Keyword spotting (KWS) constitutes a major component of human-technology interfaces.Maximizing the detection accuracy at a low false alarm (FA) rate, while minimizing the footprint size, latency and complexity are the goals for KWS.Towards achieving them, we study Convolutional Recurrent Neural Networks (CRNNs).Inspired by large-scale state-ofthe-art speech recognition systems, we combine the strengths of convolutional layers and recurrent layers to exploit local structure and long-range context.We analyze the effect of architecture parameters, and propose training strategies to improve performance.With only ~230k parameters, our CRNN model yields acceptably low latency, and achieves 97.71% accuracy at 0.5 FA/hour for 5 dB signal-to-noise ratio.
| Year | Citations | |
|---|---|---|
Page 1
Page 1