Publication | Closed Access
Accurate and compact large vocabulary speech recognition on mobile devices
126
Citations
15
References
2013
Year
Unknown Venue
EngineeringMachine LearningMobile DevicesSpeech RecognitionPhoneticsRobust Speech RecognitionVoice RecognitionReal-time LanguageHealth SciencesComputer EngineeringMobile ComputingComputer ScienceDeep LearningNexus 4Distant Speech RecognitionModel CompressionSpeech CommunicationSpeech TechnologyDeep Neural NetworksSpeech ProcessingSpeech InputSpeech Perception
In this paper we describe the development of an accurate, smallfootprint, large vocabulary speech recognizer for mobile devices. To achieve the best recognition accuracy, state-of-the-art deep neural networks (DNNs) are adopted as acoustic models. A variety of speedup techniques for DNN score computation are used to enable real-time operation on mobile devices. To reduce the memory and disk usage, on-the-fly language model (LM) rescoring is performed with a compressed n-gram LM. We were able to build an accurate and compact system that runs well below real-time on a Nexus 4 Android phone. Index Terms: Deep neural networks, embedded speech recognition, SIMD, LM compression.
| Year | Citations | |
|---|---|---|
Page 1
Page 1