Publication | Closed Access
Compression of acoustic features for speech recognition in network environments
54
Citations
1
References
2002
Year
Unknown Venue
EngineeringMachine LearningAcoustic ModelingSpeech RecognitionSpeech CodingData SciencePattern RecognitionRobust Speech RecognitionVoice RecognitionHealth SciencesComputer EngineeringCompression AlgorithmComputer ScienceData CompressionDistant Speech RecognitionSignal ProcessingModel CompressionSpeech CommunicationAcoustic FeaturesSpeech ProcessingSpeech InputSpeech Perception
In this paper, we describe a new compression algorithm for encoding acoustic features used in typical speech recognition systems. The proposed algorithm uses a combination of simple techniques, such as linear prediction and multi-stage vector quantization, and the current version of the algorithm encodes the acoustic features at a fixed rate of 4.0 kbit/s. The compression algorithm can be used very effectively for speech recognition in network environments, such as those employing a client-server model, or to reduce storage in general speech recognition applications. The algorithm has also been tuned for practical implementations, so that the computational complexity and memory requirements are modest. We have successfully tested the compression algorithm against many test sets from several different languages, and the algorithm performed very well, with no significant change in the recognition accuracy due to compression.
| Year | Citations | |
|---|---|---|
Page 1
Page 1