Publication | Closed Access
Quantization of cepstral parameters for speech recognition over the World Wide Web
13
Citations
9
References
2002
Year
Unknown Venue
Alternative ArchitecturesEngineeringCepstral ParametersBrowser-based ComputingSpeech RecognitionSpeech CodingData ScienceRobust Speech RecognitionVoice RecognitionHealth SciencesComputer EngineeringComputer ScienceMobile ComputingSpeech SignalSignal ProcessingSpeech CommunicationSpeech TechnologySpeech ProcessingSpeech InputSpeech Perception
We examine alternative architectures for a client-server model of speech-enabled applications over the World Wide Web. We compare a server-only processing model, where the client encodes and transmits the speech signal to the server, to a model where the recognition front end, implemented as a Java applet runs locally at the client and encodes and transmits the cepstral coefficients to the recognition server over the Internet. We follow a novel encoding paradigm, trying to maximize the recognition performance instead of perceptual reproduction, and we find that by transmitting the cepstral coefficients we can achieve significantly higher recognition performance at a fraction of the bit rate required when encoding the speech signal directly.
| Year | Citations | |
|---|---|---|
Page 1
Page 1