Publication | Closed Access
Signal modeling techniques in speech recognition
763
Citations
67
References
1993
Year
EngineeringMachine LearningSpeech RecognitionData ScienceRobust Speech RecognitionAudio Signal AnalysisAutomatic RecognitionAcoustic AnalysisSpeech Signal AnalysisHealth SciencesSignal Processing ComponentsComputer ScienceDistant Speech RecognitionSignal ProcessingSpeech CommunicationSpeech TechnologyVoiceSpeech AcousticsSpectral AnalysisSpeech ProcessingSpeech InputSpeech Perception
A tutorial on signal processing in state-of-the-art speech recognition systems is presented, reviewing those techniques most commonly used. The four basic operations of signal modeling, i.e. spectral shaping, spectral analysis, parametric transformation, and statistical modeling, are discussed. Three important trends that have developed in the last five years in speech recognition are examined. First, heterogeneous parameter sets that mix absolute spectral information with dynamic, or time-derivative, spectral information, have become common. Second, similarity transform techniques, often used to normalize and decorrelate parameters in some computationally inexpensive way, have become popular. Third, the signal parameter estimation problem has merged with the speech recognition process so that more sophisticated statistical models of the signal's spectrum can be estimated in a closed-loop manner. The signal processing components of these algorithms are reviewed.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">></ETX>
| Year | Citations | |
|---|---|---|
Page 1
Page 1