Publication | Closed Access
Computing Mel-frequency cepstral coefficients on the power spectrum
237
Citations
6
References
2002
Year
Unknown Venue
EngineeringSpeech CorpusSpectrum EstimationCorpus LinguisticsPower SpectrumSpeech RecognitionCosmologyPhoneticsComputational LinguisticsRobust Speech RecognitionVoice RecognitionLanguage StudiesApproximation TheorySpeech SignalSpeech AnalysisSpeech CommunicationSpectral AnalysisSpectral SearchingSpeech ProcessingMel-frequency Cepstral CoefficientsSpeech PerceptionLinguistics
We present a method to derive Mel-frequency cepstral coefficients directly from the power spectrum of a speech signal. We show that omitting the filterbank in signal analysis does not affect the word error rate. The presented approach simplifies the speech recognizers front end by merging subsequent signal analysis steps into a single one. It avoids possible interpolation and discretization problems and results in a compact implementation. We show that frequency warping schemes like vocal tract normalization can be integrated easily in our concept without additional computational efforts. Recognition test results obtained with the RWTH large vocabulary speech recognition system are presented for two different corpora: The German VerbMobil II dev99 corpus, and the English North American Business News 94 20k development corpus.
| Year | Citations | |
|---|---|---|
Page 1
Page 1