Publication | Closed Access
On the use of autocorrelation analysis for pitch detection
515
Citations
6
References
1977
Year
MusicEngineeringSpeech EnhancementAutocorrelation FunctionPhonologyAcoustic ModelingSpeech RecognitionAutocorrelation Pitch AnalysisSpeech CodingPhoneticsAudio AnalysisRobust Speech RecognitionVoice RecognitionAcoustic Signal ProcessingHealth SciencesSignal ProcessingSpeech CommunicationAutocorrelation AnalysisSpeech ProcessingSpeech Perception
Autocorrelation analysis is a long‑standing pitch detection technique, and most systems preprocess speech to whiten the signal and remove vocal‑tract effects from the autocorrelation function. This paper presents results on nonlinear preprocessing methods that spectrally flatten speech for autocorrelation pitch detection. The methods adjust quantizer threshold levels to implement ordinary or center‑clipping autocorrelation, were evaluated on utterances from a recent pitch‑detector comparison study, and include an algorithm for adaptively selecting frame size. The results show the extent of spectrum flattening achieved by these methods and provide comparison outcomes with the Rabiner et al.
One of the most time honored methods of detecting pitch is to use some type of autocorrelation analysis on speech which has been appropriately preprocessed. The goal of the speech preprocessing in most systems is to whiten, or spectrally flatten, the signal so as to eliminate the effects of the vocal tract spectrum on the detailed shape of the resulting autocorrelation function. The purpose of this paper is to present some results on several types of (nonlinear) preprocessing which can be used to effectively spectrally flatten the speech signal The types of nonlinearities which are considered are classified by a non-linear input-output quantizer characteristic. By appropriate adjustment of the quantizer threshold levels, both the ordinary (linear) autocorrelation analysis, and the center clipping-peak clipping autocorrelation of Dubnowski et al. [1] can be obtained. Results are presented to demonstrate the degree of spectrum flattening obtained using these methods. Each of the proposed methods was tested on several of the utterances used in a recent pitch detector comparison study by Rabiner et al. [2] Results of this comparison are included in this paper. One final topic which is discussed in this paper is an algorithm for adaptively choosing a frame size for an autocorrelation pitch analysis.
| Year | Citations | |
|---|---|---|
Page 1
Page 1