Publication | Closed Access
Pitch Estimation using Models of Voiced Speech on Three Levels
Citations: 15
References: 12
Year: 2007
Unknown Venue
Fundamental Frequency, Engineering, Speech Signals, Acoustic Modeling, Speech Recognition, Robust Speech Recognition, Voice Recognition, Health Sciences, Speech Synthesis, Noisy Speech, Computer Science, Distant Speech Recognition, Signal Processing, Speech Communication, Speech Technology, Voice, Pitch Estimation, Speech Processing, Speech Perception, Speaker Recognition
We present an algorithm for estimating the fundamental frequency of speech signals. Our approach incorporates models of voiced speech on three levels. First, we estimate the pitch of each time frame from its harmonic structure using non-negative matrix factorization. Second, we exploit temporal pitch continuity to extract partial pitch contours. Third, we use statistics of the succession of voiced segments to aggregate the partial contours into the final contour of an utterance. We evaluate our approach on the Keele database. The experimental results show that the method is robust for noisy speech and performs well on clean speech in comparison with state-of-the-art algorithms.
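The first two levels can be illustrated with a minimal Python sketch. The abstract does not specify the NMF formulation, the harmonic model, or the continuity rule, so everything below is an assumption made for illustration: a fixed dictionary of Gaussian-peaked harmonic combs, multiplicative updates for the activations only, and a simple jump threshold for splitting contours. The function names and parameters (`harmonic_dictionary`, `width_hz`, `max_jump_hz`, and so on) are hypothetical, and the third level (statistics of voiced-segment succession) is not covered.

```python
import numpy as np

def harmonic_dictionary(f0_candidates, freqs, n_harmonics=8, width_hz=8.0):
    """One column per candidate f0: a comb of Gaussian peaks at its harmonics.
    The 1/h amplitude decay and the peak width are illustrative assumptions."""
    W = np.zeros((len(freqs), len(f0_candidates)))
    for j, f0 in enumerate(f0_candidates):
        for h in range(1, n_harmonics + 1):
            W[:, j] += np.exp(-0.5 * ((freqs - h * f0) / width_hz) ** 2) / h
    # Normalize columns so activations are comparable across candidates.
    return W / np.linalg.norm(W, axis=0, keepdims=True)

def nmf_activations(V, W, n_iter=200, eps=1e-12):
    """NMF with a fixed dictionary W: multiplicative updates for H >= 0
    that decrease the Frobenius error ||V - W H||."""
    H = np.ones((W.shape[1], V.shape[1]))
    WtV = W.T @ V
    WtW = W.T @ W
    for _ in range(n_iter):
        H *= WtV / (WtW @ H + eps)
    return H

def frame_pitch(V, freqs, f0_candidates):
    """Level 1 (sketch): per-frame pitch is the candidate with the largest activation.
    V is a non-negative magnitude spectrogram of shape (len(freqs), n_frames)."""
    f0_candidates = np.asarray(f0_candidates, dtype=float)
    W = harmonic_dictionary(f0_candidates, freqs)
    H = nmf_activations(V, W)
    return f0_candidates[np.argmax(H, axis=0)]

def partial_contours(pitch, max_jump_hz=10.0):
    """Level 2 (sketch): split the frame-wise pitch track into partial contours,
    returned as (start, end) frame ranges with end exclusive, wherever
    consecutive frames differ by more than max_jump_hz."""
    contours, start = [], 0
    for t in range(1, len(pitch) + 1):
        if t == len(pitch) or abs(pitch[t] - pitch[t - 1]) > max_jump_hz:
            contours.append((start, t))
            start = t
    return contours
```

Here `frame_pitch` would be called on a magnitude spectrogram together with its frequency axis and a grid of candidate pitches, and the resulting track passed to `partial_contours`. Keeping the dictionary fixed and updating only the activations is a simplification chosen for brevity; a full NMF would also adapt the basis spectra.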