Publication | Closed Access
PYIN: A fundamental frequency estimator using probabilistic threshold distributions
377
Citations
11
References
2014
Year
Unknown Venue
Fundamental FrequencyEngineeringMachine LearningSpectrum EstimationAcoustic ModelingStatistical Signal ProcessingAudio AnalysisAudio Signal AnalysisTimefrequency AnalysisStatisticsSpeech Signal AnalysisHealth SciencesProbabilistic YinFundamental Frequency EstimatorComputer ScienceSignal ProcessingSpeech AcousticsSpeech ProcessingConventional Yin
We propose the Probabilistic YIN (PYIN) algorithm, a modification of the well-known YIN algorithm for fundamental frequency (F0) estimation. Conventional YIN is a simple yet effective method for frame-wise monophonic F0 estimation and remains one of the most popular methods in this domain. In order to eliminate short-term errors, outputs of frequency estimators are usually post-processed resulting in a smoother pitch track. One shortcoming of YIN is that such post-processing cannot fall back on alternative interpretations of the signal because the method outputs precisely one estimate per frame. To address this problem we modify YIN to output multiple pitch candidates with associated probabilities (PYIN Stage 1). These probabilities arise naturally from a prior distribution on the YIN threshold parameter. We use these probabilities as observations in a hidden Markov model, which is Viterbi-decoded to produce an improved pitch track (PYIN Stage 2). We demonstrate that the combination of Stages 1 and 2 raises recall and precision substantially. The additional computational complexity of PYIN over YIN is low. We make the method freely available online <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">1</sup> as an open source C++ library for Vamp hosts.
| Year | Citations | |
|---|---|---|
Page 1
Page 1