Publication | Closed Access
Modeling the identification of concurrent vowels with different fundamental frequencies
209
Citations
29
References
1992
Year
Human ListenersPhonologyAcoustic ModelingSpeech RecognitionPhoneticsAudio AnalysisLanguage StudiesHealth SciencesAuditory ModelingSimultaneous VowelsSignal ProcessingSpeech CommunicationSpeech TechnologyTimbre InformationPhonology MorphologySpeech AcousticsLanguage RecognitionSpeech ProcessingConcurrent VowelsSpeech PerceptionLinguistics
Human listeners are better able to identify two simultaneous vowels if the fundamental frequencies of the vowels are different. A computational model is presented which, for the first time, is able to simulate this phenomenon at least qualitatively. The first stage of the model is based upon a bank of bandpass filters and inner hair-cell simulators that simulate approximately the most relevant characteristics of the human auditory periphery. The output of each filter/hair-cell channel is then autocorrelated to extract pitch and timbre information. The pooled autocorrelation function (ACF) based on all channels is used to derive a pitch estimate for one of the component vowels from a signal composed of two vowels. Individual channel ACFs showing a pitch peak at this value are combined and used to identify the first vowel using a template matching procedure. The ACFs in the remaining channels are then combined and used to identify the second vowel. Model recognition performance shows a rapid improvement in correct vowel identification as the difference between the fundamental frequencies of two simultaneous vowels increases from zero to one semitone in a manner closely resembling human performance. As this difference increases up to four semitones, performance improves further only slowly, if at all.
| Year | Citations | |
|---|---|---|
Page 1
Page 1