Publication | Closed Access
Energy onset times for speaker identification
12
Citations
8
References
1994
Year
MusicHigh-resolution Teager OperatorEngineeringElectroglottographyResonant Energy PulsesAcoustic ModelingSpeech RecognitionSpeaker IdentificationSpeaker DiarizationAudio AnalysisAcoustic Signal ProcessingAcoustic AnalysisSpeech Signal AnalysisHealth SciencesSignal ProcessingSpeech CommunicationOnset TimesSpeech AcousticsSpeech ProcessingSpeech PerceptionSpeaker Recognition
Onset times of resonant energy pulses are measured with the high-resolution Teager operator and used as features in the Reynolds Gaussian-mixture speaker identification algorithm. Feature sets are constructed with primary pitch and secondary pulse locations derived from low and high speech formants. Preliminary testing was performed with a confusable 40-speaker subset from the NTIMIT (telephone channel) database. Speaker identification improved from 55 to 70% correct classification when the full set of new resonant energy-based features were added as an independent stream to conventional mel-cepstra.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">></ETX>
| Year | Citations | |
|---|---|---|
Page 1
Page 1