Publication | Closed Access
Experimental evaluation of features for robust speaker identification
307
Citations
12
References
1994
Year
EngineeringRobust Speaker IdentificationBiometricsSpeech RecognitionPattern RecognitionSpeaker IdentificationSpeaker DiarizationRobust Speech RecognitionChannel Compensation TechniquesAutomatic RecognitionAcoustic AnalysisSpeech Signal AnalysisHealth SciencesGaussian Mixture DensitiesSignal ProcessingSpeech CommunicationExperimental EvaluationMulti-speaker Speech RecognitionSpeech AcousticsSpeech ProcessingSpeech PerceptionSpeaker Recognition
This correspondence presents an experimental evaluation of different features and channel compensation techniques for robust speaker identification. The goal is to keep all processing and classification steps constant and to vary only the features and compensations used to allow a controlled comparison. A general, maximum-likelihood classifier based on Gaussian mixture densities is used as the classifier, and experiments are conducted on the King speech database, a conversational, telephone-speech database. The features examined are mel-frequency and linear-frequency filterbank cepstral coefficients, linear prediction cepstral coefficients, and perceptual linear prediction (PLP) cepstral coefficients. The channel compensation techniques examined are cepstral mean removal, RASTA processing, and a quadratic trend removal technique. It is shown for this database that performance differences between the basic features is small, and the major gains are due to the channel compensation techniques. The best "across-the-divide" recognition accuracy of 92% is obtained for both high-order LPC features and band-limited filterbank features.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">></ETX>
| Year | Citations | |
|---|---|---|
Page 1
Page 1