Publication | Closed Access
Spectral matching based voice activity detector for improved speaker recognition
22
Citations
4
References
2014
Year
Unknown Venue
EngineeringMachine LearningBiometricsFeature ExtractionSpeech RecognitionData SciencePattern RecognitionPhoneticsSpeaker DiarizationRobust Speech RecognitionVoice RecognitionHealth SciencesSpectral MatchingSilence SegmentsComputer ScienceDeep LearningDistant Speech RecognitionSignal ProcessingSpeech CommunicationVoiceMulti-speaker Speech RecognitionSpeech ProcessingVoice Activity DetectorSpeech PerceptionSpeaker Recognition
For spoken language processing applications like speaker recognition/verification, not only that the silence segments do not contribute any speaker specific information, but also it dilutes the already available information content in the speech segments in the audio data. It has been experimentally studied that removing silence segments with the help of a voice activity detector(VAD) from the utterance before feature extraction enhances the performance of speaker recognition systems. Empirical algorithms using signal energy and spectral centroid(ESC) is one of the most popular approaches to VAD. In this paper, we show that using spectral matching (SM) to distinguish between silence and speech segments for VAD outperforms the VAD using ESC. We use a neural network with TempoRAl PatternS (TRAPS) of critical band energies as its input for improved performance. We evaluate the performance of VADs using a speaker recognition system developed for 20 speakers.
| Year | Citations | |
|---|---|---|
Page 1
Page 1