Publication | Closed Access
GMM and CNN Hybrid Method for Short Utterance Speaker Recognition
Citations: 164 | References: 30 | Year: 2018
Engineering, Machine Learning, Cnn Hybrid Method, Speech Recognition, Pattern Recognition, Speaker Diarization, Robust Speech Recognition, Voice Recognition, Health Sciences, Speaker Recognition Technique, Computer Science, Deep Learning, Speech Communication, High Accuracy, Multi-speaker Speech Recognition, Gaussian Mixture Model, Speech Processing, Speech Input, Speech Perception, Speaker Recognition
In recent years, speaker recognition has attracted wide attention because of its applications in many fields, such as speech communications, domestic services, and smart terminals. The Gaussian mixture model (GMM), a key method in this area, achieves recognition capability close to that of human hearing on long speech. However, the GMM fails to recognize speakers from short utterances with high accuracy. To address this problem, we propose a novel model that improves the accuracy of short-utterance speaker recognition. Unlike traditional GMM-based models, our method trains a convolutional neural network (CNN) to process spectrograms, which describe speakers better. As a result, the recognition system achieves considerable accuracy with a reasonable convergence speed. Experimental results show that our model reduces the equal error rate (EER) of recognition from 4.9% to 2.5%.
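For readers unfamiliar with the metric quoted above, the equal error rate is the operating point at which the false acceptance rate (impostors accepted) equals the false rejection rate (genuine speakers rejected). The following is a minimal sketch of one common way to estimate it from verification scores, not the authors' evaluation code. The function name `equal_error_rate`, the convention that a higher score means "more likely the same speaker", and the toy scores are all assumptions made for illustration.

```python
def equal_error_rate(genuine_scores, impostor_scores):
    """Estimate the EER by sweeping every observed score as a threshold.

    Assumes a higher score means the trial is more likely a genuine
    (same-speaker) trial. Returns (eer, threshold).
    """
    thresholds = sorted(set(genuine_scores) | set(impostor_scores))
    best = None
    for t in thresholds:
        # False rejection: a genuine trial scored below the threshold.
        frr = sum(s < t for s in genuine_scores) / len(genuine_scores)
        # False acceptance: an impostor trial scored at or above the threshold.
        far = sum(s >= t for s in impostor_scores) / len(impostor_scores)
        gap = abs(far - frr)
        # Keep the threshold where the two error rates are closest.
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2, t)
    return best[1], best[2]


# Toy scores (hypothetical, not from the paper).
genuine = [0.9, 0.8, 0.7, 0.4]
impostor = [0.6, 0.3, 0.2, 0.1]
eer, threshold = equal_error_rate(genuine, impostor)
print(f"EER = {eer:.2%} at threshold {threshold}")  # EER = 25.00% at threshold 0.6
```

With real data the two error rates rarely cross exactly at an observed score, so this sketch reports the average of the two rates at the closest threshold; evaluation toolkits often interpolate along the ROC curve instead.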