Publication | Closed Access
Speaker change point detection using deep neural nets
28
Citations
7
References
2015
Year
Unknown Venue
Deep Neural NetsSpeaker Change PointsEngineeringMachine LearningSpeech RecognitionPattern RecognitionSpeaker DiarizationRobust Speech RecognitionVoice RecognitionMachine TranslationHealth SciencesMachine VisionSpeaker Change PointDeep LearningSpeech CommunicationMulti-speaker Speech RecognitionSpeech ProcessingSpeech PerceptionLinguisticsSpeaker Recognition
We investigate the use of deep neural nets (DNN) to provide initial speaker change points in a speaker diarization system. The DNN trains states that correspond to the location of the speaker change point (SCP) in the speech segment input to the DNN. We model these different speaker change point locations in the DNN input by 10 to 20 states. The confidence in the SCP is measured by the number of frame synchronous states that correspond to the hypothesized speaker change point. We only keep the speaker change points with the highest confidence. We show that this DNN-based change point detector reduces the number of missed change points for both an English test set and a French dev set. We also show that the DNN-based change points reduce the diarization error rate for both an English and a French diarization system. These results show the feasibility of DNNs to provide initial speaker change points.
| Year | Citations | |
|---|---|---|
Page 1
Page 1