Publication | Closed Access
Stream-based speaker segmentation using speaker factors and eigenvoices
87
Citations
11
References
2008
Year
EngineeringMachine LearningCorpus LinguisticsSpeech RecognitionNatural Language ProcessingNist SreSegmentation TaskPattern RecognitionSpeaker DiarizationRobust Speech RecognitionStream-based Speaker SegmentationHealth SciencesSignal ProcessingSpeech CommunicationMulti-speaker Speech RecognitionSpeech ProcessingSpeech PerceptionNist 2000Speaker Recognition
This paper presents a stream-based approach for unsupervised multi-speaker conversational speech segmentation. The main idea of this work is to exploit prior knowledge about the speaker space to find a low dimensional vector of speaker factors that summarize the salient speaker characteristics. This new approach produces segmentation error rates that are better than the state of the art ones reported in our previous work on the segmentation task in the NIST 2000 Speaker Recognition Evaluation (SRE). We also show how the performance of a speaker recognition system in the core test of the 2006 NIST SRE is affected, comparing the results obtained using single speaker and automatically segmented test data.
| Year | Citations | |
|---|---|---|
Page 1
Page 1