Publication | Open Access
Fast Variational Bayes for Heavy-tailed PLDA Applied to i-vectors and x-vectors
14
Citations
4
References
2018
Year
Unknown Venue
Heavy-tailed Plda AppliedEngineeringMachine LearningGaussian PldaCorpus LinguisticsHeavy-tailed PldaSpeech RecognitionNatural Language ProcessingStatistical Signal ProcessingLength NormalizationData ScienceComputational LinguisticsSpeaker DiarizationRobust Speech RecognitionMultilinear Subspace LearningLanguage StudiesPrincipal Component AnalysisEstimation TheoryStatisticsMachine TranslationDeep LearningFast Variational BayesSpeech CommunicationHigh-dimensional MethodMulti-speaker Speech RecognitionSpeech ProcessingStatistical InferenceSpeech InputLinguisticsSpeaker Recognition
The standard state-of-the-art backend for text-independent speaker recognizers that use i-vectors or x-vectors, is Gaussian PLDA (G-PLDA), assisted by a Gaussianization step involving length normalization. G-PLDA can be trained with both generative or discriminative methods. It has long been known that heavy-tailed PLDA (HT-PLDA), applied without length normalization, gives similar accuracy, but at considerable extra computational cost. We have recently introduced a fast scoring algorithm for a discriminatively trained HT-PLDA backend. This paper extends that work by introducing a fast, variational Bayes, generative training algorithm. We compare old and new backends, with and without length-normalization, with i-vectors and x-vectors, on SRE'10, SRE'16 and SITW.
| Year | Citations | |
|---|---|---|
Page 1
Page 1