Publication | Open Access
Clova Baseline System for the VoxCeleb Speaker Recognition Challenge 2020
96
Citations
17
References
2020
Year
EngineeringMachine LearningInterspeech 2020Speaker Recognition ModelsSpeech RecognitionData SciencePattern RecognitionPopular Resnet ArchitectureSpeaker DiarizationRobust Speech RecognitionVoice RecognitionHealth SciencesComputer ScienceClova Baseline SystemDistant Speech RecognitionSpeech CommunicationVoiceMulti-speaker Speech RecognitionSpeech AcousticsSpeech ProcessingSpeech PerceptionSpeaker Recognition
This report describes our submission to the VoxCeleb Speaker Recognition Challenge (VoxSRC) at Interspeech 2020. We perform a careful analysis of speaker recognition models based on the popular ResNet architecture, and train a number of variants using a range of loss functions. Our results show significant improvements over most existing works without the use of model ensemble or post-processing. We release the training code and pre-trained models as unofficial baselines for this year's challenge.
| Year | Citations | |
|---|---|---|
Page 1
Page 1