Publication | Closed Access
ResMax: Detecting Voice Spoofing Attacks with Residual Network and Max Feature Map
16
Citations
19
References
2021
Year
Unknown Venue
EngineeringMachine LearningEnsemble ApproachDetecting VoiceAsvspoof 2019Speech RecognitionMax Feature MapAdversarial Machine LearningRobust Speech RecognitionVoice RecognitionResidual NetworkHealth SciencesSkip ConnectionDefense SystemsSpeech SynthesisComputer ScienceDeep LearningVoiceMulti-speaker Speech RecognitionSpeech ProcessingVoice TechnologySpeaker Recognition
The “2019 Automatic Speaker Verification Spoofing And Countermeasures Challenge” (ASVspoof) competition aimed to facilitate the design of highly accurate voice spoofing attack detection systems. the competition did not emphasize model complexity and latency requirements; such constraints are strict and integral in real-world deployment. Hence, most of the top performing solutions from the competition all used an ensemble approach, and combined multiple complex deep learning models to maximize detection accuracy - this kind of approach would sit uneasily with real-world deployment constraints. To design a lightweight system, we combined the notions of skip connection (from ResNet) and max feature map (from Light CNN), and evaluated the accuracy of the system using the ASVspoof 2019 dataset. With an optimized constant Q transform (CQT) feature, our single model achieved a replay attack detection equal error rate (EER) of 0.37% on the evaluation set, surpassing the top ensemble system from the competition that achieved an EER of 0.39%.
| Year | Citations | |
|---|---|---|
Page 1
Page 1