Concepedia

Publication | Closed Access

MLP-SVNET: A Multi-Layer Perceptrons Based Network for Speaker Verification

10

Citations

18

References

2022

Year

Abstract

Convolution and self-attention based neural networks have both obtained excellent performance in automatic speaker verification. However, the convolution model often lacks the ability of long-term dependency modeling due to the limitation of receptive field, while the self-attention model is insufficient to model local information. To tackle this limitation, we propose a new multi-layer perceptrons based speaker verification network (MLP-SVNet) which can apply MLPs across temporal and frequency dimensions to capture the local and global information at the same time. The experimental results conducted on Voxceleb show that the proposed model is very competitive when compared to other systems based on convolution or self-attention. In addition, we demonstrate that MLP-SVNet based on multi-layer per-ceptrons can produce complementary embeddings, which can be fused with the state-of-the-art system to further improve the performance.

References

YearCitations

2023

73.5K

2018

26.8K

2020

21.2K

2010

3.6K

2019

3.2K

2016

3.1K

2018

2.6K

2018

2.2K

2021

1.4K

2020

1.3K

Page 1