Barlow Twins self-supervised learning for robust speaker recognition - Avignon Université Accéder directement au contenu
Communication Dans Un Congrès Année : 2022

Barlow Twins self-supervised learning for robust speaker recognition

Résumé

Acoustic noise is a big challenge for speaker recognition systems. The state-of-the-art speaker recognition systems are based on deep neural network speaker embeddings called xvector extractor. A noise-robust x-vector extractor is highly demanded in speaker recognition systems. In this paper, we introduce Barlow Twins self-supervised loss function in the area of speaker recognition. Barlow Twins objective function tries to optimize two criteria: Firstly, it increases the similarity between two versions of the same signal (i.e. the clean and its augmented noisy version) to make the speaker embedding invariant to the acoustic noise. Secondly, it reduces the redundancy between dimensions of the x-vectors that improves the overall quality of speaker embeddings. In our research, Barlow Twins objective function is integrated with the ResNet-based speaker embedding system. In the proposed system, the Barlow Twins objective function is calculated in the embedding layer and it is optimized jointly with the speaker classifier loss function. The experimental results on Fabiole corpus show 22 % relative gain in terms of EER in the clean environments and 18% improvement in the presence of noise with low SNR and reverberation.
Fichier principal
Vignette du fichier
BT_SR_INTERSPEECH_Mohammadamini.pdf (170.84 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03710445 , version 1 (30-06-2022)
hal-03710445 , version 2 (01-07-2022)

Identifiants

Citer

Mohammad Mohammadamini, Driss Matrouf, Jean-François A Bonastre, Sandipana Dowerah, Romain Serizel, et al.. Barlow Twins self-supervised learning for robust speaker recognition. Interspeech 2022 - Human and Humanizing Speech Technology, Sep 2022, Incheon, South Korea. ⟨10.21437/Interspeech.2022-11301⟩. ⟨hal-03710445v2⟩
535 Consultations
426 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More