Anonymous speaker clusters: Making distinctions between anonymised speech recordings with clustering interface - Laboratoire Informatique d'Avignon
Conference paper. Year: 2021

Anonymous speaker clusters: Making distinctions between anonymised speech recordings with clustering interface

Abstract

Our study examined the performance of evaluators tasked with grouping natural and anonymised speech recordings into clusters based on their perceived similarities. Speech stimuli were selected from the VCTK corpus; two systems developed for the VoicePrivacy 2020 Challenge were used for anonymisation. The Baseline-1 (B1) system was built on x-vectors and neural waveform models, while the Baseline-2 (B2) system relied on digital-signal-processing techniques. Seventy-four evaluators completed three trials composed of 16 recordings with either natural speech or anonymised speech generated from a single system. F-measure and cluster purity metrics were used to assess evaluator accuracy. Probabilistic linear discriminant analysis (PLDA) scores from an automatic speaker verification system were generated to quantify similarity between recordings and were correlated with the subjective results. Our findings showed that non-native English-speaking evaluators had significantly lower mean F-measure scores when presented with anonymised recordings. We observed no significant effect on cluster purity. Pearson correlation analyses revealed that PLDA scores generated from natural and B2-anonymised speech recordings correlated positively with the F-measure and cluster purity metrics. These findings show that evaluators were able to use the interface to cluster natural and anonymised speech recordings, and suggest that anonymisation systems modelled like B1 are more effective at suppressing identifiable speech characteristics.
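
The abstract names F-measure and cluster purity as accuracy metrics but does not define them. The sketch below is an illustration only: it assumes the common textbook definitions (purity as the share of recordings that match the majority speaker of their cluster, and a pairwise F-measure over all recording pairs), which may differ from the exact formulation used in the paper. The function names and the toy labels are hypothetical.

```python
from collections import Counter
from itertools import combinations


def cluster_purity(true_speakers, cluster_ids):
    """Share of recordings that match the majority speaker of their cluster."""
    clusters = {}
    for speaker, cluster in zip(true_speakers, cluster_ids):
        clusters.setdefault(cluster, []).append(speaker)
    # For each cluster, count the recordings of its most frequent speaker.
    majority_total = sum(
        Counter(members).most_common(1)[0][1] for members in clusters.values()
    )
    return majority_total / len(true_speakers)


def pairwise_f_measure(true_speakers, cluster_ids):
    """Pairwise F-measure (assumed definition): a pair is a predicted positive
    when the evaluator placed both recordings in the same cluster."""
    tp = fp = fn = 0
    for i, j in combinations(range(len(true_speakers)), 2):
        same_speaker = true_speakers[i] == true_speakers[j]
        same_cluster = cluster_ids[i] == cluster_ids[j]
        if same_cluster and same_speaker:
            tp += 1
        elif same_cluster:
            fp += 1
        elif same_speaker:
            fn += 1
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy example (hypothetical data): four recordings from two speakers,
# grouped by an evaluator into two clusters.
speakers = ["a", "a", "b", "b"]
clusters = [0, 0, 0, 1]
purity = cluster_purity(speakers, clusters)
f_score = pairwise_f_measure(speakers, clusters)
```

Under these assumed definitions, purity rewards clusters dominated by one speaker even when a speaker is split across clusters, whereas the pairwise F-measure also penalises such splits through recall.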
Main file
Linkablity_INTERSPEECH_2021.pdf (247.39 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-03267084, version 1 (22-06-2021)
hal-03267084, version 2 (16-12-2021)

Identifiers

  • HAL Id: hal-03267084, version 1

Cite

Benjamin O'Brien, Natalia Tomashenko, Anaïs Chanclu, Jean-François Bonastre. Anonymous speaker clusters: Making distinctions between anonymised speech recordings with clustering interface. INTERSPEECH 2021, Aug 2021, Brno, Czech Republic. ⟨hal-03267084v1⟩