Automatic Classification of Phonation Types in Spontaneous Speech: Towards a New Workflow for the Characterization of Speakers’ Voice Quality - Laboratoire Informatique d'Avignon
Conference paper, Year: 2021

Abstract

Voice quality is known to be an important factor for the characterization of a speaker's voice, both in terms of physiological features (mainly laryngeal and supralaryngeal) and of the speaker's habits (sociolinguistic factors). This paper is devoted to one of the main components of voice quality: phonation type. It proposes neural representations of speech followed by a cascade of two binary neural network-based classifiers, one dedicated to the detection of modal and nonmodal vowels, and one for the classification of nonmodal vowels into creaky and breathy types. This approach is evaluated on the spontaneous part of the PTSVOX database, following an expert manual labelling of the data by phonation type. The proposed classifiers reach on average 85% accuracy at the frame level and up to 95% accuracy at the segment level. Further research is planned to generalize the classifiers to more contexts and speakers, and thus pave the way for a new workflow aimed at characterizing phonation types.
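The cascade described in the abstract can be illustrated with a minimal sketch. Everything beyond the two-stage structure is an assumption made for illustration: the classifiers are modelled as hypothetical callables returning a probability, the 0.5 threshold is arbitrary, and the majority vote used to obtain a segment-level label from frame-level decisions is one plausible aggregation rule, not necessarily the one used in the paper.

```python
from collections import Counter
from typing import Callable, Sequence

# Hypothetical interface: a binary classifier maps one frame's feature
# vector to a probability in [0, 1]. The paper's actual models are neural
# networks operating on neural speech representations.
FrameClassifier = Callable[[Sequence[float]], float]


def classify_frame(
    frame: Sequence[float],
    is_nonmodal: FrameClassifier,
    is_breathy: FrameClassifier,
    threshold: float = 0.5,  # assumed decision threshold
) -> str:
    """Two-stage cascade: modal vs. nonmodal, then creaky vs. breathy."""
    # Stage 1: frames judged modal never reach the second classifier.
    if is_nonmodal(frame) < threshold:
        return "modal"
    # Stage 2: only nonmodal frames are split into breathy and creaky.
    return "breathy" if is_breathy(frame) >= threshold else "creaky"


def classify_segment(
    frames: Sequence[Sequence[float]],
    is_nonmodal: FrameClassifier,
    is_breathy: FrameClassifier,
    threshold: float = 0.5,
) -> str:
    """Segment-level label by majority vote over frame labels (assumption)."""
    if not frames:
        raise ValueError("a segment must contain at least one frame")
    labels = [
        classify_frame(f, is_nonmodal, is_breathy, threshold) for f in frames
    ]
    # On a tie, Counter.most_common keeps the label encountered first.
    return Counter(labels).most_common(1)[0][0]
```

Aggregating several frame decisions into one label per vowel tends to smooth out isolated frame errors, which is consistent with segment-level accuracy being higher than frame-level accuracy.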
Main file
IS2021_-_Automatic_classification_of_phonation_types_in_spontaneous_speech_-_Final.pdf (1.28 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-03334492, version 1 (3 September 2021)

Cite

Anaïs Chanclu, Imen Ben Amor, Cédric Gendrot, Emmanuel Ferragne, Jean-François Bonastre. Automatic Classification of Phonation Types in Spontaneous Speech: Towards a New Workflow for the Characterization of Speakers’ Voice Quality. Interspeech 2021, Aug 2021, Brno, Czech Republic. pp.1015-1018, ⟨10.21437/Interspeech.2021-1765⟩. ⟨hal-03334492⟩