Group Non-Negative Matrix Factorisation With Speaker And Session Similarity Constraints For Speaker Identification - Département Image, Données, Signal Accéder directement au contenu
Communication Dans Un Congrès Année : 2016

Group Non-Negative Matrix Factorisation With Speaker And Session Similarity Constraints For Speaker Identification

Résumé

This paper presents a feature learning approach for speaker identification that is based on non-negative matrix factorisa-tion. Recent studies have shown that in methods such as non-negative matrix factorisation, the dictionary atoms can represent well the speaker identity and that Using speaker identity to induce group similarity can proven to improve further the performance. However, the approaches proposed so far fo-cused only on speakers variability and not on sessions variability. However, this later point is a crucial aspect in the success of the I-vector approaches that is now the state-of-the-art in speaker identification. This paper proposes an approach that relies on group-NMF and that is inspired that the I-vector training procedure. By doing so this approach intends to capture both the speaker variability and the session variability. Results on a small corpus prove the proposed approach to be competitive with the state-of-the-art I-vector approach.
Fichier principal
Vignette du fichier
16-2.pdf (147.29 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01393968 , version 1 (08-11-2016)

Identifiants

  • HAL Id : hal-01393968 , version 1

Citer

Romain Serizel, Slim Essid, Gael Richard. Group Non-Negative Matrix Factorisation With Speaker And Session Similarity Constraints For Speaker Identification. IEEE International Conference on Acoustics, Speech, and Signal Processing, Mar 2016, Shangai, China. ⟨hal-01393968⟩
118 Consultations
162 Téléchargements

Partager

Gmail Facebook X LinkedIn More