RUS  ENG
Full version
JOURNALS // Matematicheskoe modelirovanie // Archive

Matem. Mod., 2015 Volume 27, Number 7, Pages 51–57 (Mi mm3622)

Training a speaker verification system on unlabelled data

A. V. Ermilov, I. M. Gostev

National Research University "Higher School of Economics"

Abstract: In the article we consider a method of labeling speaker data using clusterization techniques. Labelling problems arise when one needs to use a speaker database from new channels, for example, mobile devices. Newly labelled database might then be used to construct a speaker verification system. In the article described a speaker verification task along with some methods to solve it which are based on GMM-UBM, also some channel normalization techniques are described, which might enhance the quality of recognition. Methods based on supervectors and PLDA are also presented. We also study the quality of labeling obtained through clusterization with different metrics. Resulting labelled database is then used to train several PLDA models. Then these models fused and used to solve a speaker verification task on i-vectors from NIST are i-vector Machine Learning Challenge 2014.

Keywords: patern recognition, automatic speaker verification, clusterization, PLDA.

Received: 30.03.2015



Bibliographic databases:


© Steklov Math. Inst. of RAS, 2024