T. V. Yermolenko, M. S. Klymenko, “Usage of Speech Signal Segmentation for the Construction of Complex Model in the Speaker Identification System”, Tr. SPIIRAN, 2013, Issue 26, Pages 332

Usage of Speech Signal Segmentation for the Construction of Complex Model in the Speaker Identification System

T. V. Yermolenko^ab, M. S. Klymenko^a

^a Institute of Artificial Intelligence
^b Donetsk National Technical University

Abstract: The article is devoted to the development of a complex speaker model for text-independent speaker identification. The complex speaker model is based on the Gaussian mixture method. The model is built from a pre-segmented speech signal, in which each segment corresponds to a certain broad phonetic class. A method for structuring speaker models is proposed: the models are organized as a tree, which makes it possible to identify a speaker without a full search over the set of models. The research has shown that dividing the acoustic space of a speaker's voice into a set of classes representing certain phonetic events increases the efficiency of voice identification, and that the proposed method of structuring the models accelerates the search.

Keywords: clustering, Gaussian mixture, speaker models, broad phonetic classes, mel-frequency cepstral coefficients.

UDC: 004.89, 004.93

PACS: 43.71.Sy

MSC: 68T50

Received: 04.04.2013