A. V. Nitsenko, “A ‘by part’ method of russian word speech recognition”, Eurasian Journal of Mathematical and Computer Applications, 2013, том 1, выпуск 2,страницы 102

A ‘by part’ method of russian word speech recognition

A. V. Nitsenko

Institute of Artificial Intelligence, the Ministry of Education and Science of Ukraine and National Academy of Science of Ukraine, Donetsk

Аннотация: The present article is a description of a speech recognition method based on the idea of recognizing words by their component parts. The method proceeds from automatic phonetic segmentation, using full variation digital analogue, to further compose a diphone base and carry out a DTW algorithm-based speech recognition: rstly, for a variable word part (a quasiexion) and secondly, for its static part (a quasibase), with reference templates automatically formed from diphone templates. It results in considerable reduction of the running time and the reliability growth of word form speech recognition. This method can be employed for recognizing large and very large vocabularies.

Ключевые слова: segmentation of speech signal, diphone, dynamic time warping, feature vector, quasiexion.

MSC: 68T10, 68T50

Поступила в редакцию: 02.12.2013

Язык публикации: английский