
Tr. SPIIRAN, 2013 Issue 31, Pages 20–42 (Mi trspy695)

Segmentation and diphone recognition of speech signals.

A. Buribayeva^a, G. V. Dorokhina^b, A. V. Nitsenko^b, V. Shelepov^b

a L. N. Gumilev Eurasian National University, Astana
b Institute of Artificial Intelligence of the Ministry of Education and Science of Ukraine and the National Academy of Sciences of Ukraine, Donetsk

Abstract: The paper describes a speech recognition technology developed at the Institute of Artificial Intelligence (Donetsk, Ukraine). It consists of three main stages: segmentation of the speech signal using a digital analogue of total variation; creation of a diphone database; and DTW-based recognition of words from diphone templates. The technology can be used for large-vocabulary speech recognition and for developing text editors with voice input.
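The DTW recognition stage named in the abstract can be illustrated with a minimal sketch. It assumes that each word template is a sequence of fixed-length feature vectors and that recognition picks the template with the smallest warping cost. The function names `dtw_distance` and `recognize`, the Euclidean local distance, and the toy one-dimensional features are illustrative assumptions, not details taken from the paper.

```python
import math


def dtw_distance(a, b):
    """Dynamic time warping cost between two sequences of feature vectors.

    Assumption: Euclidean distance is used as the local frame distance.
    """
    n, m = len(a), len(b)
    # cost[i][j] = minimal accumulated cost aligning a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = d + min(
                cost[i - 1][j],      # frame of a aligned to an already used frame of b
                cost[i][j - 1],      # frame of b aligned to an already used frame of a
                cost[i - 1][j - 1],  # both sequences advance
            )
    return cost[n][m]


def recognize(features, templates):
    """Return the label of the template with the smallest DTW cost.

    `templates` is a hypothetical dict mapping a word label to its
    feature-vector sequence (for example, concatenated diphone templates).
    """
    return min(templates, key=lambda word: dtw_distance(features, templates[word]))


# Toy usage with one-dimensional features (illustrative data only).
templates = {
    "up": [[0.0], [1.0], [2.0]],
    "down": [[2.0], [1.0], [0.0]],
}
utterance = [[0.0], [0.0], [1.0], [2.0], [2.0]]  # a time-stretched "up"
result = recognize(utterance, templates)
```

Because DTW allows frames to repeat, the stretched utterance aligns with the "up" template at zero cost, which is why template matching of this kind tolerates differences in speaking rate.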

Keywords: segmentation of speech signal, diphone, DTW-recognition.

UDC: 004.934.2

Received: 22.10.2013


