RUS  ENG
Full version
JOURNALS // News of the Kabardino-Balkarian Scientific Center of the Russian Academy of Sciences // Archive

News of the Kabardino-Balkarian Scientific Center of the Russian Academy of Sciences, 2020 Issue 6, Pages 20–33 (Mi izkab248)

System analysis, management and information processing

Modern problems of automatic speech recognition

I. A. Gurtueva

Institute of Computer Science and Problems of Regional Management – branch of Federal public budgetary scientific establishment «Federal scientific center «Kabardin-Balkar Scientific Center of the Russian Academy of Sciences», 360000, KBR, Nalchik, 37-a, I. Armand St.

Abstract: This paper provides a concise review of the most applied methods in speech recognition. Various principles of transcription developed in the Linguistic Data Consortium are discussed. The problems in evaluating the human level of efficiency in solving the problem of speech recognition are described. The typical errors made by a human are analyzed. It has been shown that transcribers demonstrate a high level of consistency with accurate transcription of pre-prepared English speech and fast transcription of conversational telephone speech. It is also shown that with increasing complexity of speech, the word disagreement rate increases. The results of a comparative analysis of errors generated by the speech system and those made by humans are presented. Their similarities and differences are analyzed. The modern automatic speech recognition problems are listed, the prospects for their solution and the directions of future research are estimated.

Keywords: deep learning, artificial intelligence, artificial neuron networks, speech recognition, human parity.

UDC: 004.896

MSC: Primary 68T10; Secondary 68T50

Received: 30.11.2020

DOI: 10.35330/1991-6639-2020-6-98-20-33



Bibliographic databases:


© Steklov Math. Inst. of RAS, 2024