RUS  ENG
Full version
JOURNALS // Avtomatika i Telemekhanika // Archive

Avtomat. i Telemekh., 2022 Issue 6, Pages 38–52 (Mi at15975)

This article is cited in 2 papers

Method for reducing the feature space dimension in speech emotion recognition using convolutional neural networks

A. O. Iskhakova, D. A. Vol'f, R. V. Meshcheryakov

Trapeznikov Institute of Control Sciences, Russian Academy of Sciences, Moscow, 117997 Russia

Abstract: We consider the architectures of convolutional neural networks used to assess the emotional state of a person by their speech. The problem of increasing the efficiency of emotion recognition by reducing the computational complexity of this process is solved. To this end, we propose a method transforming the input data into a form suitable for machine learning algorithms.

Keywords: recognition of speech emotions, speech signal, sound, identification of emotional state, detection of aggression, classification of speech signals, socio-cyber-physical system, convolutional neural network.

Presented by the member of Editorial Board: O. P. Kuznetsov

Received: 17.11.2021
Revised: 19.01.2022
Accepted: 26.01.2022

DOI: 10.31857/S0005231022060046


 English version:
Automation and Remote Control, 2022, 83:6, 857–868


© Steklov Math. Inst. of RAS, 2024