Vestn. Astrakhan State Technical Univ. Ser. Management, Computer Sciences and Informatics, 2023 Number 1, Pages 36–42 (Mi vagtu738)

COMPUTER SOFTWARE AND COMPUTING EQUIPMENT

Hybrid technique of speech signal noise reduction for video conferencing system

S. V. Belov, S. S. Katunin

Astrakhan State Technical University, Astrakhan, Russia

Abstract: The article addresses the problem of audio signal quality in video conferences. It describes how noise affects the quality and intelligibility of the speech signal and analyzes real-time noise reduction in the audio signal, highlighting the main problems that arise in real-time digital audio processing. General noise reduction methods are reviewed and the disadvantages of the classical methods are identified. The problem of eliminating noise with a variable frequency band using classical noise reduction methods is formulated. The need for a hybrid noise reduction technique that uses machine learning and deep learning methods to eliminate both static noise and noise with complex, variable spectral characteristics is substantiated. Two main approaches to real-time noise reduction are highlighted: recognizing and eliminating the noise, and recognizing the voice and eliminating sounds that differ from the speech signal. A noise reduction algorithm based on the first approach (noise recognition and elimination) is described. Optimizing the algorithm by decomposing the spectrum of the input signal according to the Bark scale is proposed. A recurrent neural network is proposed as the tool for implementing the noise reduction algorithm. The input and output data formats of the neural network and the format of the training data are defined. A model for adjusting parameters and rules for adapting the noise reduction algorithm to specific operating conditions is described. A hybrid noise reduction technique that combines classical noise reduction methods with methods based on a recurrent neural network is proposed, and a scheme of the hybrid technique has been developed. A method for testing the effectiveness of the noise reduction technique is proposed.
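
The Bark-scale decomposition mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which Bark formula or band layout is used, so the Zwicker-Terhardt approximation and the one-Bark-wide bands below are assumptions, and the names `hz_to_bark` and `bark_band_energies` are hypothetical. The idea is that a few dozen band energies replace hundreds of FFT bins as the per-frame input to a network.

```python
import math


def hz_to_bark(f_hz: float) -> float:
    """Map frequency in Hz to Bark (Zwicker-Terhardt approximation, assumed here)."""
    return 13.0 * math.atan(0.00076 * f_hz) + 3.5 * math.atan((f_hz / 7500.0) ** 2)


def bark_band_energies(power_spectrum, sample_rate):
    """Sum a one-sided power spectrum into bands one Bark wide.

    power_spectrum: sequence of at least 2 bins spanning 0 .. sample_rate / 2.
    Returns a list with one energy value per Bark band.
    """
    n_bins = len(power_spectrum)
    nyquist = sample_rate / 2.0
    # Highest band index is the integer part of the Bark value at Nyquist.
    n_bands = int(hz_to_bark(nyquist)) + 1
    energies = [0.0] * n_bands
    for k, power in enumerate(power_spectrum):
        f_hz = k * nyquist / (n_bins - 1)   # center frequency of bin k
        energies[int(hz_to_bark(f_hz))] += power
    return energies


if __name__ == "__main__":
    # Hypothetical frame: flat spectrum with 257 bins at 16 kHz sampling.
    bands = bark_band_energies([1.0] * 257, 16000)
    print(len(bands), "bands instead of 257 bins")
```

Because the Bark bands widen with frequency, the low bands in this sketch collect only a few bins each while the high bands collect many, which roughly mirrors the frequency resolution of human hearing.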

Keywords: video conference, speech signal, noise suppression, signal quality, spectrum, filter coefficient, noise threshold, frequency band, Bark scale, recurrent neural networks.

UDC: 004.773.5:[004.032.26+534.83+534.442]

Received: 30.11.2022
Accepted: 27.12.2022

DOI: 10.24143/2073-5529-2023-1-36-42


