Comp. nanotechnol., 2024 Volume 11, Issue 1, Pages 135–150 (Mi cn468)

ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

Neural networks in the task of genre classification of musical compositions

N. V. Grineva, N. V. Grineva

Financial University under the Government of the Russian Federation

Abstract: This study investigates the use of neural networks to classify audio signals into ten genres. It examines the specifics of processing audio signals in digital form, the relationship between the Fourier transform and spectrograms, and the characteristics of audio signals. The networks were trained on the GTZAN dataset, which contains 1000 compositions. Four comparable datasets were derived from it, and three neural network architectures (convolutional, recurrent, and multilayer perceptron) were evaluated on each of them. The practical significance of the work lies in its potential use for generating music recommendations and organizing music collections. The goal of the study is to develop a classifier that accurately estimates the probability that a composition belongs to one of the ten genres.

Keywords: audio signal, mel spectrogram, spectrum, Fourier transform, GTZAN, multilayer perceptron (MLP), convolutional neural network (CNN), genre classification task.

UDC: 519.6

DOI: 10.33693/2313-223X-2024-11-1-135-150
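
The abstract refers to the relationship between the Fourier transform and spectrograms. As a rough illustration only, and not the authors' pipeline, the sketch below builds a magnitude spectrogram by applying a discrete Fourier transform to overlapping, Hann-windowed frames of a signal. The function names, the frame size of 64, the hop of 32, and the synthetic 128 Hz tone are all assumptions chosen for the example. The mel filter bank that would turn this into a mel spectrogram is omitted.

```python
import cmath
import math


def dft_magnitudes(frame):
    """Magnitudes of the DFT of one frame, non-negative frequency bins only."""
    n = len(frame)
    mags = []
    for k in range(n // 2 + 1):
        acc = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        mags.append(abs(acc))
    return mags


def spectrogram(signal, frame_size=64, hop=32):
    """Short-time DFT magnitudes: one row per frame, one column per frequency bin."""
    # Hann window reduces spectral leakage at the frame edges.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * t / frame_size) for t in range(frame_size)]
    rows = []
    for start in range(0, len(signal) - frame_size + 1, hop):
        frame = [signal[start + t] * window[t] for t in range(frame_size)]
        rows.append(dft_magnitudes(frame))
    return rows


# Synthetic test tone (an assumption for illustration, not GTZAN audio).
SAMPLE_RATE = 1024   # samples per second
FRAME_SIZE = 64      # gives a bin spacing of SAMPLE_RATE / FRAME_SIZE = 16 Hz
TONE_HZ = 128

signal = [math.sin(2 * math.pi * TONE_HZ * t / SAMPLE_RATE) for t in range(512)]
spec = spectrogram(signal, frame_size=FRAME_SIZE, hop=32)

# The strongest bin of the first frame, converted back to a frequency in Hz.
peak_bin = max(range(len(spec[0])), key=lambda k: spec[0][k])
peak_hz = peak_bin * SAMPLE_RATE / FRAME_SIZE
print(len(spec), len(spec[0]), peak_hz)
```

Each row of `spec` is the spectrum of one short time slice, so stacking the rows gives the time-frequency picture that a network such as a CNN can take as input. In practice a fast Fourier transform from a numerical library would replace the naive `dft_magnitudes` loop, which is written out here only to make the link to the Fourier transform explicit.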