RUS  ENG
Full version
JOURNALS // Proceedings of the Institute for System Programming of the RAS // Archive

Proceedings of ISP RAS, 2023 Volume 35, Issue 6, Pages 247–264 (Mi tisp845)

Style transfer as a way to improve the generalization ability of a neural network in an object detection task

D. K. Karachevab, S. E. Shtekhina, V. S. Tarasyanb, I. U. Smolina, M. V. Isakova

a Industry Center for Information Systems' Development and Deployment
b Urals State University of Railway Transport

Abstract: This paper proposes the implementation of a neural network training approach for object detection, using augmentation - style transfer. This method improves the generalization ability of the neural network to determine the location of objects in the image by improving the interaction with low-level features such as textures, different colors and small changes in shapes. The effectiveness of the method is experimentally proved and the numerical values of the object detection metrics are demonstrated on several datasets with different classes. The application of augmentation is proposed using an unused before neural network architecture capable of carrying an arbitrary number of styles. The peculiarity of the approach is also that the weights of the neural network for styling are frozen and it is added to the graph of the detection network, which allows augmentation speed.

Keywords: neural networks, computer vision, style transfer, machine learning, object detection

DOI: 10.15514/ISPRAS-2023-35(6)-16



© Steklov Math. Inst. of RAS, 2024