
Proceedings of ISP RAS, 2023, Volume 35, Issue 2, Pages 7–18 (Mi tisp766)

Real application of CNN interpretation methods: document image classification model errors’ detection and validation

A. O. Golodkov, O. V. Belyaeva, A. I. Perminov

Ivannikov Institute for System Programming of the RAS

Abstract: In this paper, we apply convolutional neural network interpretation methods to a ResNet-18 model in order to identify and explain its errors. The model classifies the orientation of text document images. First, the interpretation methods led to a hypothesis about why the network shows low metrics on data that differs from the training images: the generated training images contain artifacts introduced by the image rotation function. Then, using the Vanilla Gradient, Guided Backpropagation, Integrated Gradients, and GradCAM methods together with a metric devised for this purpose, we confirmed this hypothesis. The results helped to significantly improve the accuracy of the model.
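
For readers unfamiliar with the simplest of the listed methods, the following is a minimal sketch of the Vanilla Gradient idea: the attribution of each input pixel is the gradient of the target class score with respect to that pixel. It is not the authors' code. The two-layer ReLU network, the weights `w1` and `w2`, the four-pixel input `x`, and the helper names `forward`, `vanilla_gradient`, and `saliency` are all hypothetical stand-ins chosen so the example runs with the standard library alone; the paper itself applies the method to ResNet-18.

```python
from typing import List, Tuple

Vector = List[float]
Matrix = List[Vector]


def forward(x: Vector, w1: Matrix, w2: Matrix) -> Tuple[Vector, Vector]:
    """Toy classifier (hypothetical): scores = W2 * relu(W1 * x)."""
    pre = [sum(w * xi for w, xi in zip(row, x)) for row in w1]
    hidden = [max(0.0, p) for p in pre]
    scores = [sum(w * h for w, h in zip(row, hidden)) for row in w2]
    return pre, scores


def vanilla_gradient(x: Vector, w1: Matrix, w2: Matrix, target: int) -> Vector:
    """Gradient of scores[target] with respect to each input value."""
    pre, _ = forward(x, w1, w2)
    # Gradient arriving at each hidden unit; ReLU passes it only where
    # the pre-activation is positive.
    grad_hidden = [w2[target][j] if pre[j] > 0 else 0.0 for j in range(len(pre))]
    # Back through the first linear layer: transpose(W1) * grad_hidden.
    return [
        sum(w1[j][i] * grad_hidden[j] for j in range(len(w1)))
        for i in range(len(x))
    ]


def saliency(x: Vector, w1: Matrix, w2: Matrix, target: int) -> Vector:
    """Saliency map: absolute value of the vanilla gradient per input."""
    return [abs(g) for g in vanilla_gradient(x, w1, w2, target)]


# Hypothetical data: a 4-"pixel" input, 3 hidden units, 2 classes.
w1 = [[1.0, 0.0, -1.0, 2.0], [0.5, 1.0, 0.5, 0.0], [-1.0, 2.0, 1.0, 1.0]]
w2 = [[1.0, -2.0, 0.5], [0.0, 3.0, -1.0]]
x = [0.5, -1.0, 2.0, 0.0]

print(saliency(x, w1, w2, target=1))
```

Inputs with large saliency values are the ones whose small changes most affect the class score. In a setting like the one described in the abstract, high saliency concentrated on regions that carry no text would be one sign that the model relies on something other than the document content.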

Keywords: CNN Interpretation, Document Image Classification, Document Orientation Detection

Language: English

DOI: 10.15514/ISPRAS-2023-35(2)-1


