Matematicheskoe modelirovanie

Mat. Model., 2023 Volume 35, Number 12, Pages 18–30 (Mi mm4510)

On using the computer linguistic models in the classification of biomedical images

E.Yu.Shchetinin

Financial University under the Government of the Russian Federation

Abstract: Computer linguistic models have become widespread in natural language processing and have recently been actively applied to a variety of computer vision problems. This article reports computational experiments that evaluate the effectiveness of transformer models in classifying X-ray images of the lungs. The experiments used pre-trained transformer models of different sizes, ViT-B(16/32) and ViT-L(16/32), which were then fine-tuned on a set of lung X-ray images. The convolutional neural networks VGG-16, Inception V3, ResNet50, EfficientNetV2, and DenseNet121 were evaluated on the same task. A comparative analysis of the classification results showed that the ViT-B/32 transformer model achieved the best metrics: accuracy = 97.56% and AUC = 99%.

Keywords: transformers, deep convolutional networks, classification, lung X-ray images.
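
For readers unfamiliar with the two metrics quoted in the abstract, the following is a minimal sketch of how accuracy and AUC are commonly computed from classifier scores. It is not taken from the article: it assumes a binary labelling (0 or 1), a decision threshold of 0.5, and toy data invented for illustration. The function names `accuracy` and `roc_auc` are hypothetical helpers, not part of any library.

```python
from typing import Sequence


def accuracy(y_true: Sequence[int], y_score: Sequence[float],
             threshold: float = 0.5) -> float:
    """Fraction of samples whose thresholded score matches the true label.

    Assumption: binary labels in {0, 1}; the 0.5 threshold is illustrative.
    """
    preds = [1 if s >= threshold else 0 for s in y_score]
    correct = sum(p == t for p, t in zip(preds, y_true))
    return correct / len(y_true)


def roc_auc(y_true: Sequence[int], y_score: Sequence[float]) -> float:
    """Area under the ROC curve via the pairwise (Mann-Whitney) formulation.

    AUC equals the probability that a randomly chosen positive sample
    receives a higher score than a randomly chosen negative one,
    with ties counted as one half.
    """
    pos = [s for t, s in zip(y_true, y_score) if t == 1]
    neg = [s for t, s in zip(y_true, y_score) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Toy data, invented for illustration only (not results from the article).
labels = [0, 0, 1, 1, 1, 0]
scores = [0.1, 0.4, 0.35, 0.8, 0.9, 0.6]

acc = accuracy(labels, scores)
auc = roc_auc(labels, scores)
print(f"accuracy = {acc:.4f}, AUC = {auc:.4f}")
```

Accuracy depends on the chosen threshold, whereas AUC summarizes ranking quality across all thresholds, which is why the two are usually reported together.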

Received: 19.06.2023
Revised: 19.06.2023
Accepted: 11.09.2023

DOI: 10.20948/mm-2023-12-02


 English version:
Mathematical Models and Computer Simulations, 2024, 16:2, 246–253

