RUS  ENG
Full version
JOURNALS // Computer Research and Modeling // Archive

Computer Research and Modeling, 2020 Volume 12, Issue 6, Pages 1383–1395 (Mi crm855)

This article is cited in 1 paper

ANALYSIS AND MODELING OF COMPLEX LIVING SYSTEMS

Ensemble building and statistical mechanics methods for MHC-peptide binding prediction

I. V. Grebenkina, A. E. Alekseenkob, N. A. Gaivoronskiya, M. G. Ignatovb, A. M. Kazennovb, D. Kozakovc, A. P. Kulagina, Ya. A. Kholodova

a Innopolis University, 1 Universitetskaya st., Innopolis, 420500, Russia
b Institute of Computer Aided Design of the Russian Academy of Sciences, 19/18 2 Brestskaya st., Moscow, 123056, Russia
c Stony Brook University, 100 Nicolls Rd, Stony Brook, NY, 11794, U.S.A.

Abstract: The proteins of the Major Histocompatibility Complex (MHC) play a key role in the functioning of the adaptive immune system, and the identification of peptides that bind to them is an important step in the development of vaccines and understanding the mechanisms of autoimmune diseases. Today, there are a number of methods for predicting the binding of a particular MHC allele to a peptide. One of the best such methods isNetMHCpan-4.0, which is based on an ensemble of artificial neural networks. This paper presents a methodology for qualitatively improving the underlying neural network underlying NetMHCpan-4.0. The proposed method uses the ensemble construction technique and adds as input an estimate of the Potts model taken from static mechanics, which is a generalization of the Ising model. In the general case, the model reflects the interaction of spins in the crystal lattice. Within the framework of the proposed method, the model is used to better represent the physical nature of the interaction of proteins included in the complex. To assess the interaction of the MHC+peptide complex, we use a two-dimensional Potts model with 20 states (corresponding to basic amino acids). Solving the inverse problem using data on experimentally confirmed interacting pairs, we obtain the values of the parameters of the Potts model, which we then use to evaluate a new pair of MHC+peptide, and supplement this value with the input data of the neural network. This approach, combined with the ensemble construction technique, allows for improved prediction accuracy, in terms of the positive predictive value (PPV) metric, compared to the baseline model.

Keywords: major histocompatibility complex, binding affinity, neural network, machine learning, Potts model.

UDC: 577.27

Received: 10.08.2020
Revised: 19.10.2020
Accepted: 29.10.2020

DOI: 10.20537/2076-7633-2020-12-6-1383-1395



© Steklov Math. Inst. of RAS, 2024