RUS  ENG
Full version
JOURNALS // Mendeleev Communications // Archive

Mendeleev Commun., 2026 Volume 36, Issue 3, Pages 257–259 (Mi mendc7430)

Communications

Overfitting in machine learning models for predicting partial atomic charges of drug-like molecules

D. V. Zverev, P. K. Nikiforova, A. R. Shaimardanov, D. A. Shulga, V. A. Palyulin

Department of Chemistry, M. V. Lomonosov Moscow State University, 119991 Moscow, Russian Federation

Abstract: Overfitting in machine learning models for partial atomic charges was investigated for highly heterogeneous datasets common in medicinal chemistry. Random forest and multilayer perceptron models were trained and validated on a specially clustered dataset of drug-like molecules. Analysis of standard quality metrics for reproducing RESP charges showed that the trained models exhibit no evidence of overfitting.

Keywords: overfitting, transferability, machine learning, neural network, multilayer perceptron, random forest, partial atomic charges, drug-like molecules, chemical datasets.

Received: 02.10.2025
Accepted: 25.12.2025

Language: English

DOI: 10.71267/mencom.7930



© Steklov Math. Inst. of RAS, 2026