RUS  ENG
Full version
JOURNALS // Journal of the Belarusian State University. Mathematics and Informatics // Archive

Journal of the Belarusian State University. Mathematics and Informatics, 2019 Volume 1, Pages 77–89 (Mi bgumi80)

This article is cited in 1 paper

Theoretical foundations of computer science

Selecting informative features of human gene exons

A. V. Volkov, N. N. Yatskou, V. V. Grinev

Belarusian State University, 4 Niezaliežnasci Avenue, Minsk 220030, Belarus

Abstract: Dimensionality reduction of the human gene exon feature space is considered with the aim of gene identification. To evaluate the performance of various feature selection algorithms, computational experiments were carried out using the examples of exons of 14 known human genes. It is proven that exons are clearly separable regarding gene affiliation. Feature selection algorithms are sensitive to noise features and allow to estimate their number. Reducing the number of features improves CPU-time, memory usage as well as reduces the complexity of a model and makes it easier to interpret. Our findings indicate that utilizing of features of flanking intronic sequences leads to better prediction models in comparison with utilizing of exon features. The results of the research provide new opportunities for study of human gene data using machine learning algorithms.

Keywords: exon; intron; bioinformatics; feature selection; simulation modeling; classification algorithm.

UDC: 57.087.1

DOI: 10.33581/2520-6508-2019-1-77-89



© Steklov Math. Inst. of RAS, 2024