RUS  ENG
Full version
JOURNALS // Matematicheskaya Biologiya i Bioinformatika // Archive

Mat. Biolog. Bioinform., 2018 Volume 13, Issue Suppl., Pages t84–t103 (Mi mbb365)

Translations of Published Articles

Investigation of latent periodicity phenomenon in the genomes of eukaryotic organisms

M. B. Chaleya, V. A. Kutyrkinb, E. I. Teplukhinaa, G. E. Tyulbashevaa, N. N. Nazipovaa

a Institute of Mathematical Problems of Biology, Russian Academy of Sciences, Pushchino, Russia
b Moscow State Technical University n.a. N.E. Bauman, Moscow, Russia

Abstract: Data analysis is presented for the HeteroGenome database first release which contains latent periodicity regions revealed in a number of eukaryotic organisms. Tandem repeats with different integrity of pattern copies, including the highly diverged repeats, have been identified in the genomes of S. cerevisiae, A. thaliana, C. elegans and D. melanogaster. Such data were obtained with the help of original spectral-statistical approach to searching for reliable regions of the latent periodicity in DNA sequences. Special structure of data presentation, consisting of the two levels, was proposed. On the first, nonredundant level the latent periodicity regions are considered as a whole and, additionally, on the second level only conservative elements of their periodic structures are shown. Such data presentation allowed estimating share of the periodicity regions as nearly 10% of the length in analyzed genomes. This estimate was deduced basing on the first level data. Quantitative and qualitative investigation of the latent periodicity regions, their divergence level over all chromosomes of the organisms considered, revealed characteristic types of periodicity in the genome of every organism. Histograms of density distribution for the latent periodicity regions on each chromosome of the genomes analyzed were obtained. Repertoire of period lengths were determinated. The HeteroGenome database has additional possibilities for inner data analysis and is accessible by URL: http://www.jcbi.ru/lp_baze/.

Key words: latent periodicity, approximate tandem repeats, genome analysis.

UDC: 577.322

Received 23.07.2018, Published 09.08.2018

Language: English

DOI: 10.17537/2018.13.t84



© Steklov Math. Inst. of RAS, 2024