Abstract:
Sequencing of the human genome began in 1994. It took 10 years of collaborative work of many teams in order to obtain a draft of human DNA. Modern technology of sequencing allows one to read the individual genomes in a few days. Advances in modern bioinformatics related to the emergence of high-performance sequencing platforms, which not only contributed to the expansion of the capabilities of biology and related sciences, but also gave rise to the phenomenon of large data. In the paper the necessity of development of new technologies and methods for organization of storage, management, analysis and visualization of large data is substantiated. Modern bioinformatics has faced not only the problem of enormous volumes of heterogenous data, but also with a huge variety of processing and presentation methods, the existence of various software tools and data formats. The ways of solving the arising challenges are discussed in the paper, in particular by using achievements from other areas of modern life, such as web intelligence and business intelligence. New storage systems, other than relational ones, will help to solve the problem of archiving and ensuring an acceptable time for performing search queries. New programming technologies, namely generic programming and visual programming can help to overcome the problem of diversity of formats of genomic data and provide the ability to experimentators to quickly create scripts for data processing.