RUS  ENG
Full version
JOURNALS // Matematicheskaya Biologiya i Bioinformatika // Archive

Mat. Biolog. Bioinform., 2025 Volume 20, Issue 1, Pages 100–121 (Mi mbb588)

Bioinformatics

Facilitating rapid screening of metaviral data with the VERA pipeline

I. K. Chudinovab, A. S. Speranskayac

a Scientific Research Institute for Systems Biology and Medicine of Rospotrebnadzor, Moscow
b Moscow Institute of Physics and Technology (National Research University), Moscow, Russia
c Vavilov Institute of General Genetics Russian Academy of Science, Moscow, Russia

Abstract: The study of metaviral data has become a crucial area of research in recent years, leading to a significant increase in metagenomic and metaviral datasets. This growth has resulted in advancements in data collection and processing techniques, focusing on analyzing the complex viral diversity found in natural samples. Recent developments in metaviral research have emphasized the integration of high-throughput sequencing technologies with advanced bioinformatics tools, enhancing the sensitivity and accuracy of virus identification. This study introduces a novel pipeline designed for rigorous double filtering of assembled contigs, maximizing the detection of true viral sequences while minimizing the inclusion of non-viral artifacts. By improving virus detection through dual taxonomic filtering and leveraging modern workflow management systems, the new pipeline not only addresses gaps in existing methods but also offers a fresh approach to metaviral research. The VERA pipeline was implemented using the Nextflow workflow system, and its validation was performed with control samples, showing consistent results that align with manual validation and commercial platforms. This work highlights the importance of innovative approaches in capturing the full breadth and complexity of viral diversity, even amidst recent technological advancements.

Key words: NGS, metaviral, nextflow, pipelines.

Received 10.03.2025, 23.03.2025, Published 14.05.2025

Language: English

DOI: 10.17537/2025.20.100



Bibliographic databases:


© Steklov Math. Inst. of RAS, 2025