Unknown

Dataset Information

0

VERSE: a novel approach to detect virus integration in host genomes through reference genome customization.


ABSTRACT: Fueled by widespread applications of high-throughput next generation sequencing (NGS) technologies and urgent need to counter threats of pathogenic viruses, large-scale studies were conducted recently to investigate virus integration in host genomes (for example, human tumor genomes) that may cause carcinogenesis or other diseases. A limiting factor in these studies, however, is rapid virus evolution and resulting polymorphisms, which prevent reads from aligning readily to commonly used virus reference genomes, and, accordingly, make virus integration sites difficult to detect. Another confounding factor is host genomic instability as a result of virus insertions. To tackle these challenges and improve our capability to identify cryptic virus-host fusions, we present a new approach that detects Virus intEgration sites through iterative Reference SEquence customization (VERSE). To the best of our knowledge, VERSE is the first approach to improve detection through customizing reference genomes. Using 19 human tumors and cancer cell lines as test data, we demonstrated that VERSE substantially enhanced the sensitivity of virus integration site detection. VERSE is implemented in the open source package VirusFinder 2 that is available at http://bioinfo.mc.vanderbilt.edu/VirusFinder/.

SUBMITTER: Wang Q 

PROVIDER: S-EPMC4333248 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature

altmetric image

Publications

VERSE: a novel approach to detect virus integration in host genomes through reference genome customization.

Wang Qingguo Q   Jia Peilin P   Zhao Zhongming Z  

Genome medicine 20150120 1


Fueled by widespread applications of high-throughput next generation sequencing (NGS) technologies and urgent need to counter threats of pathogenic viruses, large-scale studies were conducted recently to investigate virus integration in host genomes (for example, human tumor genomes) that may cause carcinogenesis or other diseases. A limiting factor in these studies, however, is rapid virus evolution and resulting polymorphisms, which prevent reads from aligning readily to commonly used virus re  ...[more]

Similar Datasets

| S-EPMC8040819 | biostudies-literature
| S-EPMC10698978 | biostudies-literature
| S-EPMC4120144 | biostudies-literature
| S-EPMC4726161 | biostudies-literature
| PRJEB49424 | ENA
| S-EPMC3663743 | biostudies-literature
| S-EPMC5891936 | biostudies-literature
| S-EPMC7379554 | biostudies-literature
| S-EPMC2224419 | biostudies-literature
| S-EPMC8051094 | biostudies-literature