Dataset Information

VIP: an integrated pipeline for metagenomics of virus identification and discovery.


ABSTRACT: Identification and discovery of viruses using next-generation sequencing (NGS) technology is a fast-developing area with potentially wide application in clinical diagnostics, public health monitoring and novel virus discovery. However, the enormous volume of sequence data produced by NGS studies poses a major challenge to both the accuracy and the speed of analysis. Here we describe VIP ("Virus Identification Pipeline"), a one-touch computational pipeline for virus identification and discovery from metagenomic NGS data. VIP performs the following steps: (i) mapping and filtering out background-related reads, (ii) extensive classification of reads on the basis of nucleotide and remote amino acid homology, and (iii) multiple k-mer based de novo assembly and phylogenetic analysis to provide evolutionary insight. We validated the feasibility and veracity of this pipeline with sequencing results from various types of clinical samples and public datasets. VIP has also contributed to timely virus diagnosis (~10 min) in acutely ill patients, demonstrating its potential for unbiased NGS-based clinical studies that demand a short turnaround time. VIP is released under GPLv3 and is available for free download at: https://github.com/keylabivdc/VIP.
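
The first two steps named in the abstract, background subtraction and read classification, can be illustrated with a toy sketch. This is not VIP's implementation: the abstract describes mapping reads and homology searches, whereas the code below swaps in a simple exact k-mer overlap score so that it runs with the Python standard library alone. All function names, sequences, the k-mer size and the 0.5 threshold are hypothetical choices made for illustration, and the amino acid homology, assembly and phylogeny steps are left out.

```python
from collections import Counter


def kmers(seq, k):
    """Return the set of all k-length substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def shared_fraction(read, ref_kmers, k):
    """Fraction of the read's distinct k-mers that also occur in the reference."""
    read_kmers = kmers(read, k)
    if not read_kmers:  # read shorter than k
        return 0.0
    return len(read_kmers & ref_kmers) / len(read_kmers)


def subtract_background(reads, host_seq, k=8, threshold=0.5):
    """Step (i), simplified: drop reads that look like host background."""
    host_kmers = kmers(host_seq, k)
    return [r for r in reads if shared_fraction(r, host_kmers, k) < threshold]


def classify(reads, viral_refs, k=8, threshold=0.5):
    """Step (ii), simplified: assign each read to its best-matching reference."""
    ref_kmers = {name: kmers(seq, k) for name, seq in viral_refs.items()}
    counts = Counter()
    for read in reads:
        best_name, best_score = "unclassified", 0.0
        for name, ref in ref_kmers.items():
            score = shared_fraction(read, ref, k)
            if score > best_score:
                best_name, best_score = name, score
        label = best_name if best_score >= threshold else "unclassified"
        counts[label] += 1
    return counts


# Hypothetical toy data: one "host" and two "viral" references.
HOST = "ACACACACACACACACACACACAC"
VIRAL_REFS = {
    "virus_a": "GGTTGGTTGGTTGGTTGGTTGGTT",
    "virus_b": "AGAGTCTCAGAGTCTCAGAGTCTC",
}
READS = [
    "ACACACACACAC",  # host-like, removed in step (i)
    "GGTTGGTTGGTT",  # matches virus_a
    "TTGGTTGGTTGG",  # matches virus_a
    "AGAGTCTCAGAG",  # matches virus_b
    "GCGCGCGCGCGC",  # matches nothing
]

non_host = subtract_background(READS, HOST)
report = classify(non_host, VIRAL_REFS)
```

In this sketch `report` is a per-reference read count, which is the kind of summary a classification stage would hand to downstream assembly. A real pipeline would replace the k-mer overlap with an aligner and a curated reference database.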

SUBMITTER: Li Y 

PROVIDER: S-EPMC4824449 | biostudies-literature | 2016 Mar

REPOSITORIES: biostudies-literature

Publications

VIP: an integrated pipeline for metagenomics of virus identification and discovery.

Li Yang, Wang Hao, Nie Kai, Zhang Chen, Zhang Yi, Wang Ji, Niu Peihua, Ma Xuejun

Scientific Reports, 2016-03-30


Similar Datasets

| S-EPMC5768174 | biostudies-literature
| S-EPMC7655621 | biostudies-literature
| S-EPMC6375137 | biostudies-literature
| S-EPMC5088602 | biostudies-literature
| S-EPMC3945619 | biostudies-literature
| S-EPMC4759986 | biostudies-literature
| S-EPMC7672179 | biostudies-literature
| S-EPMC10234390 | biostudies-literature
| S-EPMC4322365 | biostudies-literature
| S-EPMC7024552 | biostudies-literature