Unknown

Dataset Information

0

ENVirT: inference of ecological characteristics of viruses from metagenomic data.


ABSTRACT: BACKGROUND:Estimating the parameters that describe the ecology of viruses,particularly those that are novel, can be made possible using metagenomic approaches. However, the best-performing existing methods require databases to first estimate an average genome length of a viral community before being able to estimate other parameters, such as viral richness. Although this approach has been widely used, it can adversely skew results since the majority of viruses are yet to be catalogued in databases. RESULTS:In this paper, we present ENVirT, a method for estimating the richness of novel viral mixtures, and for the first time we also show that it is possible to simultaneously estimate the average genome length without a priori information. This is shown to be a significant improvement over database-dependent methods, since we can now robustly analyze samples that may include novel viral types under-represented in current databases. We demonstrate that the viral richness estimates produced by ENVirT are several orders of magnitude higher in accuracy than the estimates produced by existing methods named PHACCS and CatchAll when benchmarked against simulated data. We repeated the analysis of 20 metavirome samples using ENVirT, which produced results in close agreement with complementary in virto analyses. CONCLUSIONS:These insights were previously not captured by existing computational methods. As such, ENVirT is shown to be an essential tool for enhancing our understanding of novel viral populations.

SUBMITTER: Jayasundara D 

PROVIDER: S-EPMC7394321 | biostudies-literature | 2019 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

ENVirT: inference of ecological characteristics of viruses from metagenomic data.

Jayasundara Duleepa D   Herath Damayanthi D   Senanayake Damith D   Saeed Isaam I   Yang Cheng-Yu CY   Sun Yuan Y   Chang Bill C BC   Tang Sen-Lin SL   Halgamuge Saman K SK  

BMC bioinformatics 20190204 Suppl 13


<h4>Background</h4>Estimating the parameters that describe the ecology of viruses,particularly those that are novel, can be made possible using metagenomic approaches. However, the best-performing existing methods require databases to first estimate an average genome length of a viral community before being able to estimate other parameters, such as viral richness. Although this approach has been widely used, it can adversely skew results since the majority of viruses are yet to be catalogued in  ...[more]

Similar Datasets

| S-EPMC9580935 | biostudies-literature
| S-EPMC8172088 | biostudies-literature
| S-EPMC5541006 | biostudies-literature
| S-EPMC9303593 | biostudies-literature
| S-EPMC6058391 | biostudies-literature
| S-EPMC8921650 | biostudies-literature
| S-EPMC6792449 | biostudies-literature
| S-EPMC8452072 | biostudies-literature
| S-EPMC5847339 | biostudies-literature
| S-EPMC4938975 | biostudies-literature