Unknown

Dataset Information

0

Empirical validation of viral quasispecies assembly algorithms: state-of-the-art and challenges.


ABSTRACT: Next generation sequencing (NGS) is superseding Sanger technology for analysing intra-host viral populations, in terms of genome length and resolution. We introduce two new empirical validation data sets and test the available viral population assembly software. Two intra-host viral population 'quasispecies' samples (type-1 human immunodeficiency and hepatitis C virus) were Sanger-sequenced, and plasmid clone mixtures at controlled proportions were shotgun-sequenced using Roche's 454 sequencing platform. The performance of different assemblers was compared in terms of phylogenetic clustering and recombination with the Sanger clones. Phylogenetic clustering showed that all assemblers captured a proportion of the most divergent lineages, but none were able to provide a high precision/recall tradeoff. Estimated variant frequencies mildly correlated with the original. Given the limitations of currently available algorithms identified by our empirical validation, the development and exploitation of additional data sets is needed, in order to establish an efficient framework for viral population reconstruction using NGS.

SUBMITTER: Prosperi MC 

PROVIDER: S-EPMC3789152 | biostudies-literature | 2013 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Empirical validation of viral quasispecies assembly algorithms: state-of-the-art and challenges.

Prosperi Mattia C F MC   Yin Li L   Nolan David J DJ   Lowe Amanda D AD   Goodenow Maureen M MM   Salemi Marco M  

Scientific reports 20131003


Next generation sequencing (NGS) is superseding Sanger technology for analysing intra-host viral populations, in terms of genome length and resolution. We introduce two new empirical validation data sets and test the available viral population assembly software. Two intra-host viral population 'quasispecies' samples (type-1 human immunodeficiency and hepatitis C virus) were Sanger-sequenced, and plasmid clone mixtures at controlled proportions were shotgun-sequenced using Roche's 454 sequencing  ...[more]

Similar Datasets

| S-EPMC3967922 | biostudies-literature
| S-EPMC5411778 | biostudies-literature
| S-EPMC3372249 | biostudies-other
| S-EPMC3595110 | biostudies-literature
| S-EPMC5482426 | biostudies-literature
| S-EPMC6797082 | biostudies-literature
| S-EPMC7557393 | biostudies-literature
| S-EPMC7794923 | biostudies-literature
| S-EPMC2660199 | biostudies-other
| S-EPMC3764528 | biostudies-literature