Unknown

Dataset Information

0

Indel and Carryforward Correction (ICC): a new analysis approach for processing 454 pyrosequencing data.


ABSTRACT: Pyrosequencing technology provides an important new approach to more extensively characterize diverse sequence populations and detect low frequency variants. However, the promise of this technology has been difficult to realize, as careful correction of sequencing errors is crucial to distinguish rare variants (?1%) in an infected host with high sensitivity and specificity.We developed a new approach, referred to as Indel and Carryforward Correction (ICC), to cluster sequences without substitutions and locally correct only indel and carryforward sequencing errors within clusters to ensure that no rare variants are lost. ICC performs sequence clustering in the order of (i) homopolymer indel patterns only, (ii) indel patterns only and (iii) carryforward errors only, without the requirement of a distance cutoff value. Overall, ICC removed 93-95% of sequencing errors found in control datasets. On pyrosequencing data from a PCR fragment derived from 15 HIV-1 plasmid clones mixed at various frequencies as low as 0.1%, ICC achieved the highest sensitivity and similar specificity compared with other commonly used error correction and variant calling algorithms.Source code is freely available for download at http://indra.mullins.microbiol.washington.edu/ICC. It is implemented in Perl and supported on Linux, Mac OS X and MS Windows.

SUBMITTER: Deng W 

PROVIDER: S-EPMC3777116 | biostudies-literature | 2013 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Indel and Carryforward Correction (ICC): a new analysis approach for processing 454 pyrosequencing data.

Deng Wenjie W   Maust Brandon S BS   Westfall Dylan H DH   Chen Lennie L   Zhao Hong H   Larsen Brendan B BB   Iyer Shyamala S   Liu Yi Y   Mullins James I JI  

Bioinformatics (Oxford, England) 20130729 19


<h4>Motivation</h4>Pyrosequencing technology provides an important new approach to more extensively characterize diverse sequence populations and detect low frequency variants. However, the promise of this technology has been difficult to realize, as careful correction of sequencing errors is crucial to distinguish rare variants (∼1%) in an infected host with high sensitivity and specificity.<h4>Results</h4>We developed a new approach, referred to as Indel and Carryforward Correction (ICC), to c  ...[more]

Similar Datasets

| S-EPMC3605598 | biostudies-literature
| S-EPMC3985643 | biostudies-literature
| S-EPMC4234478 | biostudies-literature
| S-EPMC3431371 | biostudies-literature
| S-EPMC7114314 | biostudies-literature
| S-EPMC3194189 | biostudies-literature
| S-EPMC3840631 | biostudies-literature
| S-EPMC3287517 | biostudies-literature
| S-EPMC3954088 | biostudies-literature
| S-EPMC3017093 | biostudies-literature