Dataset Information

High-throughput DNA sequencing errors are reduced by orders of magnitude using circle sequencing.


ABSTRACT: A major limitation of high-throughput DNA sequencing is the high rate of erroneous base calls produced. For instance, Illumina sequencing machines produce errors at a rate of ~0.1-1 × 10⁻² per base sequenced. These technologies typically produce billions of base calls per experiment, translating to millions of errors. We have developed a unique library preparation strategy, "circle sequencing," which allows for robust downstream computational correction of these errors. In this strategy, DNA templates are circularized, copied multiple times in tandem with a rolling circle polymerase, and then sequenced on any high-throughput sequencing machine. Each read produced is computationally processed to obtain a consensus sequence of all linked copies of the original molecule. Physically linking the copies ensures that each copy is independently derived from the original molecule and allows for efficient formation of consensus sequences. The circle-sequencing protocol precedes standard library preparations and is therefore suitable for a broad range of sequencing applications. We tested our method using the Illumina MiSeq platform and obtained errors in our processed sequencing reads at a rate as low as 7.6 × 10⁻⁶ per base sequenced, dramatically improving the error rate of Illumina sequencing and putting error on par with low-throughput, but highly accurate, Sanger sequencing. Circle sequencing also had substantially higher efficiency and lower cost than existing barcode-based schemes for correcting sequencing errors.
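The consensus step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names `find_period` and `consensus`, the self-alignment heuristic for finding the repeat length, the 20% mismatch threshold, and the plain majority vote are all assumptions made for illustration. A real implementation would likely also use base quality scores and handle insertions and deletions.

```python
from collections import Counter


def find_period(read, min_period=3, max_mismatch_frac=0.2):
    """Return the smallest shift at which the read matches itself.

    Hypothetical helper: a read made of tandem copies of one template
    should agree with itself when shifted by the template length.
    """
    n = len(read)
    for p in range(min_period, n // 2 + 1):
        mismatches = sum(read[i] != read[i + p] for i in range(n - p))
        if mismatches <= max_mismatch_frac * (n - p):
            return p
    return None  # no repeat structure detected


def consensus(read, period):
    """Majority-vote each position across the tandem copies."""
    columns = [Counter() for _ in range(period)]
    for i, base in enumerate(read):
        columns[i % period][base] += 1
    # most_common(1) picks the most frequent base in each column
    return "".join(col.most_common(1)[0][0] for col in columns)


# Toy read: three linked copies of a 10-base template,
# with one simulated sequencing error (T -> A) in the second copy.
template = "ACGTTGCAAG"
read = template + "ACGATGCAAG" + template

period = find_period(read)
print(period, consensus(read, period))
```

In this toy case the erroneous base is outvoted by the two correct copies, so the consensus equals the original template. Because a read from a circularized template can start anywhere on the circle, the consensus from real data would in general be a rotation of the original molecule.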

SUBMITTER: Lou DI 

PROVIDER: S-EPMC3856802 | biostudies-other | 2013 Dec

REPOSITORIES: biostudies-other

Publications

High-throughput DNA sequencing errors are reduced by orders of magnitude using circle sequencing.

Lou DI, Hussmann JA, McBee RM, Acevedo A, Andino R, Press WH, Sawyer SL

Proceedings of the National Academy of Sciences of the United States of America, 2013-11-15, issue 49


Similar Datasets

| S-EPMC4719071 | biostudies-literature
| S-EPMC3852801 | biostudies-literature
| S-EPMC3083090 | biostudies-literature
2024-01-14 | MSV000093867 | MassIVE
| S-EPMC5937187 | biostudies-literature
| S-EPMC5896730 | biostudies-literature
| S-EPMC3045962 | biostudies-literature
| S-EPMC5704956 | biostudies-literature
| S-EPMC6142223 | biostudies-literature
| S-EPMC2941283 | biostudies-literature