Unknown

Dataset Information

0

Systematic evaluation of error rates and causes in short samples in next-generation sequencing.


ABSTRACT: Next-generation sequencing (NGS) is the method of choice when large numbers of sequences have to be obtained. While the technique is widely applied, varying error rates have been observed. We analysed millions of reads obtained after sequencing of one single sequence on an Illumina sequencer. According to our analysis, the index-PCR for sample preparation has no effect on the observed error rate, even though PCR is traditionally seen as one of the major contributors to enhanced error rates in NGS. In addition, we observed very persistent pre-phasing effects although the base calling software corrects for these. Removal of shortened sequences abolished these effects and allowed analysis of the actual mutations. The average error rate determined was 0.24?±?0.06% per base and the percentage of mutated sequences was found to be 6.4?±?1.24%. Constant regions at the 5'- and 3'-end, e.g., primer binding sites used in in vitro selection procedures seem to have no effect on mutation rates and re-sequencing of samples obtains very reproducible results. As phasing effects and other sequencing problems vary between equipment and individual setups, we recommend evaluation of error rates and types to all NGS-users to improve the quality and analysis of NGS data.

SUBMITTER: Pfeiffer F 

PROVIDER: S-EPMC6053417 | biostudies-literature | 2018 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Systematic evaluation of error rates and causes in short samples in next-generation sequencing.

Pfeiffer Franziska F   Gröber Carsten C   Blank Michael M   Händler Kristian K   Beyer Marc M   Schultze Joachim L JL   Mayer Günter G  

Scientific reports 20180719 1


Next-generation sequencing (NGS) is the method of choice when large numbers of sequences have to be obtained. While the technique is widely applied, varying error rates have been observed. We analysed millions of reads obtained after sequencing of one single sequence on an Illumina sequencer. According to our analysis, the index-PCR for sample preparation has no effect on the observed error rate, even though PCR is traditionally seen as one of the major contributors to enhanced error rates in NG  ...[more]

Similar Datasets

| S-EPMC5587813 | biostudies-literature
| S-EPMC2704438 | biostudies-literature
| S-EPMC3495688 | biostudies-literature
| S-EPMC3776835 | biostudies-literature
| S-EPMC3875435 | biostudies-literature
| S-EPMC4878677 | biostudies-literature
| S-EPMC6401092 | biostudies-literature
| S-EPMC4216915 | biostudies-literature
| S-EPMC6018347 | biostudies-literature
| S-EPMC7953111 | biostudies-literature