Unknown

Dataset Information

0

Performance difference of graph-based and alignment-based hybrid error correction methods for error-prone long reads.


ABSTRACT: The error-prone third-generation sequencing (TGS) long reads can be corrected by the high-quality second-generation sequencing (SGS) short reads, which is referred to as hybrid error correction. We here investigate the influences of the principal algorithmic factors of two major types of hybrid error correction methods by mathematical modeling and analysis on both simulated and real data. Our study reveals the distribution of accuracy gain with respect to the original long read error rate. We also demonstrate that the original error rate of 19% is the limit for perfect correction, beyond which long reads are too error-prone to be corrected by these methods.

SUBMITTER: Wang A 

PROVIDER: S-EPMC6966875 | biostudies-literature | 2020 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Performance difference of graph-based and alignment-based hybrid error correction methods for error-prone long reads.

Wang Anqi A   Au Kin Fai KF  

Genome biology 20200117 1


The error-prone third-generation sequencing (TGS) long reads can be corrected by the high-quality second-generation sequencing (SGS) short reads, which is referred to as hybrid error correction. We here investigate the influences of the principal algorithmic factors of two major types of hybrid error correction methods by mathematical modeling and analysis on both simulated and real data. Our study reveals the distribution of accuracy gain with respect to the original long read error rate. We al  ...[more]

Similar Datasets

| S-EPMC6362602 | biostudies-literature
| S-EPMC6265270 | biostudies-literature
| S-EPMC3707490 | biostudies-literature
| S-EPMC6028576 | biostudies-literature
| S-EPMC7792008 | biostudies-literature
| S-EPMC5661950 | biostudies-literature
| S-EPMC9639292 | biostudies-literature
| S-EPMC5066169 | biostudies-literature
| S-EPMC8170817 | biostudies-literature
| S-EPMC2978382 | biostudies-literature