Unknown

Dataset Information

0

CGAL: computing genome assembly likelihoods.


ABSTRACT: Assembly algorithms have been extensively benchmarked using simulated data so that results can be compared to ground truth. However, in de novo assembly, only crude metrics such as contig number and size are typically used to evaluate assembly quality. We present CGAL, a novel likelihood-based approach to assembly assessment in the absence of a ground truth. We show that likelihood is more accurate than other metrics currently used for evaluating assemblies, and describe its application to the optimization and comparison of assembly algorithms. Our methods are implemented in software that is freely available at http://bio.math.berkeley.edu/cgal/.

SUBMITTER: Rahman A 

PROVIDER: S-EPMC3663106 | biostudies-literature | 2013 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

CGAL: computing genome assembly likelihoods.

Rahman Atif A   Pachter Lior L  

Genome biology 20130129 1


Assembly algorithms have been extensively benchmarked using simulated data so that results can be compared to ground truth. However, in de novo assembly, only crude metrics such as contig number and size are typically used to evaluate assembly quality. We present CGAL, a novel likelihood-based approach to assembly assessment in the absence of a ground truth. We show that likelihood is more accurate than other metrics currently used for evaluating assemblies, and describe its application to the o  ...[more]

Similar Datasets

| S-EPMC6602551 | biostudies-literature
| S-EPMC5693259 | biostudies-literature
| S-EPMC11208726 | biostudies-literature
| PRJEB50026 | ENA
| S-EPMC3932042 | biostudies-literature
| S-EPMC1456855 | biostudies-literature
| S-EPMC5850616 | biostudies-literature
| S-EPMC5499160 | biostudies-literature
| S-EPMC9931113 | biostudies-literature
| S-EPMC5967857 | biostudies-literature