Dataset Information

ARBitR: an overlap-aware genome assembly scaffolder for linked reads.


ABSTRACT:

Summary

Linked genomic sequencing reads contain information that can be used to join sequences together into scaffolds in draft genome assemblies. Existing software for this purpose scaffolds by joining sequences with a gap between them and does not consider potential overlaps between contigs. We developed ARBitR to create scaffolds where overlaps are taken into account and show that it can accurately recreate regions where draft assemblies are broken.
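The difference between gap-based and overlap-aware joining can be illustrated with a minimal sketch. This is not ARBitR's implementation: the function names `suffix_prefix_overlap` and `join_contigs`, the exact-match overlap test, and the `min_len` and `gap_size` parameters are all assumptions made for illustration. A real scaffolder would also need to tolerate mismatches and use linked-read evidence to decide which contigs to join.

```python
def suffix_prefix_overlap(left: str, right: str, min_len: int = 3) -> int:
    """Return the length of the longest suffix of `left` that exactly
    matches a prefix of `right`, or 0 if none reaches `min_len`.
    (Hypothetical helper; exact matching is a simplifying assumption.)"""
    min_len = max(1, min_len)  # a zero-length overlap is not meaningful
    max_len = min(len(left), len(right))
    # Try the longest candidate first so the first hit is the longest overlap.
    for k in range(max_len, min_len - 1, -1):
        if left[-k:] == right[:k]:
            return k
    return 0


def join_contigs(left: str, right: str, min_len: int = 3, gap_size: int = 10) -> str:
    """Join two contigs that are assumed to be adjacent and in the same
    orientation. If they overlap, merge them through the overlap;
    otherwise fall back to a gap of `gap_size` N characters."""
    k = suffix_prefix_overlap(left, right, min_len)
    if k:
        # Overlap-aware join: keep the shared sequence only once.
        return left + right[k:]
    # Gap-based join: the conventional scaffolding behaviour.
    return left + "N" * gap_size + right


if __name__ == "__main__":
    # Contigs sharing the 4-base overlap "TGCA" are merged without a gap.
    print(join_contigs("ACGTTGCA", "TGCAGGCT"))
    # Contigs with no overlap are separated by a run of N.
    print(join_contigs("AAAA", "CCCC", gap_size=5))
```

In this toy setting, a gap-only joiner would duplicate the shared bases on either side of the gap, whereas the overlap-aware path reconstructs the sequence across the break.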

Availability and implementation

ARBitR is implemented in Python 3 for Unix-based operating systems. All source code is available at https://github.com/markhilt/ARBitR under the GNU General Public License v3.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Hiltunen M 

PROVIDER: S-EPMC8352505 | biostudies-literature

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC9508831 | biostudies-literature
| S-EPMC5834899 | biostudies-literature
| S-EPMC6879002 | biostudies-literature
| S-EPMC9710574 | biostudies-literature
| S-EPMC8549298 | biostudies-literature
| S-EPMC6881392 | biostudies-literature
| S-EPMC6821208 | biostudies-literature
| S-EPMC6350039 | biostudies-literature
| S-EPMC8092372 | biostudies-literature
| S-EPMC6030987 | biostudies-literature