Unknown

Dataset Information

0

NxRepair: error correction in de novo sequence assembly using Nextera mate pairs.


ABSTRACT: Scaffolding errors and incorrect repeat disambiguation during de novo assembly can result in large scale misassemblies in draft genomes. Nextera mate pair sequencing data provide additional information to resolve assembly ambiguities during scaffolding. Here, we introduce NxRepair, an open source toolkit for error correction in de novo assemblies that uses Nextera mate pair libraries to identify and correct large-scale errors. We show that NxRepair can identify and correct large scaffolding errors, without use of a reference sequence, resulting in quantitative improvements in the assembly quality. NxRepair can be downloaded from GitHub or PyPI, the Python Package Index; a tutorial and user documentation are also available.

SUBMITTER: Murphy RR 

PROVIDER: S-EPMC4458127 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature

altmetric image

Publications

NxRepair: error correction in de novo sequence assembly using Nextera mate pairs.

Murphy Rebecca R RR   O'Connell Jared J   Cox Anthony J AJ   Schulz-Trieglaff Ole O  

PeerJ 20150602


Scaffolding errors and incorrect repeat disambiguation during de novo assembly can result in large scale misassemblies in draft genomes. Nextera mate pair sequencing data provide additional information to resolve assembly ambiguities during scaffolding. Here, we introduce NxRepair, an open source toolkit for error correction in de novo assemblies that uses Nextera mate pair libraries to identify and correct large-scale errors. We show that NxRepair can identify and correct large scaffolding erro  ...[more]

Similar Datasets

| S-EPMC3707490 | biostudies-literature
| S-EPMC6545690 | biostudies-literature
| S-EPMC2824677 | biostudies-literature
| S-EPMC5563063 | biostudies-other
| S-EPMC4617970 | biostudies-literature
| S-EPMC3103447 | biostudies-other
| S-EPMC534640 | biostudies-literature
| S-EPMC6123912 | biostudies-literature
| S-EPMC3272011 | biostudies-literature
| S-EPMC3728768 | biostudies-literature