Unknown

Dataset Information

0

Accurate detection of complex structural variations using single-molecule sequencing.


ABSTRACT: Structural variations are the greatest source of genetic variation, but they remain poorly understood because of technological limitations. Single-molecule long-read sequencing has the potential to dramatically advance the field, although high error rates are a challenge with existing methods. Addressing this need, we introduce open-source methods for long-read alignment (NGMLR; https://github.com/philres/ngmlr ) and structural variant identification (Sniffles; https://github.com/fritzsedlazeck/Sniffles ) that provide unprecedented sensitivity and precision for variant detection, even in repeat-rich regions and for complex nested events that can have substantial effects on human health. In several long-read datasets, including healthy and cancerous human genomes, we discovered thousands of novel variants and categorized systematic errors in short-read approaches. NGMLR and Sniffles can automatically filter false events and operate on low-coverage data, thereby reducing the high costs that have hindered the application of long reads in clinical and research settings.

SUBMITTER: Sedlazeck FJ 

PROVIDER: S-EPMC5990442 | biostudies-literature | 2018 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Accurate detection of complex structural variations using single-molecule sequencing.

Sedlazeck Fritz J FJ   Rescheneder Philipp P   Smolka Moritz M   Fang Han H   Nattestad Maria M   von Haeseler Arndt A   Schatz Michael C MC  

Nature methods 20180430 6


Structural variations are the greatest source of genetic variation, but they remain poorly understood because of technological limitations. Single-molecule long-read sequencing has the potential to dramatically advance the field, although high error rates are a challenge with existing methods. Addressing this need, we introduce open-source methods for long-read alignment (NGMLR; https://github.com/philres/ngmlr ) and structural variant identification (Sniffles; https://github.com/fritzsedlazeck/  ...[more]

Similar Datasets

| S-EPMC8193640 | biostudies-literature
| S-EPMC4970289 | biostudies-literature
| S-EPMC3912422 | biostudies-literature
| S-EPMC10300613 | biostudies-literature
| S-EPMC5709945 | biostudies-literature
| S-EPMC9493964 | biostudies-literature
| S-EPMC8896822 | biostudies-literature
| S-EPMC10497132 | biostudies-literature
| S-EPMC10324673 | biostudies-literature
| S-EPMC7545150 | biostudies-literature