Dataset Information

ScanIndel: a hybrid framework for indel detection via gapped alignment, split reads and de novo assembly.


ABSTRACT: Comprehensive identification of insertions/deletions (indels) across the full size spectrum from second generation sequencing is challenging due to the relatively short read length inherent in the technology. Different indel calling methods exist but are limited in detection to specific sizes with varying accuracy and resolution. We present ScanIndel, an integrated framework for detecting indels with multiple heuristics including gapped alignment, split reads and de novo assembly. Using simulation data, we demonstrate ScanIndel's superior sensitivity and specificity relative to several state-of-the-art indel callers across various coverage levels and indel sizes. ScanIndel yields higher predictive accuracy with lower computational cost compared with existing tools for both targeted resequencing data from tumor specimens and high coverage whole-genome sequencing data from the human NIST standard NA12878. Thus, we anticipate ScanIndel will improve indel analysis in both clinical and research settings. ScanIndel is implemented in Python, and is freely available for academic use at https://github.com/cauyrd/ScanIndel.

SUBMITTER: Yang R 

PROVIDER: S-EPMC4671222 | biostudies-literature | 2015 Dec

REPOSITORIES: biostudies-literature

Publications

ScanIndel: a hybrid framework for indel detection via gapped alignment, split reads and de novo assembly.

Rendong Yang, Andrew C Nelson, Christine Henzler, Bharat Thyagarajan, Kevin A T Silverstein

Genome Medicine, 2015-12-07


Similar Datasets

| S-EPMC5884821 | biostudies-literature
| S-EPMC3707490 | biostudies-literature
| S-EPMC9839709 | biostudies-literature
| S-EPMC7879691 | biostudies-literature
| S-EPMC10997618 | biostudies-literature
| S-EPMC4682372 | biostudies-literature
| S-EPMC3158087 | biostudies-literature
| S-EPMC4582210 | biostudies-literature
| S-EPMC5411768 | biostudies-literature
| S-EPMC3389770 | biostudies-literature