
Dataset Information


SOAPindel: efficient identification of indels from short paired reads.


ABSTRACT: We present a new approach to indel calling that explicitly exploits the fact that indel differences between a reference and a sequenced sample make the mapping of reads less efficient. We assign all unmapped reads with a mapped partner to their expected genomic positions and then perform extensive de novo assembly on the regions with many unmapped reads to resolve homozygous, heterozygous, and complex indels by exhaustive traversal of the de Bruijn graph. The method is implemented in the software SOAPindel and provides a list of candidate indels with quality scores. We compare SOAPindel to Dindel, Pindel, and GATK on simulated data and find similar or better performance for short indels (<10 bp) and higher sensitivity and specificity for long indels. A validation experiment suggests that SOAPindel has a false-positive rate of ~10% for long indels (>5 bp), while still providing many more candidate indels than other approaches.
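The two ideas named in the abstract (placing an unmapped read from its mapped partner, then enumerating paths through a local de Bruijn graph) can be illustrated with a toy sketch. This is not SOAPindel's code: the function names, the fixed `insert_size`, the 0-based leftmost coordinate convention, and the depth-bounded search are all assumptions made for illustration only.

```python
from collections import defaultdict

def expected_position(mate_pos, mate_is_forward, insert_size, read_len):
    """Estimate the leftmost position of an unmapped read from its mapped mate.

    Assumes a standard inward-facing pair and a single fixed insert size
    (hypothetical simplification; real libraries have a size distribution).
    """
    if mate_is_forward:
        # Fragment spans [mate_pos, mate_pos + insert_size); the unmapped read is its right end.
        return mate_pos + insert_size - read_len
    # Mate is the right end of the fragment; the unmapped read is its left end.
    return mate_pos + read_len - insert_size

def build_de_bruijn(reads, k):
    """Build a de Bruijn graph whose nodes are (k-1)-mers and whose edges are k-mers."""
    graph = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            graph[kmer[:-1]].add(kmer[1:])
    return graph

def enumerate_paths(graph, start, end, max_nodes):
    """Exhaustively list sequences spelled by paths from start to end.

    max_nodes bounds the path length so that cycles cannot loop forever.
    """
    sequences = []
    stack = [(start, [start])]
    while stack:
        node, path = stack.pop()
        if node == end and len(path) > 1:
            # Spell the path: first node plus the last base of each later node.
            sequences.append(path[0] + "".join(n[-1] for n in path[1:]))
            continue
        if len(path) >= max_nodes:
            continue
        for nxt in sorted(graph.get(node, ())):
            stack.append((nxt, path + [nxt]))
    return sequences

# Toy heterozygous site: one allele carries a 3 bp deletion ("TAG") relative to the other.
toy_reads = ["ATGCCGTAGGCTA", "ATGCCGGCTA"]
toy_graph = build_de_bruijn(toy_reads, k=4)
toy_alleles = enumerate_paths(toy_graph, "ATG", "CTA", max_nodes=20)
```

In this toy case the graph contains a single bubble, so the traversal yields two sequences, one per allele. Comparing each against the reference would then expose the indel. A real caller would additionally need coverage thresholds, error pruning, and quality scoring, none of which are modeled here.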

SUBMITTER: Li S 

PROVIDER: S-EPMC3530679 | biostudies-literature | 2013 Jan

REPOSITORIES: biostudies-literature


Publications

SOAPindel: efficient identification of indels from short paired reads.

Li Shengting, Li Ruiqiang, Li Heng, Lu Jianliang, Li Yingrui, Bolund Lars, Schierup Mikkel H, Wang Jun

Genome research, 2012-09-12, issue 1



Similar Datasets

| S-EPMC3614465 | biostudies-other
| S-EPMC3919575 | biostudies-literature
| S-EPMC7168855 | biostudies-literature
| S-EPMC3161018 | biostudies-literature
| S-EPMC3158087 | biostudies-literature
| S-EPMC3527383 | biostudies-literature
| S-EPMC3143109 | biostudies-literature
| S-EPMC6668410 | biostudies-literature
| S-EPMC3076424 | biostudies-literature
| S-EPMC4674864 | biostudies-literature