Unknown

Dataset Information

0

INDELseek: detection of complex insertions and deletions from next-generation sequencing data.


ABSTRACT: Complex insertions and deletions (indels) from next-generation sequencing (NGS) data were prone to escape detection by currently available variant callers as shown by large-scale human genomics studies. Somatic and germline complex indels in key disease driver genes could be missed in NGS-based genomics studies.INDELseek is an open-source complex indel caller designed for NGS data of random fragments and PCR amplicons. The key differentiating factor of INDELseek is that each NGS read alignment was examined as a whole instead of "pileup" of each reference position across multiple alignments. In benchmarking against the reference material NA12878 genome (n?=?160 derived from high-confidence variant calls), GATK, SAMtools and INDELseek showed complex indel detection sensitivities of 0%, 0% and 100%, respectively. INDELseek also detected all known germline (BRCA1 and BRCA2) and somatic (CALR and JAK2) complex indels in human clinical samples (n?=?8). Further experiments validated all 10 detected KIT complex indels in a discovery cohort of clinical samples. In silico semi-simulation showed sensitivities of 93.7-96.2% based on 8671 unique complex indels in >5000 genes from dbSNP and COSMIC. We also demonstrated the importance of complex indel detection in accurately annotating BRCA1, BRCA2 and TP53 mutations with gained or rescued protein-truncating effects.INDELseek is an accurate and versatile tool for complex indel detection in NGS data. It complements other variant callers in NGS-based genomics studies targeting a wide spectrum of genetic variations.

SUBMITTER: Au CH 

PROVIDER: S-EPMC5217656 | biostudies-literature | 2017 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

INDELseek: detection of complex insertions and deletions from next-generation sequencing data.

Au Chun Hang CH   Leung Anskar Y H AY   Kwong Ava A   Chan Tsun Leung TL   Ma Edmond S K ES  

BMC genomics 20170105 1


<h4>Background</h4>Complex insertions and deletions (indels) from next-generation sequencing (NGS) data were prone to escape detection by currently available variant callers as shown by large-scale human genomics studies. Somatic and germline complex indels in key disease driver genes could be missed in NGS-based genomics studies.<h4>Results</h4>INDELseek is an open-source complex indel caller designed for NGS data of random fragments and PCR amplicons. The key differentiating factor of INDELsee  ...[more]

Similar Datasets

| S-EPMC5549930 | biostudies-other
| S-EPMC2865866 | biostudies-literature
| S-EPMC5657046 | biostudies-literature
| S-EPMC3371040 | biostudies-literature
| S-EPMC7734255 | biostudies-literature
| S-EPMC3765144 | biostudies-literature
2017-04-03 | PXD003804 | Pride
| S-EPMC8138798 | biostudies-literature
| S-EPMC3437896 | biostudies-other