Unknown

Dataset Information

0

Indel detection from DNA and RNA sequencing data with transIndel.


ABSTRACT: BACKGROUND:Insertions and deletions (indels) are a major class of genomic variation associated with human disease. Indels are primarily detected from DNA sequencing (DNA-seq) data but their transcriptional consequences remain unexplored due to challenges in discriminating medium-sized and large indels from splicing events in RNA-seq data. RESULTS:Here, we developed transIndel, a splice-aware algorithm that parses the chimeric alignments predicted by a short read aligner and reconstructs the mid-sized insertions and large deletions based on the linear alignments of split reads from DNA-seq or RNA-seq data. TransIndel exhibits competitive or superior performance over eight state-of-the-art indel detection tools on benchmarks using both synthetic and real DNA-seq data. Additionally, we applied transIndel to DNA-seq and RNA-seq datasets from 333 primary prostate cancer patients from The Cancer Genome Atlas (TCGA) and 59 metastatic prostate cancer patients from AACR-PCF Stand-Up- To-Cancer (SU2C) studies. TransIndel enhanced the taxonomy of DNA- and RNA-level alterations in prostate cancer by identifying recurrent FOXA1 indels as well as exitron splicing in genes implicated in disease progression. CONCLUSIONS:Our study demonstrates that transIndel is a robust tool for elucidation of medium- and large-sized indels from DNA-seq and RNA-seq data. Including RNA-seq in indel discovery efforts leads to significant improvements in sensitivity for identification of med-sized and large indels missed by DNA-seq, and reveals non-canonical RNA-splicing events in genes associated with disease pathology.

SUBMITTER: Yang R 

PROVIDER: S-EPMC5909256 | biostudies-literature | 2018 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Indel detection from DNA and RNA sequencing data with transIndel.

Yang Rendong R   Van Etten Jamie L JL   Dehm Scott M SM  

BMC genomics 20180419 1


<h4>Background</h4>Insertions and deletions (indels) are a major class of genomic variation associated with human disease. Indels are primarily detected from DNA sequencing (DNA-seq) data but their transcriptional consequences remain unexplored due to challenges in discriminating medium-sized and large indels from splicing events in RNA-seq data.<h4>Results</h4>Here, we developed transIndel, a splice-aware algorithm that parses the chimeric alignments predicted by a short read aligner and recons  ...[more]

Similar Datasets

| S-EPMC6157028 | biostudies-literature
| S-EPMC6735753 | biostudies-literature
| S-EPMC5862335 | biostudies-literature
| S-EPMC6597088 | biostudies-literature
| S-EPMC6142223 | biostudies-literature
| S-EPMC3149584 | biostudies-literature
| S-EPMC5507611 | biostudies-literature
| S-EPMC4232354 | biostudies-literature
| S-EPMC4240813 | biostudies-literature
| S-EPMC3852351 | biostudies-literature