Unknown

Dataset Information

0

RNAIndel: discovering somatic coding indels from tumor RNA-Seq data.


ABSTRACT: MOTIVATION:Reliable identification of expressed somatic insertions/deletions (indels) is an unmet need due to artifacts generated in PCR-based RNA-Seq library preparation and the lack of normal RNA-Seq data, presenting analytical challenges for discovery of somatic indels in tumor transcriptome. RESULTS:We present RNAIndel, a tool for predicting somatic, germline and artifact indels from tumor RNA-Seq data. RNAIndel leverages features derived from indel sequence context and biological effect in a machine-learning framework. Except for tumor samples with microsatellite instability, RNAIndel robustly predicts 88-100% of somatic indels in five diverse test datasets of pediatric and adult cancers, even recovering subclonal (VAF range 0.01-0.15) driver indels missed by targeted deep-sequencing, outperforming the current best-practice for RNA-Seq variant calling which had 57% sensitivity but with 14 times more false positives. AVAILABILITY AND IMPLEMENTATION:RNAIndel is freely available at https://github.com/stjude/RNAIndel. SUPPLEMENTARY INFORMATION:Supplementary data are available at Bioinformatics online.

SUBMITTER: Hagiwara K 

PROVIDER: S-EPMC7523641 | biostudies-literature | 2020 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

RNAIndel: discovering somatic coding indels from tumor RNA-Seq data.

Hagiwara Kohei K   Ding Liang L   Edmonson Michael N MN   Rice Stephen V SV   Newman Scott S   Easton John J   Dai Juncheng J   Meshinchi Soheil S   Ries Rhonda E RE   Rusch Michael M   Zhang Jinghui J  

Bioinformatics (Oxford, England) 20200301 5


<h4>Motivation</h4>Reliable identification of expressed somatic insertions/deletions (indels) is an unmet need due to artifacts generated in PCR-based RNA-Seq library preparation and the lack of normal RNA-Seq data, presenting analytical challenges for discovery of somatic indels in tumor transcriptome.<h4>Results</h4>We present RNAIndel, a tool for predicting somatic, germline and artifact indels from tumor RNA-Seq data. RNAIndel leverages features derived from indel sequence context and biolog  ...[more]

Similar Datasets

| S-EPMC8373110 | biostudies-literature
| S-EPMC4523039 | biostudies-literature
| S-EPMC2672628 | biostudies-literature
| S-EPMC8742092 | biostudies-literature
| S-EPMC9279659 | biostudies-literature
| S-EPMC6074801 | biostudies-literature
| S-EPMC8208744 | biostudies-literature
| S-EPMC4787191 | biostudies-literature
| S-EPMC8551294 | biostudies-literature
| S-EPMC9646749 | biostudies-literature