Dataset Information

RNA variant identification discrepancy among splice-aware alignment algorithms.

ABSTRACT: Next-generation sequencing (NGS) techniques have been generating various molecular maps, including transcriptomes via RNA-seq. Although the primary purpose of RNA-seq is to quantify the expression level of known genes, RNA variants are also identifiable. However, care must be taken to account for RNA's dynamic nature. In this study, we evaluated the following popular splice-aware alignment algorithms in the context of RNA variant-calling analysis: HISAT2, STAR, STAR (two-pass mode), Subread, and Subjunc. For this, we performed RNA-seq with ten pieces of invasive ductal carcinoma from breast tissue and three pieces of adjacent normal tissue from a single patient. These RNA-seq data were used to evaluate the performance of splice-aware aligners. Surprisingly, the number of common potential RNA editing sites (pRESs) identified by all alignment algorithms was less than 2% of the total. The main cause of this difference was the mapped reads on the splice junctions. In addition, the RNA quality significantly affected the outcome. Therefore, researchers must consider these experimental and bioinformatic features during RNA variant analysis. Further investigations of common pRESs discovered that BDH1, CCDC137, and TBC1D10A transcripts contained a single non-synonymous RNA variant that was unique to breast cancer tissue compared to adjacent normal tissue; thus, further clinical validation is required.

SUBMITTER: Hong JH

PROVIDER: S-EPMC6072070 | biostudies-literature | 2018

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

RNA variant identification discrepancy among splice-aware alignment algorithms.

Hong Ji Hyung JH Ko Yoon Ho YH Kang Keunsoo K

PloS one 20180802 8

Next-generation sequencing (NGS) techniques have been generating various molecular maps, including transcriptomes via RNA-seq. Although the primary purpose of RNA-seq is to quantify the expression level of known genes, RNA variants are also identifiable. However, care must be taken to account for RNA's dynamic nature. In this study, we evaluated the following popular splice-aware alignment algorithms in the context of RNA variant-calling analysis: HISAT2, STAR, STAR (two-pass mode), Subread, and ...[more]

PMID: 30071094

Dataset Information

RNA variant identification discrepancy among splice-aware alignment algorithms.

Publications

RNA variant identification discrepancy among splice-aware alignment algorithms.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Evaluation of tools for long read RNA-seq splice-aware alignment.
| S-EPMC6192213 | biostudies-literature

A practical evaluation of alignment algorithms for RNA variant calling analysis
2018-02-06 | GSE110114 | GEO

Evaluation of alignment algorithms for discovery and identification of pathogens using RNA-Seq.
| S-EPMC3813700 | biostudies-literature

Benchmarking splice variant prediction algorithms using massively parallel splicing assays.
| S-EPMC10187268 | biostudies-literature

Benchmarking splice variant prediction algorithms using massively parallel splicing assays.
| S-EPMC10734170 | biostudies-literature

A practical evaluation of alignment algorithms for RNA variant calling analysis
| PRJNA432903 | ENA

Comparative analysis of RNA-Seq alignment algorithms and the RNA-Seq unified mapper (RUM).
| S-EPMC3167048 | biostudies-literature

A comprehensive evaluation of alignment algorithms in the context of RNA-seq.
| S-EPMC3530550 | biostudies-literature

Comparative Analysis of RNA-Seq Alignment Algorithms and the RNA-Seq Unified Mapper (RUM).
2011-08-03 | E-GEOD-26248 | biostudies-arrayexpress

Comparative Analysis of RNA-Seq Alignment Algorithms and the RNA-Seq Unified Mapper (RUM).
2011-08-03 | GSE26248 | GEO