Dataset Information

The fractured landscape of RNA-seq alignment: the default in our STARs.

ABSTRACT: Many tools are available for RNA-seq alignment and expression quantification, with comparative value being hard to establish. Benchmarking assessments often highlight methods' good performance, but are focused on either model data or fail to explain variation in performance. This leaves us to ask, what is the most meaningful way to assess different alignment choices? And importantly, where is there room for progress? In this work, we explore the answers to these two questions by performing an exhaustive assessment of the STAR aligner. We assess STAR's performance across a range of alignment parameters using common metrics, and then on biologically focused tasks. We find technical metrics such as fraction mapping or expression profile correlation to be uninformative, capturing properties unlikely to have any role in biological discovery. Surprisingly, we find that changes in alignment parameters within a wide range have little impact on both technical and biological performance. Yet, when performance finally does break, it happens in difficult regions, such as X-Y paralogs and MHC genes. We believe improved reporting by developers will help establish where results are likely to be robust or fragile, providing a better baseline to establish where methodological progress can still occur.

SUBMITTER: Ballouz S

PROVIDER: S-EPMC6007662 | biostudies-literature | 2018 Jun

REPOSITORIES: biostudies-literature

Publications

The fractured landscape of RNA-seq alignment: the default in our STARs.

Ballouz S, Dobin A, Gingeras TR, Gillis J

Nucleic Acids Research, 2018 Jun 1, issue 10

PMID: 29718481

Similar Datasets

Supersplat--spliced RNA-seq alignment.
| S-EPMC2881391 | biostudies-literature

Comparative analysis of RNA-Seq alignment algorithms and the RNA-Seq unified mapper (RUM).
| S-EPMC3167048 | biostudies-literature

Characteristics of Cross-Hybridization and Cross-Alignment in Pseudo-Xenograft samples by RNA-Seq and Microarrays [RNA-Seq]
| S-ECPF-GEOD-40890 | biostudies-other

iMapSplice: Alleviating reference bias through personalized RNA-seq alignment.
| S-EPMC6086400 | biostudies-literature

Supervised Adversarial Alignment of Single-Cell RNA-seq Data.
| S-EPMC8418522 | biostudies-literature

Limitations of alignment-free tools in total RNA-seq quantification.
| S-EPMC6042521 | biostudies-literature

Systematic evaluation of spliced alignment programs for RNA-seq data.
| S-EPMC4018468 | biostudies-literature

CBA: Cluster-Guided Batch Alignment for Single Cell RNA-seq.
| S-EPMC8076908 | biostudies-literature

Evaluation of tools for long read RNA-seq splice-aware alignment.
| S-EPMC6192213 | biostudies-literature

Evaluation of STAR and Kallisto on Single Cell RNA-Seq Data Alignment.
| S-EPMC7202009 | biostudies-literature