Dataset Information

RNA-eXpress annotates novel transcript features in RNA-seq data.

ABSTRACT: Next-generation sequencing is rapidly becoming the approach of choice for transcriptional analysis experiments. Substantial advances have been achieved in computational approaches to support these technologies. These approaches typically rely on existing transcript annotations, introducing a bias towards known genes, require specific experimental design and computational resources, or focus only on identification of splice variants (ignoring other biologically relevant transcribed features contained within the data that may be important for downstream analysis). Biologically relevant transcribed features also include large and small non-coding RNA, new transcription start sites, alternative promoters, RNA editing and processing of coding transcripts. Also, many existing solutions lack accessible interfaces required for wide scale adoption. We present a user-friendly, rapid and computation-efficient feature annotation framework (RNA-eXpress) that enables identification of transcripts and other genomic and transcriptional features independently of current annotations. RNA-eXpress accepts mapped reads in the standard binary alignment (BAM) format and produces a study-specific feature annotation in GTF format, comparison statistics, sequence extraction and feature counts. The framework is designed to be easily accessible while allowing advanced users to integrate new feature-identification algorithms through simple class extension, thus facilitating expansion to novel feature types or identification of study-specific feature types.

SUBMITTER: Forster SC

PROVIDER: S-EPMC3597146 | biostudies-literature | 2013 Mar

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

RNA-eXpress annotates novel transcript features in RNA-seq data.

Forster Samuel C SC Finkel Alexander M AM Gould Jodee A JA Hertzog Paul J PJ

Bioinformatics (Oxford, England) 20130208 6

Next-generation sequencing is rapidly becoming the approach of choice for transcriptional analysis experiments. Substantial advances have been achieved in computational approaches to support these technologies. These approaches typically rely on existing transcript annotations, introducing a bias towards known genes, require specific experimental design and computational resources, or focus only on identification of splice variants (ignoring other biologically relevant transcribed features conta ...[more]

PMID: 23396121

Dataset Information

RNA-eXpress annotates novel transcript features in RNA-seq data.

Publications

RNA-eXpress annotates novel transcript features in RNA-seq data.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Measure transcript integrity using RNA-seq data.
| S-EPMC4739097 | biostudies-literature

ChimeRScope: a novel alignment-free algorithm for fusion transcript prediction using paired-end RNA-Seq data.
| S-EPMC5737728 | biostudies-literature

Transcript annotation of Chinese sturgeon (Acipenser sinensis) using Iso-seq and RNA-seq data.
| S-EPMC9950146 | biostudies-literature

A dual transcript-discovery approach to improve the delimitation of gene features from RNA-seq data in the chicken model.
| S-EPMC5827264 | biostudies-literature

Vaeda computationally annotates doublets in single-cell RNA sequencing data.
| S-EPMC9805559 | biostudies-literature

Terminus enables the discovery of data-driven, robust transcript groups from RNA-seq data.
| S-EPMC7355257 | biostudies-literature

Context-aware transcript quantification from long-read RNA-seq data with Bambu.
| S-EPMC10448944 | biostudies-literature

Fast and accurate approximate inference of transcript expression from RNA-seq data.
| S-EPMC4673974 | biostudies-literature

RNA-seq: impact of RNA degradation on transcript quantification.
| S-EPMC4071332 | biostudies-literature

A novel min-cost flow method for estimating transcript expression with RNA-Seq.
| S-EPMC3622638 | biostudies-literature