Dataset Information

Efficient RNA isoform identification and quantification from RNA-Seq data with network flows.

ABSTRACT:

Motivation

Several state-of-the-art methods for isoform identification and quantification are based on [Formula: see text]-regularized regression, such as the Lasso. However, explicitly listing the-possibly exponentially-large set of candidate transcripts is intractable for genes with many exons. For this reason, existing approaches using the [Formula: see text]-penalty are either restricted to genes with few exons or only run the regression algorithm on a small set of preselected isoforms.

Results

We introduce a new technique called FlipFlop, which can efficiently tackle the sparse estimation problem on the full set of candidate isoforms by using network flow optimization. Our technique removes the need of a preselection step, leading to better isoform identification while keeping a low computational cost. Experiments with synthetic and real RNA-Seq data confirm that our approach is more accurate than alternative methods and one of the fastest available.

Availability and implementation

Source code is freely available as an R package from the Bioconductor Web site (http://www.bioconductor.org/), and more information is available at http://cbio.ensmp.fr/flipflop.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Bernard E

PROVIDER: S-EPMC4147886 | biostudies-literature | 2014 Sep

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Efficient RNA isoform identification and quantification from RNA-Seq data with network flows.

Bernard Elsa E Jacob Laurent L Mairal Julien J Vert Jean-Philippe JP

Bioinformatics (Oxford, England) 20140509 17

<h4>Motivation</h4>Several state-of-the-art methods for isoform identification and quantification are based on [Formula: see text]-regularized regression, such as the Lasso. However, explicitly listing the-possibly exponentially-large set of candidate transcripts is intractable for genes with many exons. For this reason, existing approaches using the [Formula: see text]-penalty are either restricted to genes with few exons or only run the regression algorithm on a small set of preselected isofor ...[more]

PMID: 24813214

Dataset Information

Efficient RNA isoform identification and quantification from RNA-Seq data with network flows.

Motivation

Results

Availability and implementation

Supplementary information

Publications

Efficient RNA isoform identification and quantification from RNA-Seq data with network flows.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Network-Based Isoform Quantification with RNA-Seq Data for Cancer Transcriptome Analysis.
| S-EPMC4689380 | biostudies-literature

Towards reliable isoform quantification using RNA-SEQ data.
| S-EPMC2863065 | biostudies-literature

WemIQ: an accurate and robust isoform quantification method for RNA-seq data.
| S-EPMC4380033 | biostudies-literature

Simultaneous isoform discovery and quantification from RNA-seq.
| S-EPMC3718502 | biostudies-literature

Alternating EM algorithm for a bilinear model in isoform quantification from RNA-seq data.
| S-EPMC9883676 | biostudies-literature

Quantification of mutant-allele expression at isoform level in cancer from RNA-seq data.
| S-EPMC9278039 | biostudies-literature

Acfs: accurate circRNA identification and quantification from RNA-Seq data.
| S-EPMC5144000 | biostudies-literature

DELongSeq for efficient detection of differential isoform expression from long-read RNA-seq data.
| S-EPMC9985341 | biostudies-literature

Comparative evaluation of full-length isoform quantification from RNA-Seq.
| S-EPMC8145802 | biostudies-literature

Simulation-based benchmarking of isoform quantification in single-cell RNA-seq.
| S-EPMC6223048 | biostudies-literature