Unknown

Dataset Information

0

RCDA: a highly sensitive and specific alternatively spliced transcript assembly tool featuring upstream consecutive exon structures.


ABSTRACT: When applied to complex transcript datasets, current tools for automated assembly of mRNA sequences require long run times and produce exponentially increasing numbers of splice variants. Here, we describe RCDA, a genome-based transcript assembly tool comprising RCluster, that recursively clusters transcripts, and DAssemble, that generates composite transcript sequences through path-finding using a directed acyclic graph. Each exon included in a final transcript is associated with an array of all upstream consecutive exon structures obtained from original transcripts. When a depth-first-search path reaches an exon, the path is retained only if it contains a structure from that exon's array. RCDA assemblies, therefore, include only those transcripts with experimentally supported exon patterns. When applied to >23,000 transcripts from human chromosome 21, using biologically reasonable filters, RCDA execution time was approximately 4h. RCDA outperformed ECgene in reconstructing RefSeq transcripts and in limiting the total number of transcripts and transcripts per gene.

SUBMITTER: Sturgeon XH 

PROVIDER: S-EPMC5470730 | biostudies-literature | 2012 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

RCDA: a highly sensitive and specific alternatively spliced transcript assembly tool featuring upstream consecutive exon structures.

Sturgeon Xiaolu H XH   Gardiner Katheleen J KJ  

Genomics 20120820 6


When applied to complex transcript datasets, current tools for automated assembly of mRNA sequences require long run times and produce exponentially increasing numbers of splice variants. Here, we describe RCDA, a genome-based transcript assembly tool comprising RCluster, that recursively clusters transcripts, and DAssemble, that generates composite transcript sequences through path-finding using a directed acyclic graph. Each exon included in a final transcript is associated with an array of al  ...[more]

Similar Datasets

| S-EPMC3070201 | biostudies-literature
| S-EPMC2975243 | biostudies-literature
| S-EPMC2602786 | biostudies-literature
| S-EPMC2651755 | biostudies-literature
| S-EPMC5698419 | biostudies-literature
2020-07-13 | GSE153660 | GEO
| S-EPMC10006080 | biostudies-literature
| S-EPMC6880636 | biostudies-literature
| S-EPMC4984455 | biostudies-literature
| S-EPMC360225 | biostudies-other