Transcriptomics

Dataset Information

0

Comparative Validation of the D. melanogaster Encyclopedia of DNA Elements Transcript Models


ABSTRACT: The model organism Encyclopedia of DNA Elements project (modENCODE) has produced a comprehensive annotation of D. melanogaster transcript models based on an enormous amount of high-throughput experimental data. However, some transcribed elements may not be functional, and technical artifacts may lead to erroneous inference of transcription. Inter-species comparison provides confidence to predicted annotation, since transcriptional activity that has been evolutionarily conserved is likely to have an advantageous function. We have performed RNA-Seq and CAGE-Seq experiments on more than 80 samples from multiple tissues and stages of 15 Drosophila species, including 8 previously unsequenced genomes. We have found strikingly conserved sequence, expression, and splicing for the vast majority of transcript models in modENCODE annotation (e.g. 99% exons of coding sequences (CDS), 88% exons of untranslated regions (UTR), and 87% splicing events), indicating that the transcriptome annotation is of very high quality. We also describe dynamic transcriptome evolution within the Drosophila genus, including conserved promoter structure, labile positions of transcription start sites, and rapidly evolving RNA-editing events. We demonstrate how this phylogenetic approach to DNA element validation will prove useful in the annotation of other high priority genomes, especially for genomes that are less compact than Drosophila (e.g. the vast majority of vertebrate genomes).

ORGANISM(S): Drosophila elegans Drosophila ananassae Drosophila kikkawai Drosophila yakuba Drosophila pseudoobscura Drosophila takahashii Drosophila ficusphila Drosophila mojavensis Drosophila simulans Drosophila bipectinata Drosophila rhopaloa Drosophila eugracilis Drosophila melanogaster Drosophila virilis Drosophila biarmipes

PROVIDER: GSE44612 | GEO | 2013/02/25

REPOSITORIES: GEO

Similar Datasets

2013-02-25 | E-GEOD-44612 | biostudies-arrayexpress
| PRJNA63449 | ENA
| PRJNA63477 | ENA
2013-12-31 | GSE49879 | GEO
2010-04-09 | E-GEOD-21152 | biostudies-arrayexpress
| PRJNA119899 | ENA
2016-03-24 | GSE79530 | GEO
2021-09-01 | ST001926 | MetabolomicsWorkbench
2020-09-04 | GSE134055 | GEO
2005-06-03 | E-GEOD-2347 | biostudies-arrayexpress