Unknown

Dataset Information

0

Re-annotation of eight Drosophila genomes.


ABSTRACT: The sequenced genomes of the Drosophila phylogeny are a central resource for comparative work supporting the understanding of the Drosophila melanogaster non-mammalian model system. These have also facilitated evolutionary studies on the selected and random differences that distinguish the thousands of extant species of Drosophila. However, full utility has been hampered by uneven genome annotation. We have generated a large expression profile dataset for nine species of Drosophila and trained a transcriptome assembly approach on D. melanogaster that best matched the extensively curated annotation. We then applied this to the other species to add more than 10000 transcript models per species. We also developed new orthologs to facilitate cross-species comparisons. We validated the new annotation of the distantly related Drosophila grimshawi with an extensive collection of newly sequenced cDNAs. This re-annotation will facilitate understanding both the core commonalities and the species differences in this important group of model organisms, and suggests a strategy for annotating the many forthcoming genomes covering the tree of life.

SUBMITTER: Yang H 

PROVIDER: S-EPMC6305970 | biostudies-literature | 2018 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Re-annotation of eight <i>Drosophila</i> genomes.

Yang Haiwang H   Jaime Maria M   Polihronakis Maxi M   Kanegawa Kelvin K   Markow Therese T   Kaneshiro Kenneth K   Oliver Brian B  

Life science alliance 20181224 6


The sequenced genomes of the <i>Drosophila</i> phylogeny are a central resource for comparative work supporting the understanding of the <i>Drosophila melanogaster</i> non-mammalian model system. These have also facilitated evolutionary studies on the selected and random differences that distinguish the thousands of extant species of <i>Drosophila</i>. However, full utility has been hampered by uneven genome annotation. We have generated a large expression profile dataset for nine species of <i>  ...[more]

Similar Datasets

2017-01-30 | GSE91066 | GEO
| S-EPMC5714196 | biostudies-literature
2017-08-09 | PXD005844 | Pride
2017-08-03 | PXD005901 | Pride
| S-EPMC3694665 | biostudies-literature
| S-EPMC1899501 | biostudies-literature
| S-EPMC3548604 | biostudies-literature
| S-EPMC6627511 | biostudies-literature
| S-EPMC3686433 | biostudies-literature
| S-EPMC2712747 | biostudies-literature