Unknown

Dataset Information

0

Similar Ratios of Introns to Intergenic Sequence across Animal Genomes.


ABSTRACT: One central goal of genome biology is to understand how the usage of the genome differs between organisms. Our knowledge of genome composition, needed for downstream inferences, is critically dependent on gene annotations, yet problems associated with gene annotation and assembly errors are usually ignored in comparative genomics. Here, we analyze the genomes of 68 species across 12 animal phyla and some single-cell eukaryotes for general trends in genome composition and transcription, taking into account problems of gene annotation. We show that, regardless of genome size, the ratio of introns to intergenic sequence is comparable across essentially all animals, with nearly all deviations dominated by increased intergenic sequence. Genomes of model organisms have ratios much closer to 1:1, suggesting that the majority of published genomes of nonmodel organisms are underannotated and consequently omit substantial numbers of genes, with likely negative impact on evolutionary interpretations. Finally, our results also indicate that most animals transcribe half or more of their genomes arguing against differences in genome usage between animal groups, and also suggesting that the transcribed portion is more dependent on genome size than previously thought.

SUBMITTER: Francis WR 

PROVIDER: S-EPMC5534336 | biostudies-literature | 2017 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Similar Ratios of Introns to Intergenic Sequence across Animal Genomes.

Francis Warren R WR   Wörheide Gert G  

Genome biology and evolution 20170601 6


One central goal of genome biology is to understand how the usage of the genome differs between organisms. Our knowledge of genome composition, needed for downstream inferences, is critically dependent on gene annotations, yet problems associated with gene annotation and assembly errors are usually ignored in comparative genomics. Here, we analyze the genomes of 68 species across 12 animal phyla and some single-cell eukaryotes for general trends in genome composition and transcription, taking in  ...[more]

Similar Datasets

| S-EPMC6211125 | biostudies-literature
| S-EPMC1833996 | biostudies-literature
| S-EPMC148383 | biostudies-other
| S-EPMC3164680 | biostudies-literature
| S-EPMC5054447 | biostudies-literature
| S-EPMC5755908 | biostudies-literature
| S-EPMC5191860 | biostudies-literature
| S-EPMC150231 | biostudies-literature
| S-EPMC9576210 | biostudies-literature
| S-EPMC43669 | biostudies-other