Unknown

Dataset Information

0

Serial gene losses and foreign DNA underlie size and sequence variation in the plastid genomes of diatoms.


ABSTRACT: Photosynthesis by diatoms accounts for roughly one-fifth of global primary production, but despite this, relatively little is known about their plastid genomes. We report the completely sequenced plastid genomes for eight phylogenetically diverse diatoms and show them to be variable in size, gene and foreign sequence content, and gene order. The genomes contain a core set of 122 protein-coding genes, with 15 additional genes exhibiting complex patterns of 1) gene losses at varying phylogenetic scales, 2) functional transfers to the nucleus, 3) gene duplication, divergence, and differential retention of paralogs, and 4) acquisitions of putatively functional recombinase genes from resident plasmids. The newly sequenced genomes also contain several previously unreported genes, highlighting how poorly characterized diatom plastid genomes are overall. Genome size variation reflects major expansions of the inverted repeat region in some cases but, more commonly, large-scale expansions of intergenic regions, many of which contain unique open reading frames of likely foreign origin. Although many gene clusters are conserved across species, rearrangements appear to be frequent in most lineages.

SUBMITTER: Ruck EC 

PROVIDER: S-EPMC3971590 | biostudies-literature | 2014 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Serial gene losses and foreign DNA underlie size and sequence variation in the plastid genomes of diatoms.

Ruck Elizabeth C EC   Nakov Teofil T   Jansen Robert K RK   Theriot Edward C EC   Alverson Andrew J AJ  

Genome biology and evolution 20140301 3


Photosynthesis by diatoms accounts for roughly one-fifth of global primary production, but despite this, relatively little is known about their plastid genomes. We report the completely sequenced plastid genomes for eight phylogenetically diverse diatoms and show them to be variable in size, gene and foreign sequence content, and gene order. The genomes contain a core set of 122 protein-coding genes, with 15 additional genes exhibiting complex patterns of 1) gene losses at varying phylogenetic s  ...[more]

Similar Datasets

| S-EPMC3843732 | biostudies-literature
| S-EPMC8268801 | biostudies-literature
| S-EPMC6921581 | biostudies-literature
| S-EPMC5481826 | biostudies-literature
| S-EPMC5979308 | biostudies-literature
| S-EPMC2923445 | biostudies-literature
| S-EPMC3640096 | biostudies-literature
| S-EPMC11368500 | biostudies-literature
| S-EPMC3881603 | biostudies-literature
| S-EPMC5543022 | biostudies-literature