Unknown

Dataset Information

0

Improved Reference Genome for Cyclotella cryptica CCMP332, a Model for Cell Wall Morphogenesis, Salinity Adaptation, and Lipid Production in Diatoms (Bacillariophyta).


ABSTRACT: The diatom, Cyclotella cryptica, is a well-established model species for physiological studies and biotechnology applications of diatoms. To further facilitate its use as a model diatom, we report an improved reference genome assembly and annotation for C. cryptica strain CCMP332. We used a combination of long- and short-read sequencing to assemble a high-quality and contaminant-free genome. The genome is 171 Mb in size and consists of 662 scaffolds with a scaffold N50 of 494 kb. This represents a 176-fold decrease in scaffold number and 41-fold increase in scaffold N50 compared to the previous assembly. The genome contains 21,250 predicted genes, 75% of which were assigned putative functions. Repetitive DNA comprises 59% of the genome, and an improved classification of repetitive elements indicated that a historically steady accumulation of transposable elements has contributed to the relatively large size of the C. cryptica genome. The high-quality C. cryptica genome will serve as a valuable reference for ecological, genetic, and biotechnology studies of diatoms.

SUBMITTER: Roberts WR 

PROVIDER: S-EPMC7466962 | biostudies-literature | 2020 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Improved Reference Genome for <i>Cyclotella cryptica</i> CCMP332, a Model for Cell Wall Morphogenesis, Salinity Adaptation, and Lipid Production in Diatoms (Bacillariophyta).

Roberts Wade R WR   Downey Kala M KM   Ruck Elizabeth C EC   Traller Jesse C JC   Alverson Andrew J AJ  

G3 (Bethesda, Md.) 20200902 9


The diatom, <i>Cyclotella cryptica</i>, is a well-established model species for physiological studies and biotechnology applications of diatoms. To further facilitate its use as a model diatom, we report an improved reference genome assembly and annotation for <i>C. cryptica</i> strain CCMP332. We used a combination of long- and short-read sequencing to assemble a high-quality and contaminant-free genome. The genome is 171 Mb in size and consists of 662 scaffolds with a scaffold N50 of 494 kb. T  ...[more]

Similar Datasets

| PRJNA411787 | ENA
| PRJNA337876 | ENA
| S-EPMC5561644 | biostudies-literature
| S-EPMC7661000 | biostudies-literature
| PRJNA628076 | ENA
| PRJNA387304 | ENA
| PRJNA589195 | ENA
| S-EPMC5124317 | biostudies-literature
| S-EPMC6007386 | biostudies-literature
| S-EPMC5850769 | biostudies-literature