Dataset Information

Cyanorak v2.1: a scalable information system dedicated to the visualization and expert curation of marine and brackish picocyanobacteria genomes.


ABSTRACT: Cyanorak v2.1 (http://www.sb-roscoff.fr/cyanorak) is an information system dedicated to visualizing, comparing and curating the genomes of Prochlorococcus, Synechococcus and Cyanobium, the most abundant photosynthetic microorganisms on Earth. The database encompasses sequences from 97 genomes, which cover most of the wide genetic diversity known so far within these groups and were split into 25,834 clusters of likely orthologous groups (CLOGs). The user interface gives access to genomic characteristics and accession numbers, as well as an interactive map showing strain isolation sites. The main entry point to the database is a search for a term (gene name, product, etc.), which returns a list of CLOGs and individual genes. Each CLOG benefits from a rich functional annotation including EggNOG, EC/K numbers, GO terms, TIGR Roles and custom-designed Cyanorak Roles, as well as several protein motif predictions. Cyanorak also displays a phyletic profile, indicating the genotype and pigment type for each CLOG, and a genome viewer (JBrowse) to visualize additional data on each genome, such as predicted operons, genomic islands or transcriptomic data, when available. This information system also includes a BLAST search tool, a comparative genomic context view and various data export options. Altogether, Cyanorak v2.1 constitutes an invaluable, scalable tool for comparative genomics of ecologically relevant marine microorganisms.

SUBMITTER: Garczarek L 

PROVIDER: S-EPMC7779031 | biostudies-literature | 2021 Jan

REPOSITORIES: biostudies-literature


Similar Datasets

| S-EPMC2698417 | biostudies-literature
| S-EPMC4139726 | biostudies-literature
| S-EPMC8463805 | biostudies-literature
| S-EPMC5148188 | biostudies-literature
| S-EPMC2129112 | biostudies-literature
| S-EPMC5315467 | biostudies-literature
| S-EPMC4699468 | biostudies-literature
| S-EPMC6070758 | biostudies-literature
| S-EPMC5503172 | biostudies-literature
| S-EPMC4914166 | biostudies-literature