Dataset Information

Cyanorak v2.1: a scalable information system dedicated to the visualization and expert curation of marine and brackish picocyanobacteria genomes.

ABSTRACT: Cyanorak v2.1 (http://www.sb-roscoff.fr/cyanorak) is an information system dedicated to visualizing, comparing and curating the genomes of Prochlorococcus, Synechococcus and Cyanobium, the most abundant photosynthetic microorganisms on Earth. The database encompasses sequences from 97 genomes, covering most of the wide genetic diversity known so far within these groups, and which were split into 25,834 clusters of likely orthologous groups (CLOGs). The user interface gives access to genomic characteristics, accession numbers as well as an interactive map showing strain isolation sites. The main entry to the database is through search for a term (gene name, product, etc.), resulting in a list of CLOGs and individual genes. Each CLOG benefits from a rich functional annotation including EggNOG, EC/K numbers, GO terms, TIGR Roles, custom-designed Cyanorak Roles as well as several protein motif predictions. Cyanorak also displays a phyletic profile, indicating the genotype and pigment type for each CLOG, and a genome viewer (Jbrowse) to visualize additional data on each genome such as predicted operons, genomic islands or transcriptomic data, when available. This information system also includes a BLAST search tool, comparative genomic context as well as various data export options. Altogether, Cyanorak v2.1 constitutes an invaluable, scalable tool for comparative genomics of ecologically relevant marine microorganisms.

SUBMITTER: Garczarek L

PROVIDER: S-EPMC7779031 | biostudies-literature | 2021 Jan

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Cyanorak v2.1: a scalable information system dedicated to the visualization and expert curation of marine and brackish picocyanobacteria genomes.

Garczarek Laurence L Guyet Ulysse U Doré Hugo H Farrant Gregory K GK Hoebeke Mark M Brillet-Guéguen Loraine L Bisch Antoine A Ferrieux Mathilde M Siltanen Jukka J Corre Erwan E Le Corguillé Gildas G Ratin Morgane M Pitt Frances D FD Ostrowski Martin M Conan Maël M Siegel Anne A Labadie Karine K Aury Jean-Marc JM Wincker Patrick P Scanlan David J DJ Partensky Frédéric F

Nucleic acids research 20210101 D1

Cyanorak v2.1 (http://www.sb-roscoff.fr/cyanorak) is an information system dedicated to visualizing, comparing and curating the genomes of Prochlorococcus, Synechococcus and Cyanobium, the most abundant photosynthetic microorganisms on Earth. The database encompasses sequences from 97 genomes, covering most of the wide genetic diversity known so far within these groups, and which were split into 25,834 clusters of likely orthologous groups (CLOGs). The user interface gives access to genomic char ...[more]

PMID: 33125079

Similar Datasets

Project description:Picocyanobacteria Prochlorococcus and Synechococcus are abundant in the global oceans and subject to active viral infection. In this study, the genetic diversity of picocyanobacteria and the genetic diversity of cyanopodoviruses were synchronously investigated along water columns in the equatorial Indian Ocean and over a seasonal time course in the coastal Sanya Bay, South China Sea. Using the 16S-23S rRNA internal transcribed spacer (ITS)-based clone library and quantitative PCR (qPCR) analyses, the picocyanobacterial community composition and abundance were determined. Sanya Bay was dominated by clade II Synechococcus during all the seasons, and a typical population shift from high-light-adapted Prochlorococcus to low-light-adapted Prochlorococcus was found along the vertical profiles. Strikingly, the DNA polymerase gene sequences of cyanopodoviruses revealed a much greater genetic diversity than we expected. Nearly one-third of the phylogenetic groups were newly described here. No apparent seasonal pattern was observed for the Sanya Bay picocyanobacterial or cyanopodoviral communities. Different dominant cyanopodovirus lineages were identified for the coastal area, upper euphotic zone, and middle-to-lower euphotic zone of the open ocean. Diversity indices of both picocyanobacteria and cyanopodoviruses were highest in the middle euphotic zone and both were lower in the upper euphotic zone, reflecting a host-virus interaction. Cyanopodoviral communities differed significantly between the upper euphotic zone and the middle-to-lower euphotic zone, showing a vertical pattern similar to that of picocyanobacteria. However, in the surface waters of the open ocean, cyanopodoviruses exhibited no apparent biogeographic pattern, differing from picocyanobacteria. This study demonstrates correlated distribution patterns of picocyanobacteria and cyanopodoviruses, as well as the complex biogeography of cyanopodoviruses.IMPORTANCE Picocyanobacteria are highly diverse and abundant in the ocean and display remarkable global biogeography and a vertical distribution pattern. However, how the diversity and distribution of picocyanobacteria affect those of the viruses that infect them remains largely unknown. Here we synchronously analyzed the community structures of cyanopodoviruses and picocyanobacteria at spatial and temporal scales. Both spatial and temporal variations of cyanopodoviral communities can be linked to those of picocyanobacteria. The coastal area, upper euphotic zone, and middle-to-lower euphotic zone of the open ocean have distinct cyanopodoviral communities, showing horizontal and vertical variation patterns closely related to those of picocyanobacteria. These findings emphasize the driving force of host community in shaping the biogeographic structure of viruses. Our work provides important information for future assessments of the ecological roles of viruses and hosts for each other.

Dataset Information

Cyanorak v2.1: a scalable information system dedicated to the visualization and expert curation of marine and brackish picocyanobacteria genomes.

Publications

Cyanorak v2.1: a scalable information system dedicated to the visualization and expert curation of marine and brackish picocyanobacteria genomes.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets