Project description:Synechococcus is among the most important contributors to global primary productivity. The genomes of several strains of this taxon have been previously sequenced in an effort to understand the physiology and ecology of these highly diverse microorganisms. Here we present a comparative study of Synechococcus genomes. For that end, we developed GenTaxo, a program written in Perl to perform genomic taxonomy based on average nucleotide identity, average amino acid identity and dinucleotide signatures, which revealed that the analyzed strains are drastically distinct regarding their genomic content. Phylogenomic reconstruction indicated a division of Synechococcus in two clades (i.e. Synechococcus and the new genus Parasynechococcus), corroborating evidences that this is in fact a polyphyletic group. By clustering protein encoding genes into homologue groups we were able to trace the Pangenome and core genome of both marine and freshwater Synechococcus and determine the genotypic traits that differentiate these lineages.
Project description:Morphologically similar to Synechococcus, a large number of Parasynechococcus strains were misclassified, resulting in extreme underestimation of their genetic diversity. In this study, 80 Synechococcus-like strains were reevaluated using a combination of 16S rRNA phylogeny and genomic approach, identifying 54 strains as Parasynechococcus-like strains and showing considerably intragenus genetic divergence among the subclades identified. Further, bioinformatics analysis disclosed diversified patterns of distribution, abundance, density, and diversity of microsatellites (SSRs) and compound microsatellites (CSSRs) in genomes of these Parasynechococcus-like strains. Variations of SSRs and CSSRs were observed amongst phylotypes and subclades. Both SSRs and CSSRs were in particular unequally distributed among genomes. Dinucleotide SSRs were the most widespread, while the genomes showed two patterns in the second most abundant repeat type (mononucleotide or trinucleotide SSRs). Both SSRs and CSSRs were predominantly observed in coding regions. These two types of microsatellites showed positive correlation with genome size (p < 0.01) but negative correlation with GC content (p < 0.05). Additionally, the motif (A)n, (AG)n and (AGC)n was a major one in the corresponding category. Meanwhile, distinctive motifs of CSSRs were found in 39 genomes. This study characterizes SSRs and CSSRs in genomes of Parasynechococcus-like strains and will be useful as a prerequisite for future studies regarding their distribution, function, and evolution. Moreover, the identified SSRs may facilitate fast acclimation of Parasynechococcus-like strains to fluctuating environments and contribute to the extensive distribution of Parasynechococcus species in global marine environments.