Dataset Information

Novel metrics for quantifying bacterial genome composition skews.


ABSTRACT:

Background

Bacterial genomes have characteristic compositional skews, which are differences in nucleotide frequency between the leading and lagging DNA strands across a segment of a genome. It is thought that these strand asymmetries arise as a result of mutational biases and selective constraints, particularly for energy efficiency. Analysis of compositional skews in a diverse set of bacteria provides a comparative context in which mutational and selective environmental constraints can be studied. These analyses typically require finished and well-annotated genomic sequences.
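As a point of reference for the strand asymmetries described above, the following is a minimal sketch of the classical GC skew, (G - C) / (G + C), computed over sliding windows. It is not one of the metrics introduced in this work; the function names `gc_skew` and `windowed_gc_skew` and the window parameters are illustrative assumptions.

```python
from typing import List


def gc_skew(seq: str) -> float:
    """Classical GC skew (G - C) / (G + C) for one sequence segment.

    Returns 0.0 when the segment contains no G or C, to avoid dividing by zero.
    """
    s = seq.upper()
    g = s.count("G")
    c = s.count("C")
    if g + c == 0:
        return 0.0
    return (g - c) / (g + c)


def windowed_gc_skew(seq: str, window: int, step: int) -> List[float]:
    """GC skew for each full-length window, moving `step` bases at a time."""
    return [
        gc_skew(seq[i:i + window])
        for i in range(0, len(seq) - window + 1, step)
    ]
```

In many bacterial chromosomes the sign of this windowed value tends to flip near the replication origin and terminus, which is one common way such skews are visualized.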

Results

We present three novel metrics for examining genome composition skews; all three metrics can be computed for unfinished or partially annotated genomes. The first two metrics (dot-skew and cross-skew) depend on the sequence and gene annotation of a single genome, while the third metric (residual skew) highlights unusual genomes by subtracting a GC content-based model of a library of genome sequences. We applied these metrics to 7738 available bacterial genomes, including partial drafts, and identified outlier species. A phylogenetically diverse set of these outliers (i.e., Borrelia, Ehrlichia, Kinetoplastibacterium, and Phytoplasma) display similar skew patterns and share lifestyle characteristics, such as intracellularity and biosynthetic dependence on their hosts.
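The idea of subtracting a GC content-based model can be illustrated with a small sketch. The abstract does not specify the form of the model, so the straight-line least-squares fit below is purely an assumption, as are the names `fit_line` and `residual_values`; it shows only the general pattern of fitting a trend across many genomes and keeping what the trend does not explain.

```python
from typing import List, Sequence, Tuple


def fit_line(xs: Sequence[float], ys: Sequence[float]) -> Tuple[float, float]:
    """Ordinary least-squares fit of y = slope * x + intercept."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need at least two paired observations")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical; slope is undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def residual_values(gc_content: Sequence[float], skew: Sequence[float]) -> List[float]:
    """Observed skew minus the skew predicted from GC content (assumed linear model)."""
    slope, intercept = fit_line(gc_content, skew)
    return [y - (slope * x + intercept) for x, y in zip(gc_content, skew)]
```

Under this assumed model, genomes with large absolute residuals would be the candidates for closer inspection as outliers.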

Conclusions

Our novel metrics appear to reflect the effects of biosynthetic constraints and adaptations to life within one or more hosts on genome composition. We provide results for each analyzed genome, software, and interactive visualizations at http://db.systemsbiology.net/gestalt/skew_metrics.

SUBMITTER: Joesch-Cohen LM 

PROVIDER: S-EPMC6042203 | biostudies-literature | 2018 Jul

REPOSITORIES: biostudies-literature

Publications

Novel metrics for quantifying bacterial genome composition skews.

Joesch-Cohen Lena M, Robinson Max, Jabbari Neda, Lausted Christopher G, Glusman Gustavo

BMC Genomics, 2018-07-11, issue 1



Similar Datasets

| S-EPMC2031905 | biostudies-other
| S-EPMC9115964 | biostudies-literature
2016-09-22 | GSE78756 | GEO
| S-EPMC7653371 | biostudies-literature
| S-EPMC4588562 | biostudies-literature
| S-EPMC6938549 | biostudies-literature
2012-10-01 | E-GEOD-33063 | biostudies-arrayexpress
| S-EPMC8460438 | biostudies-literature
| PRJNA1065110 | ENA
| S-EPMC5357938 | biostudies-literature