Dataset Information

Homopeptide and homocodon levels across fungi are coupled to GC/AT-bias and intrinsic disorder, with unique behaviours for some amino acids.


ABSTRACT: Homopeptides (runs of one amino-acid type) are evolutionarily important because they are prone to expand or contract during DNA replication, recombination and repair. To gain insight into the genomic and proteomic traits driving their variation, we analyzed how homopeptides and homocodons (pure codon repeats) vary across 405 Dikarya, and probed their linkage to genome GC/AT bias and other factors. We find that amino-acid homopeptide frequencies vary widely between clades, with the AT-rich Saccharomycotina showing a distinct trend. As organisms evolve, homocodon and homopeptide numbers are strongly coupled to GC/AT bias, exhibiting a bifurcated correlation with the degree of AT or GC bias. Mid-GC/AT genomes tend to have markedly fewer homocodons and homopeptides simply because they are mid-GC/AT. Despite these trends, homopeptides tend to be GC-biased relative to other parts of coding sequences, even in AT-rich organisms, indicating that they absorb AT bias less or are inherently more GC-rich. The most frequent and most variable homopeptide amino acids favour intrinsic disorder, and homopeptide levels correlate positively with intrinsic-disorder content and negatively with structured-domain content. Specific homopeptides show unique behaviours that we suggest are linked to inherent slippage probabilities during DNA replication and recombination. Examples are poly-glutamine, an evolutionarily highly variable homopeptide with a codon repertoire unbiased for GC/AT, and poly-lysine, whose homocodons are overwhelmingly made from the codon AAG.
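
To make the two definitions in the abstract concrete, the following is a minimal Python sketch of how homopeptides and homocodons could be detected in a single sequence, together with a simple GC fraction. It is illustrative only and is not the authors' pipeline: the function names, the in-frame codon splitting, and in particular the minimum run length of 5 (`min_len`) are assumptions, since the abstract does not state a length threshold.

```python
import re

def homopeptides(protein, min_len=5):
    """Return (residue, start, length) for each run of one amino-acid type.

    min_len is an assumed threshold; the abstract does not specify one.
    """
    # One residue followed by at least (min_len - 1) copies of itself.
    pattern = re.compile(r"([A-Z])\1{%d,}" % (min_len - 1))
    return [(m.group(1), m.start(), len(m.group(0)))
            for m in pattern.finditer(protein.upper())]

def homocodons(cds, min_len=5):
    """Return (codon, codon_index, n_codons) for in-frame pure codon repeats.

    Assumes cds starts in frame 0; trailing bases that do not fill a codon
    are ignored.
    """
    cds = cds.upper()
    codons = [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]
    runs = []
    i = 0
    while i < len(codons):
        j = i
        # Extend the run while the same codon repeats.
        while j < len(codons) and codons[j] == codons[i]:
            j += 1
        if j - i >= min_len:
            runs.append((codons[i], i, j - i))
        i = j
    return runs

def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)
```

Under these assumptions, a poly-lysine tract encoded as `AAG` repeated five times is reported both as a homopeptide (at the protein level) and as a homocodon, whereas a poly-glutamine tract encoded by alternating `CAG` and `CAA` is a homopeptide but not a homocodon. This is the distinction the abstract draws between amino-acid runs and pure codon repeats.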

SUBMITTER: Wang Y 

PROVIDER: S-EPMC8113271 | biostudies-literature

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC5930961 | biostudies-literature
| S-EPMC555766 | biostudies-literature
2021-11-03 | PXD027887 | Pride
| S-EPMC7612807 | biostudies-literature
| S-EPMC8074881 | biostudies-literature
| S-EPMC7785727 | biostudies-literature
| S-EPMC4772475 | biostudies-literature
| S-EPMC5877746 | biostudies-literature
| S-EPMC5733138 | biostudies-literature
| S-EPMC3092812 | biostudies-literature