Unknown

Dataset Information

0

The oligodeoxynucleotide sequences corresponding to never-expressed peptide motifs are mainly located in the non-coding strand.


ABSTRACT:

Background

We study the usage of specific peptide platforms in protein composition. Using the pentapeptide as a unit of length, we find that in the universal proteome many pentapeptides are heavily repeated (even thousands of times), whereas some are quite rare, and a small number do not appear at all. To understand the physico-chemical-biological basis underlying peptide usage at the proteomic level, in this study we analyse the energetic costs for the synthesis of rare and never-expressed versus frequent pentapeptides. In addition, we explore residue bulkiness, hydrophobicity, and codon number as factors able to modulate specific peptide frequencies. Then, the possible influence of amino acid composition is investigated in zero- and high-frequency pentapeptide sets by analysing the frequencies of the corresponding inverse-sequence pentapeptides. As a final step, we analyse the pentadecamer oligodeoxynucleotide sequences corresponding to the never-expressed pentapeptides.

Results

We find that only DNA context-dependent constraints (such as oligodeoxynucleotide sequence location in the minus strand, introns, pseudogenes, frameshifts, etc.) provide a coherent mechanistic platform to explain the occurrence of never-expressed versus frequent pentapeptides in the protein world.

Conclusions

This study is of importance in cell biology. Indeed, the rarity (or lack of expression) of specific 5-mer peptide modules implies the rarity (or lack of expression) of the corresponding n-mer peptide sequences (with n < 5), so possibly modulating protein compositional trends. Moreover the data might further our understanding of the role exerted by rare pentapeptide modules as critical biological effectors in protein-protein interactions.

SUBMITTER: Capone G 

PROVIDER: S-EPMC2919516 | biostudies-literature | 2010 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

The oligodeoxynucleotide sequences corresponding to never-expressed peptide motifs are mainly located in the non-coding strand.

Capone Giovanni G   Novello Giuseppe G   Fasano Candida C   Trost Brett B   Bickis Mik M   Kusalik Anthony A   Kanduc Darja D  

BMC bioinformatics 20100720


<h4>Background</h4>We study the usage of specific peptide platforms in protein composition. Using the pentapeptide as a unit of length, we find that in the universal proteome many pentapeptides are heavily repeated (even thousands of times), whereas some are quite rare, and a small number do not appear at all. To understand the physico-chemical-biological basis underlying peptide usage at the proteomic level, in this study we analyse the energetic costs for the synthesis of rare and never-expres  ...[more]

Similar Datasets

| S-EPMC2176071 | biostudies-literature
| S-EPMC3091326 | biostudies-literature
| S-EPMC3655826 | biostudies-literature
| S-EPMC7185194 | biostudies-literature
| S-EPMC5915474 | biostudies-literature
| S-EPMC5013901 | biostudies-literature
| S-EPMC3751894 | biostudies-literature
| S-EPMC2703934 | biostudies-literature
| S-EPMC310685 | biostudies-literature
| S-EPMC3082213 | biostudies-literature