
Dataset Information


Highly expressed proteins have an increased frequency of alanine in the second amino acid position.


ABSTRACT: Although the sequence requirements for translation initiation regions have been frequently analysed, the highly expressed genes are usually not treated as a separate dataset. To investigate this, we analysed the mRNA regions downstream of initiation codons in nine bacteria, three archaea and three unicellular eukaryotes, comparing the dataset of highly expressed genes to the dataset of all genes. In addition to the detailed analysis of the nucleotide and codon frequencies, we compared the N-termini of highly expressed proteins to the N-termini of all proteins coded in the genome. The most conserved pattern was observed at the amino acid level: strong alanine over-representation was observed at the second amino acid position of highly expressed proteins. This pattern is well conserved in all three domains of life.
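The comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names (`second_position_frequencies`, `enrichment`) and the toy sequences are hypothetical, and it assumes each protein is given as a one-letter amino acid string that starts with the initiator methionine, so the second position is index 1.

```python
from collections import Counter


def second_position_frequencies(proteins):
    """Relative frequency of each residue at the second amino acid position."""
    # Sequences shorter than two residues have no second position and are skipped.
    counts = Counter(seq[1] for seq in proteins if len(seq) >= 2)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {residue: n / total for residue, n in counts.items()}


def enrichment(highly_expressed, all_proteins, residue="A"):
    """Ratio of a residue's second-position frequency in the highly
    expressed set to its frequency in the full set (hypothetical helper)."""
    subset = second_position_frequencies(highly_expressed).get(residue, 0.0)
    background = second_position_frequencies(all_proteins).get(residue, 0.0)
    return subset / background if background else float("nan")


# Toy sequences invented for illustration only; they are not data from the study.
all_proteins = ["MKT", "MAL", "MSE", "MAK", "MLV", "MTT"]
highly_expressed = ["MAL", "MAK", "MSE"]

alanine_ratio = enrichment(highly_expressed, all_proteins, residue="A")
print(f"Alanine enrichment at position 2: {alanine_ratio:.2f}")
```

A ratio above 1 would indicate over-representation in the highly expressed set. A real analysis would also need a criterion for selecting highly expressed genes and a significance test, neither of which is specified on this page.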

SUBMITTER: Tats A 

PROVIDER: S-EPMC1397820 | biostudies-literature | 2006 Feb

REPOSITORIES: biostudies-literature


Publications

Highly expressed proteins have an increased frequency of alanine in the second amino acid position.

Age Tats, Maido Remm, Tanel Tenson

BMC Genomics, 2006-02-16



Similar Datasets

| S-EPMC2034453 | biostudies-literature
| S-EPMC7894817 | biostudies-literature
| S-EPMC3567014 | biostudies-literature
| S-EPMC8022432 | biostudies-literature
| S-EPMC1242296 | biostudies-literature
| S-EPMC6800875 | biostudies-literature
2013-04-06 | E-GEOD-45556 | biostudies-arrayexpress
2013-05-06 | E-GEOD-46651 | biostudies-arrayexpress
2013-05-06 | GSE46651 | GEO
2013-04-06 | GSE45556 | GEO