Dataset Information

Intron positions correlate with module boundaries in ancient proteins.

ABSTRACT: We analyze the three-dimensional structure of proteins by a computer program that finds regions of sequence that contain module boundaries, defining a module as a segment of polypeptide chain bounded in space by a specific given distance. The program defines a set of "linker regions" that have the property that if an intron were to be placed into each linker region, the protein would be dissected into a set of modules all less than the specified diameter. We test a set of 32 proteins, all of ancient origin, and a corresponding set of 570 intron positions, to ask if there is a statistically significant excess of intron positions within the linker regions. For 28-A modules, a standard size used historically, we find such an excess, with P < 0.003. This correlation is neither due to a compositional or sequence bias in the linker regions nor to a surface bias in intron positions. Furthermore, a subset of 20 introns, which can be putatively identified as old, lies even more explicitly within the linker regions, with P < 0.0003. Thus, there is a strong correlation between intron positions and three-dimensional structural elements of ancient proteins as expected by the introns-early approach. We then study a range of module diameters and show that, as the diameter varies, significant peaks of correlation appear for module diameters centered at 21.7, 27.6, and 32.9 A. These preferred module diameters roughly correspond to predicted exon sizes of 15, 22, and 30 residues. Thus, there are significant correlations between introns, modules, and a quantized pattern of the lengths of polypeptide chains, which is the prediction of the "Exon Theory of Genes."

SUBMITTER: de Souza SJ

PROVIDER: S-EPMC26186 | biostudies-literature | 1996 Dec

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Intron positions correlate with module boundaries in ancient proteins.

de Souza S J SJ Long M M Schoenbach L L Roy S W SW Gilbert W W

Proceedings of the National Academy of Sciences of the United States of America 19961201 25

We analyze the three-dimensional structure of proteins by a computer program that finds regions of sequence that contain module boundaries, defining a module as a segment of polypeptide chain bounded in space by a specific given distance. The program defines a set of "linker regions" that have the property that if an intron were to be placed into each linker region, the protein would be dissected into a set of modules all less than the specified diameter. We test a set of 32 proteins, all of anc ...[more]

PMID: 8962105

Dataset Information

Intron positions correlate with module boundaries in ancient proteins.

Publications

Intron positions correlate with module boundaries in ancient proteins.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Conserved intron positions in ancient protein modules.
| S-EPMC1800838 | biostudies-other

Intron "sliding" and the diversity of intron positions.
| S-EPMC23469 | biostudies-literature

U12 intron positions are more strongly conserved between animals and plants than U2 intron positions.
| S-EPMC2426677 | biostudies-literature

Some novel intron positions in conserved Drosophila genes are caused by intron sliding or tandem duplication.
| S-EPMC2891723 | biostudies-literature

Integrating Phylogenetics With Intron Positions Illuminates the Origin of the Complex Spliceosome.
| S-EPMC9887622 | biostudies-literature

The cAMP binding domain: an ancient signaling module.
| S-EPMC544069 | biostudies-literature

Sequence analysis of malacoherpesvirus proteins: Pan-herpesvirus capsid module and replication enzymes with an ancient connection to "Megavirales".
| S-EPMC7172337 | biostudies-literature

Su(H)-mediated repression positions gene boundaries along the dorsal-ventral axis of Drosophila embryos.
| S-EPMC4201238 | biostudies-literature

Hydrogen atoms in proteins: positions and dynamics.
| S-EPMC193546 | biostudies-literature

An ancient anion-binding structural module in RNA and DNA helicases.
| S-EPMC7610952 | biostudies-literature