Unknown

Dataset Information

0

The network of sequence flow between protein structures.


ABSTRACT: Sequence-structure relationships in proteins are highly asymmetric because many sequences fold into relatively few structures. What is the number of sequences that fold into a particular protein structure? Is it possible to switch between stable protein folds by point mutations? To address these questions, we compute a directed graph of sequences and structures of proteins, which is based on 2,060 experimentally determined protein shapes from the Protein Data Bank. The directed graph is highly connected at native energies with "sinks" that attract many sequences from other folds. The sinks are rich in beta-sheets. The number of sequences that transition between folds is significantly smaller than the number of sequences retained by their fold. The sequence flow into a particular protein shape from other proteins correlates with the number of sequences that matches this shape in empirically determined genomes. Properties of strongly connected components of the graph are correlated with protein length and secondary structure.

SUBMITTER: Meyerguz L 

PROVIDER: S-EPMC1913895 | biostudies-literature | 2007 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

The network of sequence flow between protein structures.

Meyerguz Leonid L   Kleinberg Jon J   Elber Ron R  

Proceedings of the National Academy of Sciences of the United States of America 20070627 28


Sequence-structure relationships in proteins are highly asymmetric because many sequences fold into relatively few structures. What is the number of sequences that fold into a particular protein structure? Is it possible to switch between stable protein folds by point mutations? To address these questions, we compute a directed graph of sequences and structures of proteins, which is based on 2,060 experimentally determined protein shapes from the Protein Data Bank. The directed graph is highly c  ...[more]

Similar Datasets

| S-EPMC2811751 | biostudies-literature
| S-EPMC7886156 | biostudies-literature
| S-EPMC4987928 | biostudies-literature
| S-EPMC4637837 | biostudies-literature
| S-EPMC1366981 | biostudies-literature
| S-EPMC2764419 | biostudies-literature
| S-EPMC6677015 | biostudies-literature
| S-EPMC6442632 | biostudies-literature
| S-EPMC2708048 | biostudies-literature
| S-EPMC3962176 | biostudies-literature