Unknown

Dataset Information

0

The proteome: structure, function and evolution.


ABSTRACT: This paper reports two studies to model the inter-relationships between protein sequence, structure and function. First, an automated pipeline to provide a structural annotation of proteomes in the major genomes is described. The results are stored in a database at Imperial College, London (3D-GENOMICS) that can be accessed at www.sbg.bio.ic.ac.uk. Analysis of the assignments to structural superfamilies provides evolutionary insights. 3D-GENOMICS is being integrated with related proteome annotation data at University College London and the European Bioinformatics Institute in a project known as e-protein (http://www.e-protein.org/). The second topic is motivated by the developments in structural genomics projects in which the structure of a protein is determined prior to knowledge of its function. We have developed a new approach PHUNCTIONER that uses the gene ontology (GO) classification to supervise the extraction of the sequence signal responsible for protein function from a structure-based sequence alignment. Using GO we can obtain profiles for a range of specificities described in the ontology. In the region of low sequence similarity (around 15%), our method is more accurate than assignment from the closest structural homologue. The method is also able to identify the specific residues associated with the function of the protein family.

SUBMITTER: Fleming K 

PROVIDER: S-EPMC1609342 | biostudies-literature | 2006 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

The proteome: structure, function and evolution.

Fleming Keiran K   Kelley Lawrence A LA   Islam Suhail A SA   MacCallum Robert M RM   Muller Arne A   Pazos Florencio F   Sternberg Michael J E MJ  

Philosophical transactions of the Royal Society of London. Series B, Biological sciences 20060301 1467


This paper reports two studies to model the inter-relationships between protein sequence, structure and function. First, an automated pipeline to provide a structural annotation of proteomes in the major genomes is described. The results are stored in a database at Imperial College, London (3D-GENOMICS) that can be accessed at www.sbg.bio.ic.ac.uk. Analysis of the assignments to structural superfamilies provides evolutionary insights. 3D-GENOMICS is being integrated with related proteome annotat  ...[more]

Similar Datasets

| S-EPMC3205581 | biostudies-literature
| S-EPMC6337491 | biostudies-literature
| S-EPMC3673471 | biostudies-literature
| S-EPMC3641637 | biostudies-literature
| S-EPMC1347420 | biostudies-literature
| S-EPMC3533774 | biostudies-literature
| S-EPMC5584315 | biostudies-literature
| S-EPMC6363617 | biostudies-literature
| S-EPMC5457962 | biostudies-literature
| S-EPMC3140765 | biostudies-literature