Unknown

Dataset Information

0

High-throughput computational structure-based characterization of protein families: START domains and implications for structural genomics.


ABSTRACT: SkyLine, a high-throughput homology modeling pipeline tool, detects and models true sequence homologs to a given protein structure. Structures and models are stored in SkyBase with links to computational function annotation, as calculated by MarkUs. The SkyLine/SkyBase/MarkUs technology represents a novel structure-based approach that is more objective and versatile than other protein classification resources. This structure-centric strategy provides a multi-dimensional organization and coverage of protein space at the levels of family, function, and genome. The concept of "modelability", the ability to model sequences on related structures, provides a reliable criterion for membership in a protein family ("leverage") and underlies the unique success of this approach. The overall procedure is illustrated by its application to START domains, which comprise a Biomedical Theme for the Northeast Structural Genomics Consortium as part of the Protein Structure Initiative. START domains are typically involved in the non-vesicular transport of lipids. While 19 experimentally determined structures are available, the family, whose evolutionary hierarchy is not well determined, is highly sequence diverse, and the ligand-binding potential of many family members is unknown. The SkyLine/SkyBase/MarkUs approach provides significant insights and predicts: (1) many more family members (approximately 4,000) than any other resource; (2) the function for a large number of unannotated proteins; (3) instances of START domains in genomes from which they were thought to be absent; and (4) the existence of two types of novel proteins, those containing dual START domain and those containing N-terminal START domains.

SUBMITTER: Lee H 

PROVIDER: S-EPMC2881152 | biostudies-literature | 2010 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

High-throughput computational structure-based characterization of protein families: START domains and implications for structural genomics.

Lee Hunjoong H   Li Zhaohui Z   Silkov Antonina A   Fischer Markus M   Petrey Donald D   Honig Barry B   Murray Diana D  

Journal of structural and functional genomics 20100301 1


SkyLine, a high-throughput homology modeling pipeline tool, detects and models true sequence homologs to a given protein structure. Structures and models are stored in SkyBase with links to computational function annotation, as calculated by MarkUs. The SkyLine/SkyBase/MarkUs technology represents a novel structure-based approach that is more objective and versatile than other protein classification resources. This structure-centric strategy provides a multi-dimensional organization and coverage  ...[more]

Similar Datasets

| S-EPMC528931 | biostudies-literature
| S-EPMC7586902 | biostudies-literature
| S-EPMC3127847 | biostudies-literature
| S-EPMC2764548 | biostudies-literature
| S-EPMC2144534 | biostudies-other
| S-EPMC129326 | biostudies-literature
| S-EPMC499543 | biostudies-literature
| S-EPMC4814114 | biostudies-literature
| S-EPMC3001128 | biostudies-literature
| S-EPMC3123459 | biostudies-literature