Unknown

Dataset Information

0

Selecting protein targets for structural genomics of Pyrobaculum aerophilum: validating automated fold assignment methods by using binary hypothesis testing.


ABSTRACT: Three-dimensional protein folds were assigned to all ORFs of the recently sequenced genome of the hyperthermophilic archaeon Pyrobaculum aerophilum. Binary hypothesis testing was used to estimate a confidence level for each assignment. A separate test was conducted to assign a probability for whether each sequence has a novel fold-i.e., one that is not yet represented in the experimental database of known structures. Of the 2,130 predicted nontransmembrane proteins in this organism, 916 matched a fold at a cumulative 90% confidence level, and 245 could be assigned at a 99% confidence level. Likewise, 286 proteins were predicted to have a previously unobserved fold with a 90% confidence level, and 14 at a 99% confidence level. These statistically based tools are combined with homology searches against the Online Mendelian Inheritance in Man (OMIM) human genetics database and other protein databases for the selection of attractive targets for crystallographic or NMR structure determination. Results of these studies have been collated and placed at http://www.doe-mbi.ucla.edu/people/parag/P A_HOME/, the University of California, Los Angeles-Department of Energy Pyrobaculum aerophilum web site.

SUBMITTER: Mallick P 

PROVIDER: S-EPMC15949 | biostudies-literature | 2000 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Selecting protein targets for structural genomics of Pyrobaculum aerophilum: validating automated fold assignment methods by using binary hypothesis testing.

Mallick P P   Goodwill K E KE   Fitz-Gibbon S S   Miller J H JH   Eisenberg D D  

Proceedings of the National Academy of Sciences of the United States of America 20000301 6


Three-dimensional protein folds were assigned to all ORFs of the recently sequenced genome of the hyperthermophilic archaeon Pyrobaculum aerophilum. Binary hypothesis testing was used to estimate a confidence level for each assignment. A separate test was conducted to assign a probability for whether each sequence has a novel fold-i.e., one that is not yet represented in the experimental database of known structures. Of the 2,130 predicted nontransmembrane proteins in this organism, 916 matched  ...[more]

Similar Datasets

| S-EPMC2223247 | biostudies-literature
| S-EPMC117417 | biostudies-literature
| S-EPMC1214675 | biostudies-literature
| S-EPMC30402 | biostudies-literature
| S-EPMC3673990 | biostudies-literature
| PRJNA106587 | ENA
| PRJNA35109 | ENA
| S-EPMC2632070 | biostudies-literature
| S-EPMC3144789 | biostudies-literature
2009-01-23 | GSE11366 | GEO