Dataset Information

Protein structure database search and evolutionary classification.


ABSTRACT: As more protein structures become available and structural genomics efforts provide structural models in a genome-wide strategy, there is a growing need for fast and accurate methods for discovering homologous proteins and evolutionary classifications of newly determined structures. We have developed 3D-BLAST, in part, to address these issues. 3D-BLAST is as fast as BLAST and calculates the statistical significance (E-value) of an alignment to indicate the reliability of the prediction. Using this method, we first identified 23 states of the structural alphabet that represent pattern profiles of the backbone fragments and then used them to represent protein structure databases as structural alphabet sequence databases (SADB). Our method enhanced BLAST as a search method, using a new structural alphabet substitution matrix (SASM) to find the longest common substructures with high-scoring structured segment pairs from an SADB database. Using personal computers with Intel Pentium4 (2.8 GHz) processors, our method searched more than 10 000 protein structures in 1.3 s and achieved a good agreement with search results from detailed structure alignment methods. [3D-BLAST is available at http://3d-blast.life.nctu.edu.tw].
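The search step described in the abstract (scoring structural alphabet sequences against each other with a substitution matrix, then reporting an E-value) can be illustrated with a minimal sketch. This is not the 3D-BLAST implementation: the scoring function `toy_score` is a hypothetical stand-in for the SASM, the letters are arbitrary rather than the paper's 23 states, and the default `lam` and `k` values are placeholders, not the published statistical parameters. The E-value formula used is the standard Karlin-Altschul form, E = K * m * n * exp(-lambda * S), which BLAST-style tools rely on.

```python
import math


def toy_score(a: str, b: str) -> int:
    """Hypothetical stand-in for an SASM lookup: +4 for a match, -2 otherwise."""
    return 4 if a == b else -2


def best_ungapped_segment(query: str, subject: str):
    """Return (score, query_start, subject_start, length) of the
    highest-scoring ungapped segment pair, scanning every diagonal."""
    best = (0, 0, 0, 0)
    for offset in range(-(len(query) - 1), len(subject)):
        i, j = max(0, -offset), max(0, offset)
        running = 0
        start_i, start_j = i, j
        while i < len(query) and j < len(subject):
            if running <= 0:
                # A non-positive prefix never helps, so restart the segment here.
                running = 0
                start_i, start_j = i, j
            running += toy_score(query[i], subject[j])
            if running > best[0]:
                best = (running, start_i, start_j, i - start_i + 1)
            i += 1
            j += 1
    return best


def e_value(score: float, query_len: int, db_len: int,
            lam: float = 0.3, k: float = 0.1) -> float:
    """Karlin-Altschul E-value; lam and k are placeholder parameters."""
    return k * query_len * db_len * math.exp(-lam * score)


if __name__ == "__main__":
    # Toy structural alphabet strings (letters are illustrative only).
    query, subject = "ABCDEFG", "XXCDEFYY"
    score, q_start, s_start, length = best_ungapped_segment(query, subject)
    print(score, query[q_start:q_start + length], subject[s_start:s_start + length])
    print(e_value(score, len(query), len(subject)))
```

A real search would use gapped extension, word seeding, and parameters fitted to the actual substitution matrix; the sketch only shows how a segment score and its statistical significance relate.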

SUBMITTER: Yang JM 

PROVIDER: S-EPMC1540718 | biostudies-literature | 2006

REPOSITORIES: biostudies-literature

Publications

Protein structure database search and evolutionary classification.

Yang Jinn-Moon (Yang JM), Tung Chi-Hua (Tung CH)

Nucleic Acids Research, 2006-08-02, issue 13


Similar Datasets

| S-EPMC6030909 | biostudies-other
2015-07-31 | E-GEOD-59956 | biostudies-arrayexpress
2015-07-31 | GSE59956 | GEO
| S-EPMC3556496 | biostudies-literature
| S-EPMC3892073 | biostudies-literature
| S-EPMC1899123 | biostudies-literature
| S-EPMC145583 | biostudies-other
| S-EPMC4256011 | biostudies-literature
| S-EPMC3953177 | biostudies-literature
| S-EPMC1868941 | biostudies-literature