Unknown

Dataset Information

0

Using linear algebra for protein structural comparison and classification.


ABSTRACT: In this article, we describe a novel methodology to extract semantic characteristics from protein structures using linear algebra in order to compose structural signature vectors which may be used efficiently to compare and classify protein structures into fold families. These signatures are built from the pattern of hydrophobic intrachain interactions using Singular Value Decomposition (SVD) and Latent Semantic Indexing (LSI) techniques. Considering proteins as documents and contacts as terms, we have built a retrieval system which is able to find conserved contacts in samples of myoglobin fold family and to retrieve these proteins among proteins of varied folds with precision of up to 80%. The classifier is a web tool available at our laboratory website. Users can search for similar chains from a specific PDB, view and compare their contact maps and browse their structures using a JMol plug-in.

SUBMITTER: Gomide J 

PROVIDER: S-EPMC3036040 | biostudies-literature | 2009 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Using linear algebra for protein structural comparison and classification.

Gomide Janaína J   Melo-Minardi Raquel R   Dos Santos Marcos Augusto MA   Neshich Goran G   Meira Wagner W   Lopes Júlio César JC   Santoro Marcelo M  

Genetics and molecular biology 20090701 3


In this article, we describe a novel methodology to extract semantic characteristics from protein structures using linear algebra in order to compose structural signature vectors which may be used efficiently to compare and classify protein structures into fold families. These signatures are built from the pattern of hydrophobic intrachain interactions using Singular Value Decomposition (SVD) and Latent Semantic Indexing (LSI) techniques. Considering proteins as documents and contacts as terms,  ...[more]

Similar Datasets

| S-EPMC2259299 | biostudies-literature
| S-EPMC3268291 | biostudies-literature
| S-EPMC3272679 | biostudies-literature
2007-12-20 | GSE9929 | GEO
| S-EPMC7353965 | biostudies-literature
| S-EPMC9956871 | biostudies-literature
| S-EPMC6684663 | biostudies-literature
| S-EPMC4824117 | biostudies-literature
| S-EPMC1347461 | biostudies-literature
| S-EPMC6192565 | biostudies-literature