Unknown

Dataset Information

0

Identifying G-protein coupled receptors using weighted Levenshtein distance and the nearest neighbor method.


ABSTRACT: G-protein coupled receptors (GPCRs) are a class of seven-helix transmembrane proteins that have been used in bioinformatics as the targets to facilitate drug discovery for human diseases. Although thousands of GPCR sequences have been collected, the ligand specificity of many GPCRs are still unknown and only one crystal structure of the rhodopsin-like family has been solved. Therefore, identifying GPCR types only from sequence data has become an important research issue. In this study, a novel technique for identifying GPCR types based on the weighted Levenshtein distance between two receptor sequences and the nearest neighbor method (NNM) is introduced, which can deal with receptor sequences with different lengths directly. In our experiments for classifying four classes (acetylcholine, adrenoceptor, dopamine, and serotonin) of the rhodopsin-like family of GPCRs, the error rates from the leave-one-out procedure and the leave-half-out procedure are 0.62% and 1.24%, respectively. These results are prior to those of the covariant discriminant algorithm, the support vector machine method, and the NNM with Euclidean distance.

SUBMITTER: Xu JH 

PROVIDER: S-EPMC5173237 | biostudies-literature | 2005 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Identifying G-protein coupled receptors using weighted Levenshtein distance and the nearest neighbor method.

Xu Jian Hua JH  

Genomics, proteomics & bioinformatics 20051101 4


G-protein coupled receptors (GPCRs) are a class of seven-helix transmembrane proteins that have been used in bioinformatics as the targets to facilitate drug discovery for human diseases. Although thousands of GPCR sequences have been collected, the ligand specificity of many GPCRs are still unknown and only one crystal structure of the rhodopsin-like family has been solved. Therefore, identifying GPCR types only from sequence data has become an important research issue. In this study, a novel t  ...[more]

Similar Datasets

| S-EPMC7206335 | biostudies-literature
| S-EPMC6044698 | biostudies-literature
| S-EPMC2967780 | biostudies-literature
| S-EPMC4106157 | biostudies-other
| S-EPMC7870093 | biostudies-literature
| S-EPMC7096456 | biostudies-literature
| S-EPMC7387719 | biostudies-literature
| S-EPMC8728060 | biostudies-literature
| S-EPMC7672824 | biostudies-literature
| S-EPMC6081758 | biostudies-literature