Unknown

Dataset Information

0

Improving the performance of DomainDiscovery of protein domain boundary assignment using inter-domain linker index.


ABSTRACT: BACKGROUND: Knowledge of protein domain boundaries is critical for the characterisation and understanding of protein function. The ability to identify domains without the knowledge of the structure--by using sequence information only--is an essential step in many types of protein analyses. In this present study, we demonstrate that the performance of DomainDiscovery is improved significantly by including the inter-domain linker index value for domain identification from sequence-based information. Improved DomainDiscovery uses a Support Vector Machine (SVM) approach and a unique training dataset built on the principle of consensus among experts in defining domains in protein structure. The SVM was trained using a PSSM (Position Specific Scoring Matrix), secondary structure, solvent accessibility information and inter-domain linker index to detect possible domain boundaries for a target sequence. RESULTS: Improved DomainDiscovery is compared with other methods by benchmarking against a structurally non-redundant dataset and also CASP5 targets. Improved DomainDiscovery achieves 70% accuracy for domain boundary identification in multi-domains proteins. CONCLUSION: Improved DomainDiscovery compares favourably to the performance of other methods and excels in the identification of domain boundaries for multi-domain proteins as a result of introducing support vector machine with benchmark_2 dataset.

SUBMITTER: Sikder AR 

PROVIDER: S-EPMC1764483 | biostudies-literature | 2006

REPOSITORIES: biostudies-literature

altmetric image

Publications

Improving the performance of DomainDiscovery of protein domain boundary assignment using inter-domain linker index.

Sikder Abdur R AR   Zomaya Albert Y AY  

BMC bioinformatics 20061218


<h4>Background</h4>Knowledge of protein domain boundaries is critical for the characterisation and understanding of protein function. The ability to identify domains without the knowledge of the structure--by using sequence information only--is an essential step in many types of protein analyses. In this present study, we demonstrate that the performance of DomainDiscovery is improved significantly by including the inter-domain linker index value for domain identification from sequence-based inf  ...[more]

Similar Datasets

| S-EPMC4290662 | biostudies-literature
| S-EPMC2632666 | biostudies-literature
| S-EPMC4664294 | biostudies-literature
| S-EPMC10692239 | biostudies-literature
| S-EPMC149209 | biostudies-literature
| S-EPMC5737734 | biostudies-literature
| S-EPMC4788683 | biostudies-literature
| S-EPMC9710680 | biostudies-literature
| S-EPMC2259413 | biostudies-literature
| S-EPMC4621036 | biostudies-literature