Unknown

Dataset Information

0

ThreaDomEx: a unified platform for predicting continuous and discontinuous protein domains by multiple-threading and segment assembly.


ABSTRACT: We develop a hierarchical pipeline, ThreaDomEx, for both continuous domain (CD) and discontinuous domain (DCD) structure predictions. Starting from a query sequence, ThreaDomEx first threads it through the PDB to identify multiple structure templates, where a profile of domain conservation score (DC-score) is derived for domain-segment assignment. To further detect DCDs that consist of separated segments along the sequence, a boundary-clustering algorithm is used to refine the DCD-linker locations. In case that the templates do not contain DCDs, a domain-segment assembly process, guided by symmetry comparison, is applied for further DCD detections. ThreaDomEx was tested a set of 1111 proteins and achieved a normalized domain overlap score of 89.3% compared to experimental data, which is significantly higher than other state-of-the-art methods. It also recalls 26.7% of DCDs with 72.7% precision on the proteins for which threading failed to detect any DCDs. The server provides facilities for users to interactively refine the domain models by adjusting DC-score threshold, deleting and adding domain linkers, and assembling domain segments, which are particularly helpful for the hard targets for which current methods have a low accuracy while human-expert knowledge and experimental insights can be used for refining models. ThreaDomEX server is available at http://zhanglab.ccmb.med.umich.edu/ThreaDomEx.

SUBMITTER: Wang Y 

PROVIDER: S-EPMC5793814 | biostudies-literature | 2017 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

ThreaDomEx: a unified platform for predicting continuous and discontinuous protein domains by multiple-threading and segment assembly.

Wang Yan Y   Wang Jian J   Li Ruiming R   Shi Qiang Q   Xue Zhidong Z   Zhang Yang Y  

Nucleic acids research 20170701 W1


We develop a hierarchical pipeline, ThreaDomEx, for both continuous domain (CD) and discontinuous domain (DCD) structure predictions. Starting from a query sequence, ThreaDomEx first threads it through the PDB to identify multiple structure templates, where a profile of domain conservation score (DC-score) is derived for domain-segment assignment. To further detect DCDs that consist of separated segments along the sequence, a boundary-clustering algorithm is used to refine the DCD-linker locatio  ...[more]

Similar Datasets

| S-EPMC6990874 | biostudies-literature
| S-EPMC2780749 | biostudies-literature
| S-EPMC5875494 | biostudies-literature
| S-EPMC1190255 | biostudies-literature
| S-EPMC8602663 | biostudies-literature
| S-EPMC4621036 | biostudies-literature
| S-EPMC3092796 | biostudies-literature
| S-EPMC6908709 | biostudies-literature
| S-EPMC7010958 | biostudies-literature
| S-EPMC10348810 | biostudies-literature