Unknown

Dataset Information

0

A novel framework for the identification of drug target proteins: Combining stacked auto-encoders with a biased support vector machine.


ABSTRACT: The identification of drug target proteins (IDTP) plays a critical role in biometrics. The aim of this study was to retrieve potential drug target proteins (DTPs) from a collected protein dataset, which represents an overwhelming task of great significance. Previously reported methodologies for this task generally employ protein-protein interactive networks but neglect informative biochemical attributes. We formulated a novel framework utilizing biochemical attributes to address this problem. In the framework, a biased support vector machine (BSVM) was combined with the deep embedded representation extracted using a deep learning model, stacked auto-encoders (SAEs). In cases of non-drug target proteins (NDTPs) contaminated by DTPs, the framework is beneficial due to the efficient representation of the SAE and relief of the imbalance effect by the BSVM. The experimental results demonstrated the effectiveness of our framework, and the generalization capability was confirmed via comparisons to other models. This study is the first to exploit a deep learning model for IDTP. In summary, nearly 23% of the NDTPs were predicted as likely DTPs, which are awaiting further verification based on biomedical experiments.

SUBMITTER: Wang Q 

PROVIDER: S-EPMC5409512 | biostudies-literature | 2017

REPOSITORIES: biostudies-literature

altmetric image

Publications

A novel framework for the identification of drug target proteins: Combining stacked auto-encoders with a biased support vector machine.

Wang Qi Q   Feng YangHe Y   Huang JinCai J   Wang TengJiao T   Cheng GuangQuan G  

PloS one 20170428 4


The identification of drug target proteins (IDTP) plays a critical role in biometrics. The aim of this study was to retrieve potential drug target proteins (DTPs) from a collected protein dataset, which represents an overwhelming task of great significance. Previously reported methodologies for this task generally employ protein-protein interactive networks but neglect informative biochemical attributes. We formulated a novel framework utilizing biochemical attributes to address this problem. In  ...[more]

Similar Datasets

| S-EPMC4097812 | biostudies-other
| S-EPMC1594580 | biostudies-literature
| S-EPMC7407276 | biostudies-literature
| S-EPMC4331676 | biostudies-literature
| S-EPMC5519637 | biostudies-literature
| S-EPMC4609875 | biostudies-other
| S-EPMC4395415 | biostudies-literature
| S-EPMC2396404 | biostudies-literature
| S-EPMC6380887 | biostudies-literature
| S-EPMC5210213 | biostudies-literature