Unknown

Dataset Information

0

Improving the topology prediction of α-helical transmembrane proteins with deep transfer learning.


ABSTRACT: Transmembrane proteins (TMPs) are essential for cell recognition and communication, and they serve as important drug targets in humans. Transmembrane proteins' 3D structures are critical for determining their functions and drug design but are hard to determine even by experimental methods. Although some computational methods have been developed to predict transmembrane helices (TMHs) and orientation, there is still room for improvement. Considering that the pre-trained language model can make full use of massive unlabeled protein sequences to obtain latent feature representation for TMPs and reduce the dependence on evolutionary information, we proposed DeepTMpred, which used pre-trained self-supervised language models called ESM, convolutional neural networks, attentive neural network and conditional random fields for alpha-TMP topology prediction. Compared with the current state-of-the-art tools on a non-redundant dataset of TMPs, DeepTMpred demonstrated superior predictive performance in most evaluation metrics, especially at the TMH level. Furthermore, DeepTMpred could also obtain reliable prediction results for TMPs without much evolutionary feature in a few seconds. A tutorial on how to use DeepTMpred can be found in the colab notebook (https://colab.research.google.com/github/ISYSLAB-HUST/DeepTMpred/blob/master/notebook/test.ipynb).

SUBMITTER: Wang L 

PROVIDER: S-EPMC9062415 | biostudies-literature | 2022

REPOSITORIES: biostudies-literature

altmetric image

Publications

Improving the topology prediction of α-helical transmembrane proteins with deep transfer learning.

Wang Lei L   Zhong Haolin H   Xue Zhidong Z   Wang Yan Y  

Computational and structural biotechnology journal 20220420


Transmembrane proteins (TMPs) are essential for cell recognition and communication, and they serve as important drug targets in humans. Transmembrane proteins' 3D structures are critical for determining their functions and drug design but are hard to determine even by experimental methods. Although some computational methods have been developed to predict transmembrane helices (TMHs) and orientation, there is still room for improvement. Considering that the pre-trained language model can make fu  ...[more]

Similar Datasets

| S-EPMC7869861 | biostudies-literature
| S-EPMC7985279 | biostudies-literature
| S-EPMC2586566 | biostudies-literature
| S-EPMC2374033 | biostudies-literature
| S-EPMC4121805 | biostudies-literature
| S-EPMC3164537 | biostudies-literature
| S-EPMC2808916 | biostudies-literature
| S-EPMC8764525 | biostudies-literature
| S-EPMC5713875 | biostudies-literature
| S-EPMC4938508 | biostudies-literature