Dataset Information

Improving the topology prediction of α-helical transmembrane proteins with deep transfer learning.

ABSTRACT: Transmembrane proteins (TMPs) are essential for cell recognition and communication, and they serve as important drug targets in humans. Transmembrane proteins' 3D structures are critical for determining their functions and drug design but are hard to determine even by experimental methods. Although some computational methods have been developed to predict transmembrane helices (TMHs) and orientation, there is still room for improvement. Considering that the pre-trained language model can make full use of massive unlabeled protein sequences to obtain latent feature representation for TMPs and reduce the dependence on evolutionary information, we proposed DeepTMpred, which used pre-trained self-supervised language models called ESM, convolutional neural networks, attentive neural network and conditional random fields for alpha-TMP topology prediction. Compared with the current state-of-the-art tools on a non-redundant dataset of TMPs, DeepTMpred demonstrated superior predictive performance in most evaluation metrics, especially at the TMH level. Furthermore, DeepTMpred could also obtain reliable prediction results for TMPs without much evolutionary feature in a few seconds. A tutorial on how to use DeepTMpred can be found in the colab notebook (https://colab.research.google.com/github/ISYSLAB-HUST/DeepTMpred/blob/master/notebook/test.ipynb).

SUBMITTER: Wang L

PROVIDER: S-EPMC9062415 | biostudies-literature | 2022

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Improving the topology prediction of α-helical transmembrane proteins with deep transfer learning.

Wang Lei L Zhong Haolin H Xue Zhidong Z Wang Yan Y

Computational and structural biotechnology journal 20220420

Transmembrane proteins (TMPs) are essential for cell recognition and communication, and they serve as important drug targets in humans. Transmembrane proteins' 3D structures are critical for determining their functions and drug design but are hard to determine even by experimental methods. Although some computational methods have been developed to predict transmembrane helices (TMHs) and orientation, there is still room for improvement. Considering that the pre-trained language model can make fu ...[more]

PMID: 35521551

Similar Datasets

Project description:Membrane proteins, particularly those that are α-helical, such as transporters and G-protein-coupled receptors (GPCRs), have significant biological relevance. However, their expression and purification pose difficulties because of their poor water solubilities, which impedes progress in this field. The QTY method, a code-based protein-engineering approach, was recently developed to produce soluble transmembrane proteins. Here, we describe a comprehensive Web server built for QTY design and its relevance for in silico analyses. Typically, the simple design model is expected to require only 2 to 4 min of computer time, and the library design model requires 2 to 5 h, depending on the target protein size and the number of transmembrane helices. Detailed protocols for using the server with both the simple design and library design modules are provided. Methods for experiments following the QTY design are also included to facilitate the implementation of this approach. The design pipeline was further evaluated using microbial transmembrane proteins and structural alignment between the designed proteins and their origins by employing AlphaFold2. The results reveal that mutants generated by the developed pipeline were highly identical to their origins in terms of three-dimensional (3D) structures. In summary, the utilization of our Web server and associated protocols will enable QTY-based protein engineering to be implemented in a convenient, fast, accurate, and rational manner. The Protein Solubilizing Server (PSS) is publicly available at http://pss.sjtu.edu.cn. IMPORTANCE Water-soluble expression and purification are of considerable importance for protein identification and characterization. However, there has been a lack of an effective method for water-soluble expression of membrane proteins, which has severely hampered their studies. Here, an enabling comprehensive Web server, PSS, was developed for designing water-soluble mutants of α-helical membrane proteins, based on QTY design, a code-based protein-engineering approach. With microbial transmembrane proteins and GPCRs as examples, we systematically evaluated the server and demonstrated its successful performance. PSS is readily available for worldwide users as a Web-based tool, rendering QTY-based protein engineering convenient, efficient, accurate, and rational.

Dataset Information

Improving the topology prediction of α-helical transmembrane proteins with deep transfer learning.

Publications

Improving the topology prediction of α-helical transmembrane proteins with deep transfer learning.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets