Unknown

Dataset Information

0

THPLM: a sequence-based deep learning framework for protein stability changes prediction upon point variations using pretrained protein language model.


ABSTRACT:

Motivation

Quantitative determination of protein thermodynamic stability is a critical step in protein and drug design. Reliable prediction of protein stability changes caused by point variations contributes to developing-related fields. Over the past decades, dozens of structure-based and sequence-based methods have been proposed, showing good prediction performance. Despite the impressive progress, it is necessary to explore wild-type and variant protein representations to address the problem of how to represent the protein stability change in view of global sequence. With the development of structure prediction using learning-based methods, protein language models (PLMs) have shown accurate and high-quality predictions of protein structure. Because PLM captures the atomic-level structural information, it can help to understand how single-point variations cause functional changes.

Results

Here, we proposed THPLM, a sequence-based deep learning model for stability change prediction using Meta's ESM-2. With ESM-2 and a simple convolutional neural network, THPLM achieved comparable or even better performance than most methods, including sequence-based and structure-based methods. Furthermore, the experimental results indicate that the PLM's ability to generate representations of sequence can effectively improve the ability of protein function prediction.

Availability and implementation

The source code of THPLM and the testing data can be accessible through the following links: https://github.com/FPPGroup/THPLM.

SUBMITTER: Gong J 

PROVIDER: S-EPMC10627365 | biostudies-literature | 2023 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

THPLM: a sequence-based deep learning framework for protein stability changes prediction upon point variations using pretrained protein language model.

Gong Jianting J   Jiang Lili L   Chen Yongbing Y   Zhang Yixiang Y   Li Xue X   Ma Zhiqiang Z   Fu Zhiguo Z   He Fei F   Sun Pingping P   Ren Zilin Z   Tian Mingyao M  

Bioinformatics (Oxford, England) 20231101 11


<h4>Motivation</h4>Quantitative determination of protein thermodynamic stability is a critical step in protein and drug design. Reliable prediction of protein stability changes caused by point variations contributes to developing-related fields. Over the past decades, dozens of structure-based and sequence-based methods have been proposed, showing good prediction performance. Despite the impressive progress, it is necessary to explore wild-type and variant protein representations to address the  ...[more]

Similar Datasets

| S-EPMC7728386 | biostudies-literature
| S-EPMC11317537 | biostudies-literature
| S-EPMC6606456 | biostudies-literature
| S-EPMC2686433 | biostudies-literature
| S-EPMC5039926 | biostudies-literature
| S-EPMC9791599 | biostudies-literature
2025-05-03 | PXD046201 | JPOST Repository
| S-EPMC4403899 | biostudies-literature
| S-EPMC10676515 | biostudies-literature
| S-EPMC9505940 | biostudies-literature