Unknown

Dataset Information

0

SEMA: Antigen B-cell conformational epitope prediction using deep transfer learning.


ABSTRACT: One of the primary tasks in vaccine design and development of immunotherapeutic drugs is to predict conformational B-cell epitopes corresponding to primary antibody binding sites within the antigen tertiary structure. To date, multiple approaches have been developed to address this issue. However, for a wide range of antigens their accuracy is limited. In this paper, we applied the transfer learning approach using pretrained deep learning models to develop a model that predicts conformational B-cell epitopes based on the primary antigen sequence and tertiary structure. A pretrained protein language model, ESM-1v, and an inverse folding model, ESM-IF1, were fine-tuned to quantitatively predict antibody-antigen interaction features and distinguish between epitope and non-epitope residues. The resulting model called SEMA demonstrated the best performance on an independent test set with ROC AUC of 0.76 compared to peer-reviewed tools. We show that SEMA can quantitatively rank the immunodominant regions within the SARS-CoV-2 RBD domain. SEMA is available at https://github.com/AIRI-Institute/SEMAi and the web-interface http://sema.airi.net.

SUBMITTER: Shashkova TI 

PROVIDER: S-EPMC9523212 | biostudies-literature | 2022

REPOSITORIES: biostudies-literature

altmetric image

Publications


One of the primary tasks in vaccine design and development of immunotherapeutic drugs is to predict conformational B-cell epitopes corresponding to primary antibody binding sites within the antigen tertiary structure. To date, multiple approaches have been developed to address this issue. However, for a wide range of antigens their accuracy is limited. In this paper, we applied the transfer learning approach using pretrained deep learning models to develop a model that predicts conformational B-  ...[more]

Similar Datasets

| S-EPMC11223818 | biostudies-literature
| S-EPMC4237749 | biostudies-literature
2021-04-07 | GSE171636 | GEO
| S-EPMC7924438 | biostudies-literature
| S-EPMC7371472 | biostudies-literature
| S-EPMC4326220 | biostudies-literature
| S-EPMC5321161 | biostudies-literature
| S-EPMC10507959 | biostudies-literature
| S-EPMC5570230 | biostudies-literature
| PRJNA720385 | ENA