Unknown

Dataset Information

0

Integrating deep mutational scanning and low-throughput mutagenesis data to predict the impact of amino acid variants.


ABSTRACT:

Background

Evaluating the impact of amino acid variants has been a critical challenge for studying protein function and interpreting genomic data. High-throughput experimental methods like deep mutational scanning (DMS) can measure the effect of large numbers of variants in a target protein, but because DMS studies have not been performed on all proteins, researchers also model DMS data computationally to estimate variant impacts by predictors.

Results

In this study, we extended a linear regression-based predictor to explore whether incorporating data from alanine scanning (AS), a widely used low-throughput mutagenesis method, would improve prediction results. To evaluate our model, we collected 146 AS datasets, mapping to 54 DMS datasets across 22 distinct proteins.

Conclusions

We show that improved model performance depends on the compatibility of the DMS and AS assays, and the scale of improvement is closely related to the correlation between DMS and AS results.

SUBMITTER: Fu Y 

PROVIDER: S-EPMC10506130 | biostudies-literature | 2022 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Integrating deep mutational scanning and low-throughput mutagenesis data to predict the impact of amino acid variants.

Fu Yunfan Y   Bedő Justin J   Papenfuss Anthony T AT   Rubin Alan F AF  

GigaScience 20221201


<h4>Background</h4>Evaluating the impact of amino acid variants has been a critical challenge for studying protein function and interpreting genomic data. High-throughput experimental methods like deep mutational scanning (DMS) can measure the effect of large numbers of variants in a target protein, but because DMS studies have not been performed on all proteins, researchers also model DMS data computationally to estimate variant impacts by predictors.<h4>Results</h4>In this study, we extended a  ...[more]

Similar Datasets

| S-EPMC6126756 | biostudies-literature
2024-03-21 | GSE253580 | GEO
| S-EPMC10592615 | biostudies-literature
| S-SCDT-10_1038-S44320-024-00043-8 | biostudies-other
| S-EPMC8649220 | biostudies-literature
| S-EPMC7359949 | biostudies-literature
| S-EPMC11219921 | biostudies-literature
| S-EPMC10051238 | biostudies-literature
| S-EPMC30410 | biostudies-literature
| S-EPMC7248951 | biostudies-literature