Unknown

Dataset Information

0

DeMaSk: a deep mutational scanning substitution matrix and its use for variant impact prediction.


ABSTRACT:

Motivation

Accurately predicting the quantitative impact of a substitution on a protein's molecular function would be a great aid in understanding the effects of observed genetic variants across populations. While this remains a challenging task, new approaches can leverage data from the increasing numbers of comprehensive deep mutational scanning (DMS) studies that systematically mutate proteins and measure fitness.

Results

We introduce DeMaSk, an intuitive and interpretable method based only upon DMS datasets and sequence homologs that predicts the impact of missense mutations within any protein. DeMaSk first infers a directional amino acid substitution matrix from DMS datasets and then fits a linear model that combines these substitution scores with measures of per-position evolutionary conservation and variant frequency across homologs. Despite its simplicity, DeMaSk has state-of-the-art performance in predicting the impact of amino acid substitutions, and can easily and rapidly be applied to any protein sequence.

Availability

https://demask.princeton.edu generates fitness impact predictions and visualizations for any user-submitted protein sequence.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Munro D 

PROVIDER: S-EPMC8016454 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC7689672 | biostudies-literature
| S-EPMC7077003 | biostudies-literature
| S-EPMC8042483 | biostudies-literature
| S-EPMC7248951 | biostudies-literature
| S-EPMC6205457 | biostudies-literature
| S-EPMC5547491 | biostudies-literature
| S-EPMC4466731 | biostudies-literature
2016-10-30 | E-MTAB-5154 | biostudies-arrayexpress
2019-10-21 | GSE139122 | GEO
2018-04-23 | GSE100368 | GEO