Unknown

Dataset Information

0

DeMaSk: a deep mutational scanning substitution matrix and its use for variant impact prediction.


ABSTRACT: Accurately predicting the quantitative impact of a substitution on a protein's molecular function would be a great aid in understanding the effects of observed genetic variants across populations. While this remains a challenging task, new approaches can leverage data from the increasing numbers of comprehensive deep mutational scanning (DMS) studies that systematically mutate proteins and measure fitness. We introduce DeMaSk, an intuitive and interpretable method based only upon DMS datasets and sequence homologs that predicts the impact of missense mutations within any protein. DeMaSk first infers a directional amino acid substitution matrix from DMS datasets and then fits a linear model that combines these substitution scores with measures of per-position evolutionary conservation and variant frequency across homologs. Despite its simplicity, DeMaSk has state-of-the-art performance in predicting the impact of amino acid substitutions, and can easily and rapidly be applied to any protein sequence. https://demask.princeton.edu generates fitness impact predictions and visualizations for any user-submitted protein sequence. Supplementary data are available at Bioinformatics online.

SUBMITTER: Munro D 

PROVIDER: S-EPMC8016454 | biostudies-literature | 2020 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

DeMaSk: a deep mutational scanning substitution matrix and its use for variant impact prediction.

Munro Daniel D   Singh Mona M  

Bioinformatics (Oxford, England) 20210401 22-23


<h4>Motivation</h4>Accurately predicting the quantitative impact of a substitution on a protein's molecular function would be a great aid in understanding the effects of observed genetic variants across populations. While this remains a challenging task, new approaches can leverage data from the increasing numbers of comprehensive deep mutational scanning (DMS) studies that systematically mutate proteins and measure fitness.<h4>Results</h4>We introduce DeMaSk, an intuitive and interpretable meth  ...[more]

Similar Datasets

| S-EPMC7689672 | biostudies-literature
| S-EPMC10407742 | biostudies-literature
| S-EPMC7077003 | biostudies-literature
| S-EPMC10528359 | biostudies-literature
| S-EPMC7336272 | biostudies-literature
| S-EPMC8042483 | biostudies-literature
| S-EPMC7248951 | biostudies-literature
| S-EPMC11244880 | biostudies-literature
2024-10-02 | GSE244489 | GEO
2024-09-10 | GSE276399 | GEO