Dataset Information

Variant effect predictions capture some aspects of deep mutational scanning experiments.

ABSTRACT: BACKGROUND:Deep mutational scanning (DMS) studies exploit the mutational landscape of sequence variation by systematically and comprehensively assaying the effect of single amino acid variants (SAVs; also referred to as missense mutations, or non-synonymous Single Nucleotide Variants - missense SNVs or nsSNVs) for particular proteins. We assembled SAV annotations from 22 different DMS experiments and normalized the effect scores to evaluate variant effect prediction methods. Three trained on traditional variant effect data (PolyPhen-2, SIFT, SNAP2), a regression method optimized on DMS data (Envision), and a naïve prediction using conservation information from homologs. RESULTS:On a set of 32,981 SAVs, all methods captured some aspects of the experimental effect scores, albeit not the same. Traditional methods such as SNAP2 correlated slightly more with measurements and better classified binary states (effect or neutral). Envision appeared to better estimate the precise degree of effect. Most surprising was that the simple naïve conservation approach using PSI-BLAST in many cases outperformed other methods. All methods captured beneficial effects (gain-of-function) significantly worse than deleterious (loss-of-function). For the few proteins with multiple independent experimental measurements, experiments differed substantially, but agreed more with each other than with predictions. CONCLUSIONS:DMS provides a new powerful experimental means of understanding the dynamics of the protein sequence space. As always, promising new beginnings have to overcome challenges. While our results demonstrated that DMS will be crucial to improve variant effect prediction methods, data diversity hindered simplification and generalization.

SUBMITTER: Reeb J

PROVIDER: S-EPMC7077003 | biostudies-literature | 2020 Mar

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Variant effect predictions capture some aspects of deep mutational scanning experiments.

Reeb Jonas J Wirth Theresa T Rost Burkhard B

BMC bioinformatics 20200317 1

<h4>Background</h4>Deep mutational scanning (DMS) studies exploit the mutational landscape of sequence variation by systematically and comprehensively assaying the effect of single amino acid variants (SAVs; also referred to as missense mutations, or non-synonymous Single Nucleotide Variants - missense SNVs or nsSNVs) for particular proteins. We assembled SAV annotations from 22 different DMS experiments and normalized the effect scores to evaluate variant effect prediction methods. Three traine ...[more]

PMID: 32183714

Similar Datasets

Project description:SLCO1B1 (solute carrier organic anion transporter family member 1B1) is an important transmembrane hepatic uptake transporter. Genetic variants in the SLCO1B1 gene have been associated with altered protein folding, resulting in protein degradation and decreased transporter activity. Next-generation sequencing (NGS) of pharmacogenes is being applied increasingly to associate variation in drug response with genetic sequence variants. However, it is difficult to link variants of unknown significance with functional phenotypes using "one-at-a-time" functional systems. Deep mutational scanning (DMS) using a "landing pad cell-based system" is a high-throughput technique designed to analyze hundreds of gene open reading frame (ORF) missense variants in a parallel and scalable fashion. We have applied DMS to analyze 137 missense variants in the SLCO1B1 ORF obtained from the Exome Aggregation Consortium project. ORFs containing these variants were fused to green fluorescent protein and were integrated into "landing pad" cells. Florescence-activated cell sorting was performed to separate the cells into four groups based on fluorescence readout indicating protein expression at the single cell level. NGS was then performed and SLCO1B1 variant frequencies were used to determine protein abundance. We found that six variants not previously characterized functionally displayed less than 25% and another 12 displayed approximately 50% of wild-type protein expression. These results were then functionally validated by transporter studies. Severely damaging variants identified by DMS may have clinical relevance for SLCO1B1-dependent drug transport, but we need to exercise caution since the relatively small number of severely damaging variants identified raise questions with regard to the application of DMS to intrinsic membrane proteins such as organic anion transporter protein 1B1. SIGNIFICANCE STATEMENT: The functional implications of a large numbers of open reading frame (ORF) "variants of unknown significance" (VUS) in transporter genes have not been characterized. This study applied deep mutational scanning to determine the functional effects of VUS that have been observed in the ORF of SLCO1B1(s olute carrier organic anion transporter family member 1B1). Several severely damaging variants were identified, studied, and validated. These observations have implications for both the application of deep mutational scanning to intrinsic membrane proteins and for the clinical effect of drugs and endogenous compounds transported by SLCO1B1.

Dataset Information

Variant effect predictions capture some aspects of deep mutational scanning experiments.

Publications

Variant effect predictions capture some aspects of deep mutational scanning experiments.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets