Dataset Information

What went wrong with variant effect predictor performance for the PCM1 challenge.

ABSTRACT: Recent years have seen a drastic increase in the number of available genomic sequences. Alongside this explosion, hundreds of computational tools were developed to assess the impact of observed genetic variation. The Critical Assessment of Genome Interpretation (CAGI) provides a platform to evaluate the performance of these tools in experimentally relevant contexts. In the CAGI-5 challenge assessing the 38 missense variants affecting the human Pericentriolar material 1 protein (PCM1), our SNAP-based submission was the top performer, although it did worse than expected from other evaluations. Here, we compare the CAGI-5 submissions, and 24 additional commonly used variant effect predictors, to analyze the reasons for this observation. We identified per-residue conservation, structural, and functional PCM1 characteristics that may be responsible. As expected, predictors had a hard time distinguishing effect variants in nonconserved positions. They were also better able to call effect variants in a structurally rich region than in a less-structured one; in the latter, they more often correctly identified benign than effect variants. Curiously, most of the protein was predicted to be functionally robust to mutation, a feature that likely makes it a harder problem for generalized variant effect predictors.

SUBMITTER: Miller M

PROVIDER: S-EPMC6744297 | biostudies-literature | 2019 Sep

REPOSITORIES: biostudies-literature

Publications

What went wrong with variant effect predictor performance for the PCM1 challenge.

Maximilian Miller, Yanran Wang, Yana Bromberg

Human Mutation, 2019-07-03, issue 9

PMID: 31268618

Similar Datasets

KETASER01 protocol: What went right and what went wrong.
| S-EPMC9436287 | biostudies-literature

What went wrong? The flawed concept of cerebrospinal venous insufficiency.
| S-EPMC3652697 | biostudies-other

The Ensembl Variant Effect Predictor.
| S-EPMC4893825 | biostudies-literature

New developments in the management of non-small-cell lung cancer, focus on rociletinib: what went wrong?
| S-EPMC5063481 | biostudies-literature

What went wrong: A reckoning of Canada's contributions to evidence-based medicine through clinical trials during the COVID-19 pandemic.
| S-EPMC9629253 | biostudies-literature

A plugin for the Ensembl Variant Effect Predictor that uses MaxEntScan to predict variant spliceogenicity.
| S-EPMC6596880 | biostudies-other

Regulatory Single-Nucleotide Variant Predictor Increases Predictive Performance of Functional Regulatory Variants.
| S-EPMC6192032 | biostudies-literature

Annotating and prioritizing genomic variants using the Ensembl Variant Effect Predictor-A tutorial.
| S-EPMC7613081 | biostudies-literature

Performance of HADDOCK and a simple contact-based protein-ligand binding affinity predictor in the D3R Grand Challenge 2.
| S-EPMC5767195 | biostudies-literature

Rich annotation of DNA sequencing variants by leveraging the Ensembl Variant Effect Predictor with plugins.
| S-EPMC6283364 | biostudies-literature