Dataset Information

Machine learning-based reclassification of germline variants of unknown significance: The RENOVO algorithm.

ABSTRACT: The increasing scope of genetic testing allowed by next-generation sequencing (NGS) dramatically increased the number of genetic variants to be interpreted as pathogenic or benign for adequate patient management. Still, the interpretation process often fails to deliver a clear classification, resulting in either variants of unknown significance (VUSs) or variants with conflicting interpretation of pathogenicity (CIP); these represent a major clinical problem because they do not provide useful information for decision-making, causing a large fraction of genetically determined disease to remain undertreated. We developed a machine learning (random forest)-based tool, RENOVO, that classifies variants as pathogenic or benign on the basis of publicly available information and provides a pathogenicity likelihood score (PLS). Using the same feature classes recommended by guidelines, we trained RENOVO on established pathogenic/benign variants in ClinVar (training set accuracy = 99%) and tested its performance on variants whose interpretation has changed over time (test set accuracy = 95%). We further validated the algorithm on additional datasets including unreported variants validated either through expert consensus (ENIGMA) or laboratory-based functional techniques (on BRCA1/2 and SCN5A). On all datasets, RENOVO outperformed existing automated interpretation tools. On the basis of the above validation metrics, we assigned a defined PLS to all existing ClinVar VUSs, proposing a reclassification for 67% with >90% estimated precision. RENOVO provides a validated tool to reduce the fraction of uninterpreted or misinterpreted variants, tackling an area of unmet need in modern clinical genetics.
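The abstract describes turning a pathogenicity likelihood score (PLS) into proposed reclassifications and reporting what fraction of variants receive a call at a given precision. A minimal Python sketch of that bookkeeping follows. It is not RENOVO's implementation: the function names, the cut-offs 0.1 and 0.9, and the toy scores are all assumptions made for illustration, and "precision" is simplified here to the share of proposed calls that agree with a known label.

```python
from typing import List, Tuple


def reclassify(pls: float, low: float = 0.1, high: float = 0.9) -> str:
    """Map a score in [0, 1] to a proposed call.

    `low` and `high` are illustrative placeholders, not RENOVO's thresholds.
    """
    if not 0.0 <= pls <= 1.0:
        raise ValueError("PLS must lie in [0, 1]")
    if pls >= high:
        return "pathogenic"
    if pls <= low:
        return "benign"
    return "uncertain"


def summarize(
    scored: List[Tuple[float, str]], low: float = 0.1, high: float = 0.9
) -> Tuple[float, float]:
    """Return (fraction of variants given a call, agreement of those calls).

    `scored` holds (PLS, known label) pairs, with labels "pathogenic" or "benign".
    """
    calls = [(reclassify(pls, low, high), truth) for pls, truth in scored]
    decided = [(call, truth) for call, truth in calls if call != "uncertain"]
    fraction = len(decided) / len(calls) if calls else 0.0
    agreement = (
        sum(call == truth for call, truth in decided) / len(decided)
        if decided
        else 0.0
    )
    return fraction, agreement


# Hypothetical toy data: (PLS, label established by later evidence).
toy = [
    (0.97, "pathogenic"),
    (0.93, "pathogenic"),
    (0.91, "benign"),      # confident but wrong call
    (0.55, "pathogenic"),  # stays uncertain
    (0.40, "benign"),      # stays uncertain
    (0.08, "benign"),
    (0.02, "benign"),
    (0.05, "benign"),
]

fraction, agreement = summarize(toy)
print(f"reclassified: {fraction:.0%}, agreement with known labels: {agreement:.0%}")
```

Tightening the cut-offs toward 0 and 1 would leave more variants uncertain while making the remaining calls more reliable, which is the trade-off behind a statement such as "reclassification for 67% with >90% estimated precision".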

SUBMITTER: Favalli V

PROVIDER: S-EPMC8059374 | biostudies-literature | 2021 Apr

REPOSITORIES: biostudies-literature


Publications

Machine learning-based reclassification of germline variants of unknown significance: The RENOVO algorithm.

Valentina Favalli, Giulia Tini, Emanuele Bonetti, Gianluca Vozza, Alessandro Guida, Sara Gandini, Pier Giuseppe Pelicci, Luca Mazzarella

American Journal of Human Genetics, 23 March 2021, issue 4


PMID: 33761318

Similar Datasets

Reclassification of Five BRCA1/2 Variants with Unknown Significance Using Complex Functional Study. | S-EPMC9582465 | biostudies-literature

Prioritizing variants of uncertain significance for reclassification using a rule-based algorithm in inherited retinal dystrophies. | S-EPMC7902814 | biostudies-literature

Feasibility of Follow-Up Studies and Reclassification in Spinocerebellar Ataxia Gene Variants of Unknown Significance. | S-EPMC8990126 | biostudies-literature

Functional characterization of variants of unknown significance in a spinocerebellar ataxia patient using an unsupervised machine learning pipeline. | S-EPMC9010413 | biostudies-literature

Diagnostic significance of plasma lipid markers and machine learning-based algorithm for gastric cancer. | S-EPMC8020384 | biostudies-literature

Development of A Machine Learning Algorithm to Classify Drugs Of Unknown Fetal Effect. | S-EPMC5634437 | biostudies-literature

MutPred Splice: machine learning-based prediction of exonic variants that disrupt splicing. | S-EPMC4054890 | biostudies-other

Functional significance of germline EPAS1 variants. | S-EPMC7989857 | biostudies-literature

Interpretation of genetic testing: variants of unknown significance. | S-EPMC3587691 | biostudies-literature

Trans-activation-based risk assessment of BRCA1 BRCT variants with unknown clinical significance. | S-EPMC6247502 | biostudies-literature