Dataset Information

Performance evaluation of pathogenicity-computation methods for missense variants.


ABSTRACT: With the expanding application of next-generation sequencing in medical genetics, a growing number of computational methods are being developed to predict the pathogenicity of missense variants. Selecting optimal methods can accelerate the identification of candidate genes. However, the performance of different computational methods under various conditions has not been fully evaluated. Here, we compared 12 performance measures of 23 methods based on three independent benchmark datasets: (i) clinical variants from the ClinVar database related to genetic diseases, (ii) somatic variants from the IARC TP53 and ICGC databases related to human cancers and (iii) experimentally evaluated PPARG variants. Some methods performed differently across these conditions, suggesting that they are not equally applicable in every setting. For most methods, specificity was lower than sensitivity (especially for the experimentally evaluated benchmark datasets), suggesting that more rigorous cutoff values are needed to distinguish pathogenic variants. REVEL, VEST3 and the combination of the two (i.e. ReVe) showed the best overall performance across all the benchmark data. Finally, we evaluated these methods on de novo mutations and found that ReVe consistently performed best. We have summarized the performance of the different methods under various conditions, providing tentative guidance for optimal tool selection.
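
The abstract's point about cutoff values can be illustrated with a minimal sketch. It assumes a predictor that outputs a score where higher means more likely pathogenic, and it calls a variant pathogenic when its score is at or above the cutoff. The function names, scores and labels below are hypothetical toy values chosen for illustration; they are not taken from the study or from any of the 23 methods.

```python
def confusion_counts(scores, labels, cutoff):
    """Count TP, FP, TN, FN when score >= cutoff is called pathogenic."""
    tp = fp = tn = fn = 0
    for score, is_pathogenic in zip(scores, labels):
        predicted_pathogenic = score >= cutoff
        if predicted_pathogenic and is_pathogenic:
            tp += 1
        elif predicted_pathogenic and not is_pathogenic:
            fp += 1
        elif not predicted_pathogenic and is_pathogenic:
            fn += 1
        else:
            tn += 1
    return tp, fp, tn, fn


def sensitivity_specificity(scores, labels, cutoff):
    """Return (sensitivity, specificity) at the given cutoff."""
    tp, fp, tn, fn = confusion_counts(scores, labels, cutoff)
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity


# Hypothetical toy benchmark: four pathogenic and four benign variants.
toy_scores = [0.95, 0.80, 0.65, 0.55, 0.60, 0.45, 0.30, 0.10]
toy_labels = [True, True, True, True, False, False, False, False]

for cutoff in (0.4, 0.5, 0.7):
    sens, spec = sensitivity_specificity(toy_scores, toy_labels, cutoff)
    print(f"cutoff={cutoff:.1f} sensitivity={sens:.2f} specificity={spec:.2f}")
```

On this toy data, a lenient cutoff catches every pathogenic variant but mislabels benign ones, while a stricter cutoff raises specificity at the cost of sensitivity. This is the trade-off behind the abstract's suggestion that more rigorous cutoffs may be needed.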

SUBMITTER: Li J 

PROVIDER: S-EPMC6125674 | biostudies-literature | 2018 Sep

REPOSITORIES: biostudies-literature

Publications

Performance evaluation of pathogenicity-computation methods for missense variants.

Li Jinchen, Zhao Tingting, Zhang Yi, Zhang Kun, Shi Leisheng, Chen Yun, Wang Xingxing, Sun Zhongsheng

Nucleic Acids Research, 2018-09-01, issue 15


Similar Datasets

| S-EPMC7214033 | biostudies-literature
| S-EPMC6744350 | biostudies-literature
| S-EPMC7820281 | biostudies-literature
2022-09-25 | PRJEB56211 | EVA
| PRJEB46587 | ENA
| S-EPMC5065685 | biostudies-literature
| S-EPMC8754197 | biostudies-literature
| S-EPMC7859701 | biostudies-literature