Unknown

Dataset Information

0

InMeRF: prediction of pathogenicity of missense variants by individual modeling for each amino acid substitution.


ABSTRACT: In predicting the pathogenicity of a nonsynonymous single-nucleotide variant (nsSNV), a radical change in amino acid properties is prone to be classified as being pathogenic. However, not all such nsSNVs are associated with human diseases. We generated random forest (RF) models individually for each amino acid substitution to differentiate pathogenic nsSNVs in the Human Gene Mutation Database and common nsSNVs in dbSNP. We named a set of our models 'Individual Meta RF' (InMeRF). Ten-fold cross-validation of InMeRF showed that the areas under the curves (AUCs) of receiver operating characteristic (ROC) and precision-recall curves were on average 0.941 and 0.957, respectively. To compare InMeRF with seven other tools, the eight tools were generated using the same training dataset, and were compared using the same three testing datasets. ROC-AUCs of InMeRF were ranked first in the eight tools. We applied InMeRF to 155 pathogenic and 125 common nsSNVs in seven major genes causing congenital myasthenic syndromes, as well as in VANGL1 causing spina bifida, and found that the sensitivity and specificity of InMeRF were 0.942 and 0.848, respectively. We made the InMeRF web service, and also made genome-wide InMeRF scores available online (https://www.med.nagoya-u.ac.jp/neurogenetics/InMeRF/).

SUBMITTER: Takeda JI 

PROVIDER: S-EPMC7671370 | biostudies-literature | 2020 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

InMeRF: prediction of pathogenicity of missense variants by individual modeling for each amino acid substitution.

Takeda Jun-Ichi JI   Nanatsue Kentaro K   Yamagishi Ryosuke R   Ito Mikako M   Haga Nobuhiko N   Hirata Hiromi H   Ogi Tomoo T   Ohno Kinji K  

NAR genomics and bioinformatics 20200526 2


In predicting the pathogenicity of a nonsynonymous single-nucleotide variant (nsSNV), a radical change in amino acid properties is prone to be classified as being pathogenic. However, not all such nsSNVs are associated with human diseases. We generated random forest (RF) models individually for each amino acid substitution to differentiate pathogenic nsSNVs in the Human Gene Mutation Database and common nsSNVs in dbSNP. We named a set of our models 'Individual Meta RF' (InMeRF). Ten-fold cross-v  ...[more]

Similar Datasets

| S-EPMC8546039 | biostudies-literature
| S-EPMC10680766 | biostudies-literature
| S-EPMC2924240 | biostudies-literature
| S-EPMC10307865 | biostudies-literature
| S-EPMC6043381 | biostudies-literature
| S-EPMC7214033 | biostudies-literature
| S-EPMC4913391 | biostudies-literature
| S-EPMC7887062 | biostudies-literature
| S-EPMC10419700 | biostudies-literature
| S-EPMC6744350 | biostudies-literature