Dataset Information

ENTPRISE-X: Predicting disease-associated frameshift and nonsense mutations.

ABSTRACT: To exploit the plethora of information provided by Next Generation Sequencing, identifying the genetic mutations responsible for disease in general, or cancer in particular, among the thousands of neutral germline or somatic variations is a crucial task. Genome-wide association studies for the detection of disease-associated genes or cancer drivers can only identify common variations or driver genes in a cohort of patients. Thus, they cannot discover unique disease-associated mutations or cancer driver genes on a personal basis. Moreover, even when there are such common variations, their significance is unknown. Here, we extend ENTPRISE, the machine-learning-based approach developed for predicting the disease association of missense mutations, to frameshift and nonsense mutations. The new approach, ENTPRISE-X, is shown to outperform the state-of-the-art methods VEST-indel and DDIG-in for predicting the disease association of germline frameshift mutations in terms of a balanced measure, the Matthews correlation coefficient (MCC): ENTPRISE-X achieves an MCC of 0.586, versus 0.412 for VEST-indel and 0.321 for DDIG-in. Large-scale testing on the ExAC dataset shows that ENTPRISE-X classifies a much lower fraction of variations as disease-associated (16%) than VEST-indel (26%) or DDIG-in (65%). A web server for ENTPRISE-X is freely available for academic users at http://cssb2.biology.gatech.edu/entprise-x.
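
For readers unfamiliar with the Matthews correlation coefficient used in the abstract, the following is a minimal sketch of how it is computed from the four cells of a binary confusion matrix. The function name `mcc` and the example counts are illustrative assumptions; they are not taken from the ENTPRISE-X paper or its software.

```python
import math


def mcc(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews correlation coefficient for a binary confusion matrix.

    tp/tn/fp/fn are counts of true positives, true negatives,
    false positives and false negatives. Returns a value in [-1, 1].
    """
    numerator = tp * tn - fp * fn
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Common convention: report 0 when any marginal total is zero,
    # since the coefficient is undefined in that case.
    if denominator == 0:
        return 0.0
    return numerator / denominator


# Hypothetical counts, for illustration only (not data from the study).
perfect = mcc(tp=50, tn=50, fp=0, fn=0)
chance = mcc(tp=25, tn=25, fp=25, fn=25)
inverted = mcc(tp=0, tn=0, fp=50, fn=50)
```

Because the numerator involves all four cells, MCC stays informative when the disease and neutral classes are unbalanced, which is presumably why the abstract describes it as a balanced measure.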

SUBMITTER: Zhou H

PROVIDER: S-EPMC5933770 | biostudies-literature | 2018

REPOSITORIES: biostudies-literature

Publications

ENTPRISE-X: Predicting disease-associated frameshift and nonsense mutations.

Zhou H, Gao M, Skolnick J

PLoS One, 2018-05-03, issue 5

PMID: 29723276

Similar Datasets

Gene characteristics predicting missense, nonsense and frameshift mutations in tumor samples.
| S-EPMC6245819 | biostudies-other

Sign of APOBEC editing, purifying selection, frameshift, and in-frame nonsense mutations in the microevolution of lumpy skin disease virus.
| S-EPMC10682384 | biostudies-literature

Increased frequency of FBN1 frameshift and nonsense mutations in Marfan syndrome patients with aortic dissection.
| S-EPMC6978253 | biostudies-literature

Inhibition of nonsense-mediated mRNA decay by antisense morpholino oligonucleotides restores functional expression of hERG nonsense and frameshift mutations in long-QT syndrome.
| S-EPMC3049309 | biostudies-literature

Homozygous nonsense and frameshift mutations of the ACTH receptor in children with familial glucocorticoid deficiency (FGD) are not associated with long-term mineralocorticoid deficiency.
| S-EPMC2728896 | biostudies-literature

GSN gene frameshift mutations in Alzheimer's disease.
| S-EPMC10314070 | biostudies-literature

Splice site, frameshift, and chimeric GFAP mutations in Alexander disease.
| S-EPMC3674965 | biostudies-literature

Nonsense mutations in FAM161A cause RP28-associated recessive retinitis pigmentosa.
| S-EPMC2933350 | biostudies-literature

Fluorescent reporters give new insights into antibiotics-induced nonsense and frameshift mistranslation.
| S-EPMC10959953 | biostudies-literature

Nonsense and frameshift mutations in ZFHX1B, encoding Smad-interacting protein 1, cause a complex developmental disorder with a great variety of clinical features.
| S-EPMC1235530 | biostudies-literature