Dataset Information

Machine learning derived risk prediction of anorexia nervosa.

ABSTRACT:

Background

Anorexia nervosa (AN) is a complex psychiatric disease with a moderate to strong genetic contribution. In addition to conventional genome wide association (GWA) studies, researchers have been using machine learning methods in conjunction with genomic data to predict risk of diseases in which genetics play an important role.

Methods

In this study, we collected whole genome genotyping data on 3940 AN cases and 9266 controls from the Genetic Consortium for Anorexia Nervosa (GCAN), the Wellcome Trust Case Control Consortium 3 (WTCCC3), Price Foundation Collaborative Group and the Children's Hospital of Philadelphia (CHOP), and applied machine learning methods for predicting AN disease risk. The prediction performance is measured by area under the receiver operating characteristic curve (AUC), indicating how well the model distinguishes cases from unaffected control subjects.

Results

Logistic regression model with the lasso penalty technique generated an AUC of 0.693, while Support Vector Machines and Gradient Boosted Trees reached AUC's of 0.691 and 0.623, respectively. Using different sample sizes, our results suggest that larger datasets are required to optimize the machine learning models and achieve higher AUC values.

Conclusions

To our knowledge, this is the first attempt to assess AN risk based on genome wide genotype level data. Future integration of genomic, environmental and family-based information is likely to improve the AN risk evaluation process, eventually benefitting AN patients and families in the clinical setting.

SUBMITTER: Guo Y

PROVIDER: S-EPMC4721143 | biostudies-literature | 2016 Jan

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Machine learning derived risk prediction of anorexia nervosa.

Guo Yiran Y Wei Zhi Z Keating Brendan J BJ Hakonarson Hakon H

BMC medical genomics 20160120

<h4>Background</h4>Anorexia nervosa (AN) is a complex psychiatric disease with a moderate to strong genetic contribution. In addition to conventional genome wide association (GWA) studies, researchers have been using machine learning methods in conjunction with genomic data to predict risk of diseases in which genetics play an important role.<h4>Methods</h4>In this study, we collected whole genome genotyping data on 3940 AN cases and 9266 controls from the Genetic Consortium for Anorexia Nervosa ...[more]

PMID: 26792494

Dataset Information

Machine learning derived risk prediction of anorexia nervosa.

Background

Methods

Results

Conclusions

Publications

Machine learning derived risk prediction of anorexia nervosa.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Prediction of Breast Cancer Estrogen Receptor Status using Machine Learning
2013-01-01 | E-GEOD-29210 | biostudies-arrayexpress

An investigation of habit learning in Anorexia Nervosa.
| S-EPMC5718042 | biostudies-literature

VNN1 : a new biomarker of anorexia nervosa
2023-10-31 | GSE245978 | GEO

Multiomic prioritisation of risk genes for anorexia nervosa.
| S-EPMC10600818 | biostudies-literature

Autism and anorexia nervosa: Longitudinal prediction of eating disorder outcomes.
| S-EPMC9533087 | biostudies-literature

Impaired reversal learning in an animal model of anorexia nervosa.
| S-EPMC7041414 | biostudies-literature

Decreased feedback learning in anorexia nervosa persists after weight restoration.
| S-EPMC5869029 | biostudies-literature

Altered learning from positive feedback in adolescents with anorexia nervosa.
| S-EPMC11773347 | biostudies-literature

Modeling anorexia nervosa: transcriptional insights from human iPSC-derived neurons.
| S-EPMC5416680 | biostudies-literature