Unknown

Dataset Information

0

Multimodel inference for biomarker development: an application to schizophrenia.


ABSTRACT: In the present study, to improve the predictive performance of a model and its reproducibility when applied to an independent data set, we investigated the use of multimodel inference to predict the probability of having a complex psychiatric disorder. We formed training and test sets using proteomic data (147 peptides from 77 proteins) from two-independent collections of first-onset drug-naive schizophrenia patients and controls. A set of prediction models was produced by applying lasso regression with repeated tenfold cross-validation to the training set. We used feature extraction and model averaging across the set of models to form two prediction models. The resulting models clearly demonstrated the utility of a multimodel based approach to make good (training set AUC?>?0.80) and reproducible predictions (test set AUC?>?0.80) for the probability of having schizophrenia. Moreover, we identified four proteins (five peptides) whose effect on the probability of having schizophrenia was modified by sex, one of which was a novel potential biomarker of schizophrenia, foetal haemoglobin. The evidence of effect modification suggests that future schizophrenia studies should be conducted in males and females separately. Future biomarker studies should consider adopting a multimodel approach and going beyond the main effects of features.

SUBMITTER: Cooper JD 

PROVIDER: S-EPMC6370882 | biostudies-literature | 2019 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Multimodel inference for biomarker development: an application to schizophrenia.

Cooper Jason D JD   Han Sung Yeon Sarah SYS   Tomasik Jakub J   Ozcan Sureyya S   Rustogi Nitin N   van Beveren Nico J M NJM   Leweke F Markus FM   Bahn Sabine S  

Translational psychiatry 20190211 1


In the present study, to improve the predictive performance of a model and its reproducibility when applied to an independent data set, we investigated the use of multimodel inference to predict the probability of having a complex psychiatric disorder. We formed training and test sets using proteomic data (147 peptides from 77 proteins) from two-independent collections of first-onset drug-naive schizophrenia patients and controls. A set of prediction models was produced by applying lasso regress  ...[more]

Similar Datasets

| S-EPMC3213087 | biostudies-literature
| S-EPMC3469468 | biostudies-literature
| S-EPMC7855182 | biostudies-literature
| S-EPMC2838508 | biostudies-other
| S-EPMC5678891 | biostudies-literature
| S-EPMC4931851 | biostudies-other
| S-EPMC5290312 | biostudies-literature
| S-EPMC7614773 | biostudies-literature
| S-EPMC5068725 | biostudies-literature
| S-EPMC7351254 | biostudies-literature