Unknown

Dataset Information

0

Application of two machine learning algorithms to genetic association studies in the presence of covariates.


ABSTRACT: Population-based investigations aimed at uncovering genotype-trait associations often involve high-dimensional genetic polymorphism data as well as information on multiple environmental and clinical parameters. Machine learning (ML) algorithms offer a straightforward analytic approach for selecting subsets of these inputs that are most predictive of a pre-defined trait. The performance of these algorithms, however, in the presence of covariates is not well characterized.In this manuscript, we investigate two approaches: Random Forests (RFs) and Multivariate Adaptive Regression Splines (MARS). Through multiple simulation studies, the performance under several underlying models is evaluated. An application to a cohort of HIV-1 infected individuals receiving anti-retroviral therapies is also provided.Consistent with more traditional regression modeling theory, our findings highlight the importance of considering the nature of underlying gene-covariate-trait relationships before applying ML algorithms, particularly when there is potential confounding or effect mediation.

SUBMITTER: Nonyane BA 

PROVIDER: S-EPMC2620353 | biostudies-literature | 2008 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Application of two machine learning algorithms to genetic association studies in the presence of covariates.

Nonyane Bareng A S BA   Foulkes Andrea S AS  

BMC genetics 20081114


<h4>Background</h4>Population-based investigations aimed at uncovering genotype-trait associations often involve high-dimensional genetic polymorphism data as well as information on multiple environmental and clinical parameters. Machine learning (ML) algorithms offer a straightforward analytic approach for selecting subsets of these inputs that are most predictive of a pre-defined trait. The performance of these algorithms, however, in the presence of covariates is not well characterized.<h4>Me  ...[more]

Similar Datasets

| S-EPMC5632292 | biostudies-literature
| S-EPMC8087002 | biostudies-literature
| S-EPMC6203400 | biostudies-literature
| S-EPMC5907739 | biostudies-literature
| S-EPMC10562960 | biostudies-literature
| S-EPMC10539075 | biostudies-literature
| S-EPMC6609504 | biostudies-literature
| S-EPMC7835636 | biostudies-literature
| S-EPMC9880585 | biostudies-literature
| S-EPMC2828904 | biostudies-literature