Unknown

Dataset Information

0

Performance of random forests and logic regression methods using mini-exome sequence data.


ABSTRACT: Machine learning approaches are an attractive option for analyzing large-scale data to detect genetic variants that contribute to variation of a quantitative trait, without requiring specific distributional assumptions. We evaluate two machine learning methods, random forests and logic regression, and compare them to standard simple univariate linear regression, using the Genetic Analysis Workshop 17 mini-exome data. We also apply these methods after collapsing multiple rare variants within genes and within gene pathways. Linear regression and the random forest method performed better when rare variants were collapsed based on genes or gene pathways than when each variant was analyzed separately. Logic regression performed better when rare variants were collapsed based on genes rather than on pathways.

SUBMITTER: Kim Y 

PROVIDER: S-EPMC3287827 | biostudies-other | 2011 Nov

REPOSITORIES: biostudies-other

altmetric image

Publications

Performance of random forests and logic regression methods using mini-exome sequence data.

Kim Yoonhee Y   Li Qing Q   Cropp Cheryl D CD   Sung Heejong H   Cai Juanliang J   Simpson Claire L CL   Perry Brian B   Dasgupta Abhijit A   Malley James D JD   Wilson Alexander F AF   Bailey-Wilson Joan E JE  

BMC proceedings 20111129


Machine learning approaches are an attractive option for analyzing large-scale data to detect genetic variants that contribute to variation of a quantitative trait, without requiring specific distributional assumptions. We evaluate two machine learning methods, random forests and logic regression, and compare them to standard simple univariate linear regression, using the Genetic Analysis Workshop 17 mini-exome data. We also apply these methods after collapsing multiple rare variants within gene  ...[more]

Similar Datasets

| S-EPMC3750505 | biostudies-literature
| S-EPMC7089425 | biostudies-literature
| S-EPMC6433899 | biostudies-literature
| S-EPMC8716055 | biostudies-literature
| S-EPMC3912194 | biostudies-literature
| S-EPMC6598279 | biostudies-literature
| S-EPMC4062420 | biostudies-literature
| S-EPMC10659741 | biostudies-literature
| S-EPMC10773894 | biostudies-literature
| S-EPMC6812352 | biostudies-literature