Unknown

Dataset Information

0

Efficient and Accurate Multiple-Phenotype Regression Method for High Dimensional Data Considering Population Structure.


ABSTRACT: A typical genome-wide association study tests correlation between a single phenotype and each genotype one at a time. However, single-phenotype analysis might miss unmeasured aspects of complex biological networks. Analyzing many phenotypes simultaneously may increase the power to capture these unmeasured aspects and detect more variants. Several multivariate approaches aim to detect variants related to more than one phenotype, but these current approaches do not consider the effects of population structure. As a result, these approaches may result in a significant amount of false positive identifications. Here, we introduce a new methodology, referred to as GAMMA for generalized analysis of molecular variance for mixed-model analysis, which is capable of simultaneously analyzing many phenotypes and correcting for population structure. In a simulated study using data implanted with true genetic effects, GAMMA accurately identifies these true effects without producing false positives induced by population structure. In simulations with this data, GAMMA is an improvement over other methods which either fail to detect true effects or produce many false positive identifications. We further apply our method to genetic studies of yeast and gut microbiome from mice and show that GAMMA identifies several variants that are likely to have true biological mechanisms.

SUBMITTER: Joo JW 

PROVIDER: S-EPMC5161272 | biostudies-literature | 2016 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Efficient and Accurate Multiple-Phenotype Regression Method for High Dimensional Data Considering Population Structure.

Joo Jong Wha J JW   Kang Eun Yong EY   Org Elin E   Furlotte Nick N   Parks Brian B   Hormozdiari Farhad F   Lusis Aldons J AJ   Eskin Eleazar E  

Genetics 20161021 4


A typical genome-wide association study tests correlation between a single phenotype and each genotype one at a time. However, single-phenotype analysis might miss unmeasured aspects of complex biological networks. Analyzing many phenotypes simultaneously may increase the power to capture these unmeasured aspects and detect more variants. Several multivariate approaches aim to detect variants related to more than one phenotype, but these current approaches do not consider the effects of populati  ...[more]

Similar Datasets

| S-EPMC6869070 | biostudies-literature
| S-EPMC7799181 | biostudies-literature
| S-EPMC9293191 | biostudies-literature
| S-EPMC6280799 | biostudies-other
| S-EPMC6612894 | biostudies-literature
| S-EPMC4654965 | biostudies-literature
| S-EPMC3954435 | biostudies-literature
| S-EPMC7214043 | biostudies-literature
| S-EPMC7610983 | biostudies-literature
| S-EPMC5870402 | biostudies-literature