Unknown

Dataset Information

0

A unified sparse representation for sequence variant identification for complex traits.


ABSTRACT: Joint adjustment of cryptic relatedness and population structure is necessary to reduce bias in DNA sequence analysis; however, existent sparse regression methods model these two confounders separately. Incorporating prior biological information has great potential to enhance statistical power but such information is often overlooked in many existent sparse regression models. We developed a unified sparse regression (USR) to incorporate prior information and jointly adjust for cryptic relatedness, population structure, and other environmental covariates. Our USR models cryptic relatedness as a random effect and population structure as fixed effect, and utilize the weighted penalties to incorporate prior knowledge. As demonstrated by extensive simulations, our USR algorithm can discover more true causal variants and maintain a lower false discovery rate than do several commonly used feature selection methods. It can handle both rare and common variants simultaneously. Applying our USR algorithm to DNA sequence data of Mexican Americans from GAW18, we replicated three hypertension pathways, demonstrating the effectiveness in identifying susceptibility genetic variants.

SUBMITTER: Cao S 

PROVIDER: S-EPMC4236284 | biostudies-literature | 2014 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

A unified sparse representation for sequence variant identification for complex traits.

Cao Shaolong S   Qin Huaizhen H   Deng Hong-Wen HW   Wang Yu-Ping YP  

Genetic epidemiology 20140904 8


Joint adjustment of cryptic relatedness and population structure is necessary to reduce bias in DNA sequence analysis; however, existent sparse regression methods model these two confounders separately. Incorporating prior biological information has great potential to enhance statistical power but such information is often overlooked in many existent sparse regression models. We developed a unified sparse regression (USR) to incorporate prior information and jointly adjust for cryptic relatednes  ...[more]

Similar Datasets

| S-EPMC7067682 | biostudies-literature
| S-EPMC5006306 | biostudies-literature
| S-EPMC4481842 | biostudies-literature
| S-EPMC5561231 | biostudies-other
| S-EPMC3124899 | biostudies-literature
| S-EPMC3477196 | biostudies-other
| S-EPMC5385541 | biostudies-literature
| S-EPMC4815578 | biostudies-literature
| S-EPMC6246171 | biostudies-literature