Dataset Information

A Robust and Powerful Set-Valued Approach to Rare Variant Association Analyses of Secondary Traits in Case-Control Sequencing Studies.

ABSTRACT: In many case-control genome-wide association studies (GWAS) and next-generation sequencing (NGS) studies, extensive data are available on secondary traits that may correlate with the primary disease and share common genetic variants with it. Investigating these secondary traits can provide critical insights into disease etiology or pathology and enhance the GWAS or NGS results. Methods based on logistic regression (LG) were developed for this purpose. However, when identifying rare variants (RVs), certain inadequacies in the LG models and algorithmic instability can cause severely inflated type I error and significant loss of power when the two traits are correlated and the RV is associated with the disease, especially at stringent significance levels. To address this issue, we propose a novel set-valued (SV) method that models a binary trait by dichotomization of an underlying continuous variable and incorporates this into the genetic association model as a critical component. Extensive simulations and an analysis of seven secondary traits in a GWAS of benign ethnic neutropenia show that the SV method consistently controls type I error well at stringent significance levels, has larger power than the LG-based methods, and is robust to the effect pattern of the genetic variant (risk or protective), to variant frequency (rare or common), to disease prevalence (rare or common), and to trait distributions. Because of these advantages, we strongly recommend that the SV method be employed instead of the LG-based methods for secondary trait analyses in case-control sequencing studies.
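The abstract's central modeling idea, a binary trait obtained by dichotomizing an underlying continuous variable, can be illustrated with a small simulation. This is only a sketch under assumed details: the linear latent model, the standard normal noise, the threshold, and the names `simulate_binary_trait`, `beta`, and `threshold` are hypothetical and are not taken from the paper, whose actual SV estimator is not described on this page.

```python
import random


def simulate_binary_trait(genotypes, beta, threshold=0.0, seed=0):
    """Simulate a binary trait by thresholding a latent continuous variable.

    Assumed (hypothetical) model: latent = beta * genotype + noise,
    with noise drawn from a standard normal distribution.
    The observed trait is 1 when the latent value exceeds `threshold`.
    """
    rng = random.Random(seed)  # local generator keeps the sketch reproducible
    traits = []
    for g in genotypes:
        latent = beta * g + rng.gauss(0.0, 1.0)
        traits.append(1 if latent > threshold else 0)
    return traits


if __name__ == "__main__":
    # Minor-allele counts (0, 1, or 2) for a handful of hypothetical subjects.
    genotypes = [0, 0, 1, 0, 2, 0, 1, 0]
    print(simulate_binary_trait(genotypes, beta=0.8))
```

Only the dichotomized values are observed in such a setup; the latent variable itself is never seen, which is why it has to be modeled rather than measured.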

SUBMITTER: Kang G

PROVIDER: S-EPMC5340322 | biostudies-literature | 2017 Mar

REPOSITORIES: biostudies-literature


Publications

A Robust and Powerful Set-Valued Approach to Rare Variant Association Analyses of Secondary Traits in Case-Control Sequencing Studies.

Kang G, Bi W, Zhang H, Pounds S, Cheng C, Shete S, Zou F, Zhao Y, Zhang JF, Yue W

Genetics, 2016-12-30, issue 3


PMID: 28040743

Similar Datasets

Project description: Choosing a home can be difficult for people without much experience, because many factors must be weighed and several of them conflict with one another. Because such decisions are hard, individuals spend more time on them and may still choose poorly. A computational approach can help overcome these residence selection issues: decision support systems allow inexperienced people to make decisions of expert quality. This article describes the empirical procedure used to construct a decision-support system for selecting a residence. The main goal of the study is to build a decision-support system for residential preference based on the weighted product mechanism. The house short-listing estimate is based on several key requirements derived from interaction between the researchers and experts. The results show that the normalized product strategy can rank the available alternatives to help individuals choose the best option. The interval-valued fuzzy hypersoft set (IVFHS-set) is a broader variant of the fuzzy soft set that resolves the limitations of the fuzzy soft set through the use of a multi-argument approximation operator. This operator maps sub-parametric tuples into the power set of the universe, and it emphasizes the partition of every attribute into a disjoint attribute-valued set. These characteristics make it a new mathematical tool for handling problems involving uncertainty, which makes the decision-making process more effective and efficient. Furthermore, the traditional TOPSIS technique, a multi-criteria decision-making strategy, is discussed concisely. A new decision-making strategy, "OOPCS", is constructed by modifying TOPSIS for fuzzy hypersoft sets in interval settings. The proposed strategy is applied to a real-world multi-criteria decision-making scenario, ranking the alternatives to check and demonstrate its efficiency and effectiveness.
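The weighted product ranking mentioned in this description can be sketched in its classical, crisp form. This is a minimal illustration only, not the article's fuzzy hypersoft or "OOPCS" procedure. The function names, the criteria, the weights, and the house data are all hypothetical.

```python
def weighted_product_scores(alternatives, weights, cost_criteria=()):
    """Score each alternative with the classical weighted product model.

    Each criterion value is raised to its normalized weight; criteria listed
    in `cost_criteria` (lower is better) use a negative exponent.
    All criterion values must be positive.
    """
    total = sum(weights.values())
    scores = {}
    for name, values in alternatives.items():
        score = 1.0
        for criterion, weight in weights.items():
            exponent = weight / total  # normalize so weights sum to 1
            if criterion in cost_criteria:
                exponent = -exponent
            score *= values[criterion] ** exponent
        scores[name] = score
    return scores


def rank_alternatives(scores):
    """Return alternative names ordered from best to worst score."""
    return sorted(scores, key=scores.get, reverse=True)


if __name__ == "__main__":
    # Hypothetical houses: price is a cost criterion, area is a benefit.
    houses = {
        "house_a": {"price": 250.0, "area": 120.0},
        "house_b": {"price": 300.0, "area": 150.0},
        "house_c": {"price": 200.0, "area": 90.0},
    }
    weights = {"price": 0.6, "area": 0.4}
    scores = weighted_product_scores(houses, weights, cost_criteria=("price",))
    print(rank_alternatives(scores))
```

Because the scores are products of powers, the ranking does not depend on the units of each criterion, which is one common reason for preferring this model over a weighted sum.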
