Unknown

Dataset Information

0

Gene-environment interaction identification via penalized robust divergence.


ABSTRACT: In high-throughput cancer studies, gene-environment interactions associated with outcomes have important implications. Some commonly adopted identification methods do not respect the "main effect, interaction" hierarchical structure. In addition, they can be challenged by data contamination and/or long-tailed distributions, which are not uncommon. In this article, robust methods based on γ$\gamma$ -divergence and density power divergence are proposed to accommodate contaminated data/long-tailed distributions. A hierarchical sparse group penalty is adopted for regularized estimation and selection and can identify important gene-environment interactions and respect the "main effect, interaction" hierarchical structure. The proposed methods are implemented using an effective group coordinate descent algorithm. Simulation shows that when contamination occurs, the proposed methods can significantly outperform the existing alternatives with more accurate identification. The proposed approach is applied to the analysis of The Cancer Genome Atlas (TCGA) triple-negative breast cancer data and Gene Environment Association Studies (GENEVA) Type 2 Diabetes data.

SUBMITTER: Ren M 

PROVIDER: S-EPMC9386692 | biostudies-literature | 2022 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Gene-environment interaction identification via penalized robust divergence.

Ren Mingyang M   Zhang Sanguo S   Ma Shuangge S   Zhang Qingzhao Q  

Biometrical journal. Biometrische Zeitschrift 20211101 3


In high-throughput cancer studies, gene-environment interactions associated with outcomes have important implications. Some commonly adopted identification methods do not respect the "main effect, interaction" hierarchical structure. In addition, they can be challenged by data contamination and/or long-tailed distributions, which are not uncommon. In this article, robust methods based on γ$\gamma$ -divergence and density power divergence are proposed to accommodate contaminated data/long-tailed  ...[more]

Similar Datasets

| S-EPMC4356211 | biostudies-literature
| S-EPMC4715555 | biostudies-literature
| S-EPMC6736719 | biostudies-literature
| S-EPMC5860584 | biostudies-literature
| S-EPMC8545291 | biostudies-literature
| S-EPMC5499546 | biostudies-literature
| S-EPMC4157149 | biostudies-literature
| S-EPMC2830581 | biostudies-literature
| S-EPMC6335188 | biostudies-literature
| S-EPMC2851818 | biostudies-literature