Dataset Information

A latent unknown clustering integrating multi-omics data (LUCID) with phenotypic traits.


ABSTRACT:

Motivation

Epidemiologic, clinical and translational studies are increasingly generating multiplatform omics data. Methods that can integrate across multiple high-dimensional data types while accounting for differential patterns are critical for uncovering novel associations and underlying relevant subgroups.

Results

We propose an integrative model to estimate latent unknown clusters (LUCID) that distinguishes unique genomic, exposure and informative biomarker/omic effects while jointly estimating subgroups relevant to the outcome of interest. Simulation studies indicate that the model yields consistent estimates that reflect the true simulated values, accurately estimates subgroups and recapitulates subgroup-specific effects. We also demonstrate the use of the integrated model for future prediction of risk subgroups and phenotypes. We apply this approach in two real data applications to highlight the integration of genomic, exposure and metabolomic data.

Availability and implementation

The LUCID method is implemented through the LUCIDus R package available on CRAN (https://CRAN.R-project.org/package=LUCIDus).

Supplementary information

Supplementary materials are available at Bioinformatics online.

SUBMITTER: Peng C 

PROVIDER: S-EPMC7986585 | biostudies-literature

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC6455926 | biostudies-literature
| S-EPMC7423957 | biostudies-literature
| S-EPMC9805570 | biostudies-literature
| S-EPMC7750936 | biostudies-literature
| S-EPMC9097087 | biostudies-literature
| S-EPMC6712585 | biostudies-literature
| S-EPMC1800873 | biostudies-literature
| S-EPMC9957193 | biostudies-literature
| S-EPMC9225015 | biostudies-literature
| S-EPMC7161108 | biostudies-literature