Unknown

Dataset Information

0

Genomic data integration by WON-PARAFAC identifies interpretable factors for predicting drug-sensitivity in vivo.


ABSTRACT: Integrative analyses that summarize and link molecular data to treatment sensitivity are crucial to capture the biological complexity which is essential to further precision medicine. We introduce Weighted Orthogonal Nonnegative parallel factor analysis (WON-PARAFAC), a data integration method that identifies sparse and interpretable factors. WON-PARAFAC summarizes the GDSC1000 cell line compendium in 130 factors. We interpret the factors based on their association with recurrent molecular alterations, pathway enrichment, cancer type, and drug-response. Crucially, the cell line derived factors capture the majority of the relevant biological variation in Patient-Derived Xenograft (PDX) models, strongly suggesting our factors capture invariant and generalizable aspects of cancer biology. Furthermore, drug response in cell lines is better and more consistently translated to PDXs using factor-based predictors as compared to raw feature-based predictors. WON-PARAFAC efficiently summarizes and integrates multiway high-dimensional genomic data and enhances translatability of drug response prediction from cell lines to patient-derived xenografts.

SUBMITTER: Kim Y 

PROVIDER: S-EPMC6834616 | biostudies-literature | 2019 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Genomic data integration by WON-PARAFAC identifies interpretable factors for predicting drug-sensitivity in vivo.

Kim Yongsoo Y   Bismeijer Tycho T   Zwart Wilbert W   Wessels Lodewyk F A LFA   Vis Daniel J DJ  

Nature communications 20191106 1


Integrative analyses that summarize and link molecular data to treatment sensitivity are crucial to capture the biological complexity which is essential to further precision medicine. We introduce Weighted Orthogonal Nonnegative parallel factor analysis (WON-PARAFAC), a data integration method that identifies sparse and interpretable factors. WON-PARAFAC summarizes the GDSC1000 cell line compendium in 130 factors. We interpret the factors based on their association with recurrent molecular alter  ...[more]

Similar Datasets

| S-EPMC6300887 | biostudies-other
| S-EPMC9957267 | biostudies-literature
| S-EPMC6555538 | biostudies-literature
| S-EPMC6096361 | biostudies-other
| S-EPMC3018816 | biostudies-other
| S-EPMC10575865 | biostudies-literature
| S-EPMC6889397 | biostudies-literature
| S-EPMC8837212 | biostudies-literature
2023-08-23 | GSE232173 | GEO