Unknown

Dataset Information

0

Dissection of gene expression datasets into clinically relevant interaction signatures via high-dimensional correlation maximization.


ABSTRACT: Gene expression is controlled by many simultaneous interactions, frequently measured collectively in biology and medicine by high-throughput technologies. It is a highly challenging task to infer from these data the generating effects and cooperating genes. Here, we present an unsupervised hypothesis-generating learning concept termed signal dissection by correlation maximization (SDCM) that dissects large high-dimensional datasets into signatures. Each signature captures a particular signal pattern that was consistently observed for multiple genes and samples, likely caused by the same underlying interaction. A key difference to other methods is our flexible nonlinear signal superposition model, combined with a precise regression technique. Analyzing gene expression of diffuse large B-cell lymphoma, our method discovers previously unidentified signatures that reveal significant differences in patient survival. These signatures are more predictive than those from various methods used for comparison and robustly validate across technological platforms. This implies highly specific extraction of clinically relevant gene interactions.

SUBMITTER: Grau M 

PROVIDER: S-EPMC6883077 | biostudies-literature | 2019 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Dissection of gene expression datasets into clinically relevant interaction signatures via high-dimensional correlation maximization.

Grau Michael M   Lenz Georg G   Lenz Peter P  

Nature communications 20191128 1


Gene expression is controlled by many simultaneous interactions, frequently measured collectively in biology and medicine by high-throughput technologies. It is a highly challenging task to infer from these data the generating effects and cooperating genes. Here, we present an unsupervised hypothesis-generating learning concept termed signal dissection by correlation maximization (SDCM) that dissects large high-dimensional datasets into signatures. Each signature captures a particular signal pat  ...[more]

Similar Datasets

| S-EPMC6225773 | biostudies-literature
| S-EPMC7999182 | biostudies-literature
| S-EPMC5371730 | biostudies-literature
| S-EPMC7305156 | biostudies-literature
| S-EPMC7731964 | biostudies-literature
| S-EPMC7077624 | biostudies-literature
| S-EPMC4856574 | biostudies-literature
| S-ECPF-GEOD-43358 | biostudies-other
| S-EPMC6330328 | biostudies-literature
| S-EPMC9310611 | biostudies-literature