Unknown

Dataset Information

0

Enlightening discriminative network functional modules behind Principal Component Analysis separation in differential-omic science studies.


ABSTRACT: Omic science is rapidly growing and one of the most employed techniques to explore differential patterns in omic datasets is principal component analysis (PCA). However, a method to enlighten the network of omic features that mostly contribute to the sample separation obtained by PCA is missing. An alternative is to build correlation networks between univariately-selected significant omic features, but this neglects the multivariate unsupervised feature compression responsible for the PCA sample segregation. Biologists and medical researchers often prefer effective methods that offer an immediate interpretation to complicated algorithms that in principle promise an improvement but in practice are difficult to be applied and interpreted. Here we present PC-corr: a simple algorithm that associates to any PCA segregation a discriminative network of features. Such network can be inspected in search of functional modules useful in the definition of combinatorial and multiscale biomarkers from multifaceted omic data in systems and precision biomedicine. We offer proofs of PC-corr efficacy on lipidomic, metagenomic, developmental genomic, population genetic, cancer promoteromic and cancer stem-cell mechanomic data. Finally, PC-corr is a general functional network inference approach that can be easily adopted for big data exploration in computer science and analysis of complex systems in physics.

SUBMITTER: Ciucci S 

PROVIDER: S-EPMC5347127 | biostudies-literature | 2017 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications


Omic science is rapidly growing and one of the most employed techniques to explore differential patterns in omic datasets is principal component analysis (PCA). However, a method to enlighten the network of omic features that mostly contribute to the sample separation obtained by PCA is missing. An alternative is to build correlation networks between univariately-selected significant omic features, but this neglects the multivariate unsupervised feature compression responsible for the PCA sample  ...[more]

Similar Datasets

| S-EPMC3637734 | biostudies-literature
| S-EPMC2732304 | biostudies-literature
2011-08-15 | GSE31375 | GEO
| S-EPMC2835171 | biostudies-literature
| S-EPMC4383722 | biostudies-literature
| S-EPMC4721272 | biostudies-literature
| S-EPMC3131008 | biostudies-literature
| S-EPMC5756705 | biostudies-literature
| S-EPMC4682404 | biostudies-literature
| S-EPMC5997717 | biostudies-literature