Unknown

Dataset Information

0

MixOmics: An R package for 'omics feature selection and multiple data integration.


ABSTRACT: The advent of high throughput technologies has led to a wealth of publicly available 'omics data coming from different sources, such as transcriptomics, proteomics, metabolomics. Combining such large-scale biological data sets can lead to the discovery of important biological insights, provided that relevant information can be extracted in a holistic manner. Current statistical approaches have been focusing on identifying small subsets of molecules (a 'molecular signature') to explain or predict biological conditions, but mainly for a single type of 'omics. In addition, commonly used methods are univariate and consider each biological feature independently. We introduce mixOmics, an R package dedicated to the multivariate analysis of biological data sets with a specific focus on data exploration, dimension reduction and visualisation. By adopting a systems biology approach, the toolkit provides a wide range of methods that statistically integrate several data sets at once to probe relationships between heterogeneous 'omics data sets. Our recent methods extend Projection to Latent Structure (PLS) models for discriminant analysis, for data integration across multiple 'omics data or across independent studies, and for the identification of molecular signatures. We illustrate our latest mixOmics integrative frameworks for the multivariate analyses of 'omics data available from the package.

SUBMITTER: Rohart F 

PROVIDER: S-EPMC5687754 | biostudies-literature | 2017 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

mixOmics: An R package for 'omics feature selection and multiple data integration.

Rohart Florian F   Gautier Benoît B   Singh Amrit A   Lê Cao Kim-Anh KA  

PLoS computational biology 20171103 11


The advent of high throughput technologies has led to a wealth of publicly available 'omics data coming from different sources, such as transcriptomics, proteomics, metabolomics. Combining such large-scale biological data sets can lead to the discovery of important biological insights, provided that relevant information can be extracted in a holistic manner. Current statistical approaches have been focusing on identifying small subsets of molecules (a 'molecular signature') to explain or predict  ...[more]

Similar Datasets

| S-EPMC5738110 | biostudies-literature
| S-EPMC6792475 | biostudies-literature
| S-EPMC8288516 | biostudies-literature
| S-EPMC6532608 | biostudies-literature
| S-EPMC5181536 | biostudies-literature
| S-EPMC7493161 | biostudies-literature
| S-EPMC4172658 | biostudies-literature
| S-EPMC6773870 | biostudies-literature
| S-EPMC8021195 | biostudies-literature
| S-EPMC2847380 | biostudies-literature