Unknown

Dataset Information

0

High-dimensional genomic data bias correction and data integration using MANCIE.


ABSTRACT: High-dimensional genomic data analysis is challenging due to noises and biases in high-throughput experiments. We present a computational method matrix analysis and normalization by concordant information enhancement (MANCIE) for bias correction and data integration of distinct genomic profiles on the same samples. MANCIE uses a Bayesian-supported principal component analysis-based approach to adjust the data so as to achieve better consistency between sample-wise distances in the different profiles. MANCIE can improve tissue-specific clustering in ENCODE data, prognostic prediction in Molecular Taxonomy of Breast Cancer International Consortium and The Cancer Genome Atlas data, copy number and expression agreement in Cancer Cell Line Encyclopedia data, and has broad applications in cross-platform, high-dimensional data integration.

SUBMITTER: Zang C 

PROVIDER: S-EPMC4833864 | biostudies-other | 2016 Apr

REPOSITORIES: biostudies-other

altmetric image

Publications


High-dimensional genomic data analysis is challenging due to noises and biases in high-throughput experiments. We present a computational method matrix analysis and normalization by concordant information enhancement (MANCIE) for bias correction and data integration of distinct genomic profiles on the same samples. MANCIE uses a Bayesian-supported principal component analysis-based approach to adjust the data so as to achieve better consistency between sample-wise distances in the different prof  ...[more]

Similar Datasets

| S-EPMC3044426 | biostudies-literature
| S-EPMC4239429 | biostudies-literature
| S-EPMC8215925 | biostudies-literature
2011-05-25 | GSE29427 | GEO
2011-05-25 | E-GEOD-29427 | biostudies-arrayexpress
2014-10-01 | GSE51981 | GEO
| S-EPMC4454613 | biostudies-literature
| S-EPMC3159482 | biostudies-literature
2014-10-01 | E-GEOD-51981 | biostudies-arrayexpress