Dataset Information

Identification of cancer genomic markers via integrative sparse boosting.

ABSTRACT: In high-throughput cancer genomic studies, markers identified from the analysis of single data sets often suffer a lack of reproducibility because of the small sample sizes. An ideal solution is to conduct large-scale prospective studies, which are extremely expensive and time consuming. A cost-effective remedy is to pool data from multiple comparable studies and conduct integrative analysis. Integrative analysis of multiple data sets is challenging because of the high dimensionality of genomic measurements and heterogeneity among studies. In this article, we propose a sparse boosting approach for marker identification in integrative analysis of multiple heterogeneous cancer diagnosis studies with gene expression measurements. The proposed approach can effectively accommodate the heterogeneity among multiple studies and identify markers with consistent effects across studies. Simulation shows that the proposed approach has satisfactory identification results and outperforms alternatives including an intensity approach and meta-analysis. The proposed approach is used to identify markers of pancreatic cancer and liver cancer.

SUBMITTER: Huang Y

PROVIDER: S-EPMC3577103 | biostudies-literature | 2012 Jul

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Identification of cancer genomic markers via integrative sparse boosting.

Huang Yuan Y Huang Jian J Shia Ben-Chang BC Ma Shuangge S

Biostatistics (Oxford, England) 20111031 3

In high-throughput cancer genomic studies, markers identified from the analysis of single data sets often suffer a lack of reproducibility because of the small sample sizes. An ideal solution is to conduct large-scale prospective studies, which are extremely expensive and time consuming. A cost-effective remedy is to pool data from multiple comparable studies and conduct integrative analysis. Integrative analysis of multiple data sets is challenging because of the high dimensionality of genomic ...[more]

PMID: 22045909

Dataset Information

Identification of cancer genomic markers via integrative sparse boosting.

Publications

Identification of cancer genomic markers via integrative sparse boosting.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

An integrative sparse boosting analysis of cancer genomic commonality and difference.
| S-EPMC7471599 | biostudies-literature

Pan-cancer integrative histology-genomic analysis via multimodal deep learning.
| S-EPMC10397370 | biostudies-literature

Sparse group penalized integrative analysis of multiple cancer prognosis datasets.
| S-EPMC4090387 | biostudies-literature

Identification of relevant subtypes via preweighted sparse clustering.
| S-EPMC5959300 | biostudies-literature

Identification of Molecular Markers Associated with Prostate Cancer Subtypes: An Integrative Bioinformatics Approach.
| S-EPMC10813078 | biostudies-literature

Systematic identification of genomic markers of drug sensitivity in cancer cells.
| S-EPMC3349233 | biostudies-literature

Pan-cancer identification of clinically relevant genomic subtypes using outcome-weighted integrative clustering.
| S-EPMC7716509 | biostudies-literature

Robust semiparametric gene-environment interaction analysis using sparse boosting.
| S-EPMC6736719 | biostudies-literature

Integrative genomic and transcriptomic analysis of genetic markers in Dupuytren's disease.
| S-EPMC6624179 | biostudies-literature

Integrative epigenomic and genomic filtering for methylation markers in hepatocellular carcinomas.
| S-EPMC4460673 | biostudies-literature