Unknown

Dataset Information

0

Integrative Analysis of Cancer Diagnosis Studies with Composite Penalization.


ABSTRACT: In cancer diagnosis studies, high-throughput gene profiling has been extensively conducted, searching for genes whose expressions may serve as markers. Data generated from such studies have the "large d, small n" feature, with the number of genes profiled much larger than the sample size. Penalization has been extensively adopted for simultaneous estimation and marker selection. Because of small sample sizes, markers identified from the analysis of single datasets can be unsatisfactory. A cost-effective remedy is to conduct integrative analysis of multiple heterogeneous datasets. In this article, we investigate composite penalization methods for estimation and marker selection in integrative analysis. The proposed methods use the minimax concave penalty (MCP) as the outer penalty. Under the homogeneity model, the ridge penalty is adopted as the inner penalty. Under the heterogeneity model, the Lasso penalty and MCP are adopted as the inner penalty. Effective computational algorithms based on coordinate descent are developed. Numerical studies, including simulation and analysis of practical cancer datasets, show satisfactory performance of the proposed methods.

SUBMITTER: Liu J 

PROVIDER: S-EPMC3933169 | biostudies-literature | 2014 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Integrative Analysis of Cancer Diagnosis Studies with Composite Penalization.

Liu Jin J   Huang Jian J   Ma Shuangge S  

Scandinavian journal of statistics, theory and applications 20140301 1


In cancer diagnosis studies, high-throughput gene profiling has been extensively conducted, searching for genes whose expressions may serve as markers. Data generated from such studies have the "large <i>d</i>, small <i>n</i>" feature, with the number of genes profiled much larger than the sample size. Penalization has been extensively adopted for simultaneous estimation and marker selection. Because of small sample sizes, markers identified from the analysis of single datasets can be unsatisfac  ...[more]

Similar Datasets

| S-EPMC4355402 | biostudies-literature
| S-EPMC6086364 | biostudies-literature
| S-EPMC3869641 | biostudies-literature
| S-EPMC3436748 | biostudies-literature
| S-EPMC10556091 | biostudies-literature
| S-EPMC6380428 | biostudies-literature
| S-EPMC2954437 | biostudies-literature
| S-EPMC8565848 | biostudies-literature
| S-EPMC9304893 | biostudies-literature
| S-EPMC8963360 | biostudies-literature