Unknown

Dataset Information

0

Enhancing the Lasso Approach for Developing a Survival Prediction Model Based on Gene Expression Data.


ABSTRACT: In the past decade, researchers in oncology have sought to develop survival prediction models using gene expression data. The least absolute shrinkage and selection operator (lasso) has been widely used to select genes that truly correlated with a patient's survival. The lasso selects genes for prediction by shrinking a large number of coefficients of the candidate genes towards zero based on a tuning parameter that is often determined by a cross-validation (CV). However, this method can pass over (or fail to identify) true positive genes (i.e., it identifies false negatives) in certain instances, because the lasso tends to favor the development of a simple prediction model. Here, we attempt to monitor the identification of false negatives by developing a method for estimating the number of true positive (TP) genes for a series of values of a tuning parameter that assumes a mixture distribution for the lasso estimates. Using our developed method, we performed a simulation study to examine its precision in estimating the number of TP genes. Additionally, we applied our method to a real gene expression dataset and found that it was able to identify genes correlated with survival that a CV method was unable to detect.

SUBMITTER: Kaneko S 

PROVIDER: S-EPMC4469838 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature

altmetric image

Publications

Enhancing the Lasso Approach for Developing a Survival Prediction Model Based on Gene Expression Data.

Kaneko Shuhei S   Hirakawa Akihiro A   Hamada Chikuma C  

Computational and mathematical methods in medicine 20150603


In the past decade, researchers in oncology have sought to develop survival prediction models using gene expression data. The least absolute shrinkage and selection operator (lasso) has been widely used to select genes that truly correlated with a patient's survival. The lasso selects genes for prediction by shrinking a large number of coefficients of the candidate genes towards zero based on a tuning parameter that is often determined by a cross-validation (CV). However, this method can pass ov  ...[more]

Similar Datasets

| S-EPMC4416884 | biostudies-literature
| S-EPMC3031034 | biostudies-other
| S-EPMC5870779 | biostudies-literature
| S-EPMC8293825 | biostudies-literature
| S-EPMC2761544 | biostudies-literature
| S-EPMC6134797 | biostudies-literature
| S-EPMC3590926 | biostudies-literature
| S-EPMC8351588 | biostudies-literature
| S-EPMC7098575 | biostudies-literature
| S-EPMC7523642 | biostudies-literature