Unknown

Dataset Information

0

A Pathway-Based Kernel Boosting Method for Sample Classification Using Genomic Data.


ABSTRACT: The analysis of cancer genomic data has long suffered "the curse of dimensionality." Sample sizes for most cancer genomic studies are a few hundreds at most while there are tens of thousands of genomic features studied. Various methods have been proposed to leverage prior biological knowledge, such as pathways, to more effectively analyze cancer genomic data. Most of the methods focus on testing marginal significance of the associations between pathways and clinical phenotypes. They can identify informative pathways but do not involve predictive modeling. In this article, we propose a Pathway-based Kernel Boosting (PKB) method for integrating gene pathway information for sample classification, where we use kernel functions calculated from each pathway as base learners and learn the weights through iterative optimization of the classification loss function. We apply PKB and several competing methods to three cancer studies with pathological and clinical information, including tumor grade, stage, tumor sites and metastasis status. Our results show that PKB outperforms other methods and identifies pathways relevant to the outcome variables.

SUBMITTER: Zeng L 

PROVIDER: S-EPMC6770716 | biostudies-literature | 2019 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

A Pathway-Based Kernel Boosting Method for Sample Classification Using Genomic Data.

Zeng Li L   Yu Zhaolong Z   Zhao Hongyu H  

Genes 20190831 9


The analysis of cancer genomic data has long suffered "the curse of dimensionality." Sample sizes for most cancer genomic studies are a few hundreds at most while there are tens of thousands of genomic features studied. Various methods have been proposed to leverage prior biological knowledge, such as pathways, to more effectively analyze cancer genomic data. Most of the methods focus on testing marginal significance of the associations between pathways and clinical phenotypes. They can identify  ...[more]

Similar Datasets

| S-EPMC4105478 | biostudies-literature
| S-EPMC5530424 | biostudies-literature
| S-EPMC10135911 | biostudies-literature
| S-EPMC3205051 | biostudies-literature
| S-EPMC1821044 | biostudies-literature
| S-EPMC1087831 | biostudies-literature
| S-EPMC5098425 | biostudies-literature
| S-EPMC9235505 | biostudies-literature
| S-EPMC3283887 | biostudies-other
| S-EPMC5933939 | biostudies-literature