Unknown

Dataset Information

0

An Integrated Approach for Identifying Molecular Subtypes in Human Colon Cancer Using Gene Expression Data.


ABSTRACT: Identifying molecular subtypes of colorectal cancer (CRC) may allow for more rational, patient-specific treatment. Various studies have identified molecular subtypes for CRC using gene expression data, but they are inconsistent and further research is necessary. From a methodological point of view, a progressive approach is needed to identify molecular subtypes in human colon cancer using gene expression data. We propose an approach to identify the molecular subtypes of colon cancer that integrates denoising by the Bayesian robust principal component analysis (BRPCA) algorithm, hierarchical clustering by the directed bubble hierarchical tree (DBHT) algorithm, and feature gene selection by an improved differential evolution based feature selection method (DEFSW) algorithm. In this approach, the normal samples being completely and exclusively clustered into one class is considered to be the standard of reasonable clustering subtypes, and the feature selection pays attention to imbalances of samples among subtypes. With this approach, we identified the molecular subtypes of colon cancer on the mRNA gene expression dataset of 153 colon cancer samples and 19 normal control samples of the Cancer Genome Atlas (TCGA) project. The colon cancer was clustered into 7 subtypes with 44 feature genes. Our approach could identify finer subtypes of colon cancer with fewer feature genes than the other two recent studies and exhibits a generic methodology that might be applied to identify the subtypes of other cancers.

SUBMITTER: Wang WH 

PROVIDER: S-EPMC6115727 | biostudies-literature | 2018 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

An Integrated Approach for Identifying Molecular Subtypes in Human Colon Cancer Using Gene Expression Data.

Wang Wen-Hui WH   Xie Ting-Yan TY   Xie Guang-Lei GL   Ren Zhong-Lu ZL   Li Jin-Ming JM  

Genes 20180802 8


Identifying molecular subtypes of colorectal cancer (CRC) may allow for more rational, patient-specific treatment. Various studies have identified molecular subtypes for CRC using gene expression data, but they are inconsistent and further research is necessary. From a methodological point of view, a progressive approach is needed to identify molecular subtypes in human colon cancer using gene expression data. We propose an approach to identify the molecular subtypes of colon cancer that integra  ...[more]

Similar Datasets

| S-EPMC8215925 | biostudies-literature
| S-EPMC5972664 | biostudies-literature
| S-EPMC3660251 | biostudies-literature
| S-EPMC4873016 | biostudies-literature
| S-EPMC8024377 | biostudies-literature
| S-EPMC4514654 | biostudies-literature
| S-EPMC2639007 | biostudies-literature
| S-EPMC4818025 | biostudies-literature
| S-EPMC5998123 | biostudies-literature
| S-EPMC3794600 | biostudies-literature