Unknown

Dataset Information

0

Accounting for tumor purity improves cancer subtype classification from DNA methylation data.


ABSTRACT:

Motivation

Tumor sample classification has long been an important task in cancer research. Classifying tumors into different subtypes greatly benefits therapeutic development and facilitates application of precision medicine on patients. In practice, solid tumor tissue samples obtained from clinical settings are always mixtures of cancer and normal cells. Thus, the data obtained from these samples are mixed signals. The 'tumor purity', or the percentage of cancer cells in cancer tissue sample, will bias the clustering results if not properly accounted for.

Results

In this article, we developed a model-based clustering method and an R function which uses DNA methylation microarray data to infer tumor subtypes with the consideration of tumor purity. Simulation studies and the analyses of The Cancer Genome Atlas data demonstrate improved results compared with existing methods.

Availability and implementation

InfiniumClust is part of R package InfiniumPurify , which is freely available from CRAN ( https://cran.r-project.org/web/packages/InfiniumPurify/index.html ).

Contact

hao.wu@emory.edu or xqzheng@shnu.edu.cn.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Zhang W 

PROVIDER: S-EPMC6410888 | biostudies-literature | 2017 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Accounting for tumor purity improves cancer subtype classification from DNA methylation data.

Zhang Weiwei W   Feng Hao H   Wu Hao H   Zheng Xiaoqi X  

Bioinformatics (Oxford, England) 20170901 17


<h4>Motivation</h4>Tumor sample classification has long been an important task in cancer research. Classifying tumors into different subtypes greatly benefits therapeutic development and facilitates application of precision medicine on patients. In practice, solid tumor tissue samples obtained from clinical settings are always mixtures of cancer and normal cells. Thus, the data obtained from these samples are mixed signals. The 'tumor purity', or the percentage of cancer cells in cancer tissue s  ...[more]

Similar Datasets

| S-EPMC6147081 | biostudies-literature
| S-EPMC7388223 | biostudies-literature
| S-EPMC5946863 | biostudies-literature
| S-EPMC9462735 | biostudies-literature
| S-EPMC4347619 | biostudies-literature
| S-EPMC4165374 | biostudies-literature
| S-EPMC10131478 | biostudies-literature
| S-EPMC5190082 | biostudies-literature
| S-EPMC6936156 | biostudies-literature