Unknown

Dataset Information

0

Tumor purity adjusted beta values improve biological interpretability of high-dimensional DNA methylation data.


ABSTRACT: A common issue affecting DNA methylation analysis in tumor tissue is the presence of a substantial amount of non-tumor methylation signal derived from the surrounding microenvironment. Although approaches for quantifying and correcting for the infiltration component have been proposed previously, we believe these have not fully addressed the issue in a comprehensive and universally applicable way. We present a multi-population framework for adjusting DNA methylation beta values on the Illumina 450/850K platform using generic purity estimates to account for non-tumor signal. Our approach also provides an indirect estimate of the aggregate methylation state of the surrounding normal tissue. Using whole exome sequencing derived purity estimates and Illumina 450K methylation array data generated by The Cancer Genome Atlas project (TCGA), we provide a demonstration of this framework in breast cancer illustrating the effect of beta correction on the aggregate methylation beta value distribution, clustering accuracy, and global methylation profiles.

SUBMITTER: Staaf J 

PROVIDER: S-EPMC9462735 | biostudies-literature | 2022

REPOSITORIES: biostudies-literature

altmetric image

Publications

Tumor purity adjusted beta values improve biological interpretability of high-dimensional DNA methylation data.

Staaf Johan J   Aine Mattias M  

PloS one 20220909 9


A common issue affecting DNA methylation analysis in tumor tissue is the presence of a substantial amount of non-tumor methylation signal derived from the surrounding microenvironment. Although approaches for quantifying and correcting for the infiltration component have been proposed previously, we believe these have not fully addressed the issue in a comprehensive and universally applicable way. We present a multi-population framework for adjusting DNA methylation beta values on the Illumina 4  ...[more]

Similar Datasets

| S-EPMC5946863 | biostudies-literature
| S-EPMC4165374 | biostudies-literature
| S-EPMC6410888 | biostudies-literature
2014-07-08 | E-GEOD-56712 | biostudies-arrayexpress
2014-07-08 | GSE56712 | GEO
| S-EPMC5003491 | biostudies-literature
| S-EPMC6147081 | biostudies-literature
| S-EPMC9981214 | biostudies-literature
| S-EPMC10470306 | biostudies-literature
| S-EPMC10985003 | biostudies-literature