Unknown

Dataset Information

0

MethylNet: an automated and modular deep learning approach for DNA methylation analysis.


ABSTRACT: BACKGROUND:DNA methylation (DNAm) is an epigenetic regulator of gene expression programs that can be altered by environmental exposures, aging, and in pathogenesis. Traditional analyses that associate DNAm alterations with phenotypes suffer from multiple hypothesis testing and multi-collinearity due to the high-dimensional, continuous, interacting and non-linear nature of the data. Deep learning analyses have shown much promise to study disease heterogeneity. DNAm deep learning approaches have not yet been formalized into user-friendly frameworks for execution, training, and interpreting models. Here, we describe MethylNet, a DNAm deep learning method that can construct embeddings, make predictions, generate new data, and uncover unknown heterogeneity with minimal user supervision. RESULTS:The results of our experiments indicate that MethylNet can study cellular differences, grasp higher order information of cancer sub-types, estimate age and capture factors associated with smoking in concordance with known differences. CONCLUSION:The ability of MethylNet to capture nonlinear interactions presents an opportunity for further study of unknown disease, cellular heterogeneity and aging processes.

SUBMITTER: Levy JJ 

PROVIDER: S-EPMC7076991 | biostudies-literature | 2020 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

MethylNet: an automated and modular deep learning approach for DNA methylation analysis.

Levy Joshua J JJ   Titus Alexander J AJ   Petersen Curtis L CL   Chen Youdinghuan Y   Salas Lucas A LA   Christensen Brock C BC  

BMC bioinformatics 20200317 1


<h4>Background</h4>DNA methylation (DNAm) is an epigenetic regulator of gene expression programs that can be altered by environmental exposures, aging, and in pathogenesis. Traditional analyses that associate DNAm alterations with phenotypes suffer from multiple hypothesis testing and multi-collinearity due to the high-dimensional, continuous, interacting and non-linear nature of the data. Deep learning analyses have shown much promise to study disease heterogeneity. DNAm deep learning approache  ...[more]

Similar Datasets

| S-EPMC10703017 | biostudies-literature
| S-EPMC6692109 | biostudies-literature
| S-EPMC6826785 | biostudies-literature
| S-EPMC8656222 | biostudies-literature
| S-EPMC7880197 | biostudies-literature
| S-EPMC7290553 | biostudies-literature
2020-08-12 | GSE149225 | GEO
| S-EPMC5408780 | biostudies-literature
| S-EPMC7706884 | biostudies-literature
| S-EPMC9158789 | biostudies-literature