Unknown

Dataset Information

0

Machine learning polymer models of three-dimensional chromatin organization in human lymphoblastoid cells.


ABSTRACT: We present machine learning models of human genome three-dimensional structure that combine one dimensional (linear) sequence specificity, epigenomic information, and transcription factor binding profiles, with the polymer-based biophysical simulations in order to explain the extensive long-range chromatin looping observed in ChIA-PET experiments for lymphoblastoid cells. Random Forest, Gradient Boosting Machine (GBM), and Deep Learning models were constructed and evaluated, when predicting high-resolution interactions within Topologically Associating Domains (TADs). The predicted interactions are consistent with the experimental long-read ChIA-PET interactions mediated by CTCF and RNAPOL2 for GM12878 cell line. The contribution of sequence information and chromatin state defined by epigenomic features to the prediction task is analyzed and reported, when using them separately and combined. Furthermore, we design three-dimensional models of chromatin contact domains (CCDs) using real (ChIA-PET) and predicted looping interactions. Initial results show a similarity between both types of 3D computational models (constructed from experimental or predicted interactions). This observation confirms the association between genome sequence, epigenomic and transcription factor profiles, and three-dimensional interactions.

SUBMITTER: Al Bkhetan Z 

PROVIDER: S-EPMC6800180 | biostudies-literature | 2019 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Machine learning polymer models of three-dimensional chromatin organization in human lymphoblastoid cells.

Al Bkhetan Ziad Z   Kadlof Michal M   Kraft Agnieszka A   Plewczynski Dariusz D  

Methods (San Diego, Calif.) 20190307


We present machine learning models of human genome three-dimensional structure that combine one dimensional (linear) sequence specificity, epigenomic information, and transcription factor binding profiles, with the polymer-based biophysical simulations in order to explain the extensive long-range chromatin looping observed in ChIA-PET experiments for lymphoblastoid cells. Random Forest, Gradient Boosting Machine (GBM), and Deep Learning models were constructed and evaluated, when predicting high  ...[more]

Similar Datasets

| S-EPMC8246089 | biostudies-literature
| S-EPMC6586364 | biostudies-literature
| S-EPMC11331798 | biostudies-literature
| S-EPMC10086523 | biostudies-literature
| S-EPMC4665721 | biostudies-literature
| S-EPMC8142020 | biostudies-literature
| S-EPMC7164942 | biostudies-literature
| S-EPMC7759288 | biostudies-literature
2023-02-02 | GSE214047 | GEO
| S-EPMC8242018 | biostudies-literature