Unknown

Dataset Information

0

Graph embedding and unsupervised learning predict genomic sub-compartments from HiC chromatin interaction data.


ABSTRACT: Chromatin interaction studies can reveal how the genome is organized into spatially confined sub-compartments in the nucleus. However, accurately identifying sub-compartments from chromatin interaction data remains a challenge in computational biology. Here, we present Sub-Compartment Identifier (SCI), an algorithm that uses graph embedding followed by unsupervised learning to predict sub-compartments using Hi-C chromatin interaction data. We find that the network topological centrality and clustering performance of SCI sub-compartment predictions are superior to those of hidden Markov model (HMM) sub-compartment predictions. Moreover, using orthogonal Chromatin Interaction Analysis by in-situ Paired-End Tag Sequencing (ChIA-PET) data, we confirmed that SCI sub-compartment prediction outperforms HMM. We show that SCI-predicted sub-compartments have distinct epigenetic marks, transcriptional activities, and transcription factor enrichment. Moreover, we present a deep neural network to predict sub-compartments using epigenome, replication timing, and sequence data. Our neural network predicts more accurate sub-compartment predictions when SCI-determined sub-compartments are used as labels for training.

SUBMITTER: Ashoor H 

PROVIDER: S-EPMC7054322 | biostudies-literature | 2020 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Graph embedding and unsupervised learning predict genomic sub-compartments from HiC chromatin interaction data.

Ashoor Haitham H   Chen Xiaowen X   Rosikiewicz Wojciech W   Wang Jiahui J   Cheng Albert A   Wang Ping P   Ruan Yijun Y   Li Sheng S  

Nature communications 20200303 1


Chromatin interaction studies can reveal how the genome is organized into spatially confined sub-compartments in the nucleus. However, accurately identifying sub-compartments from chromatin interaction data remains a challenge in computational biology. Here, we present Sub-Compartment Identifier (SCI), an algorithm that uses graph embedding followed by unsupervised learning to predict sub-compartments using Hi-C chromatin interaction data. We find that the network topological centrality and clus  ...[more]

Similar Datasets

2023-09-27 | GSE216270 | GEO
| S-EPMC8493040 | biostudies-literature
| S-EPMC5899072 | biostudies-literature
| S-EPMC6883002 | biostudies-literature
| S-EPMC7514053 | biostudies-literature
| S-EPMC7931948 | biostudies-literature
| S-EPMC9848054 | biostudies-literature
| S-EPMC7325230 | biostudies-literature
| S-EPMC7300092 | biostudies-literature
| S-EPMC10190044 | biostudies-literature