Unknown

Dataset Information

0

Spectral identification of topological domains.


ABSTRACT:

Motivation

Topological domains have been proposed as the backbone of interphase chromosome structure. They are regions of high local contact frequency separated by sharp boundaries. Genes within a domain often have correlated transcription. In this paper, we present a computational efficient spectral algorithm to identify topological domains from chromosome conformation data (Hi-C data). We consider the genome as a weighted graph with vertices defined by loci on a chromosome and the edge weights given by interaction frequency between two loci. Laplacian-based graph segmentation is then applied iteratively to obtain the domains at the given compactness level. Comparison with algorithms in the literature shows the advantage of the proposed strategy.

Results

An efficient algorithm is presented to identify topological domains from the Hi-C matrix.

Availability and implementation

The Matlab source code and illustrative examples are available at http://bionetworks.ccmb.med.umich.edu/

Contact

: indikar@med.umich.edu

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Chen J 

PROVIDER: S-EPMC4937202 | biostudies-literature | 2016 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Spectral identification of topological domains.

Chen Jie J   Hero Alfred O AO   Rajapakse Indika I  

Bioinformatics (Oxford, England) 20160505 14


<h4>Motivation</h4>Topological domains have been proposed as the backbone of interphase chromosome structure. They are regions of high local contact frequency separated by sharp boundaries. Genes within a domain often have correlated transcription. In this paper, we present a computational efficient spectral algorithm to identify topological domains from chromosome conformation data (Hi-C data). We consider the genome as a weighted graph with vertices defined by loci on a chromosome and the edge  ...[more]

Similar Datasets

| S-EPMC4019371 | biostudies-literature
| S-EPMC1933582 | biostudies-literature
| S-EPMC3259042 | biostudies-literature
| S-EPMC3941104 | biostudies-literature
2020-12-01 | GSE123245 | GEO
| S-EPMC7508461 | biostudies-literature
| S-EPMC3873163 | biostudies-literature
| S-EPMC5961096 | biostudies-literature
2013-06-07 | E-GEOD-45457 | biostudies-arrayexpress
2015-05-08 | E-GEOD-66383 | biostudies-arrayexpress