Unknown

Dataset Information

0

Robust single-cell Hi-C clustering by convolution- and random-walk-based imputation.


ABSTRACT: Three-dimensional genome structure plays a pivotal role in gene regulation and cellular function. Single-cell analysis of genome architecture has been achieved using imaging and chromatin conformation capture methods such as Hi-C. To study variation in chromosome structure between different cell types, computational approaches are needed that can utilize sparse and heterogeneous single-cell Hi-C data. However, few methods exist that are able to accurately and efficiently cluster such data into constituent cell types. Here, we describe scHiCluster, a single-cell clustering algorithm for Hi-C contact matrices that is based on imputations using linear convolution and random walk. Using both simulated and real single-cell Hi-C data as benchmarks, scHiCluster significantly improves clustering accuracy when applied to low coverage datasets compared with existing methods. After imputation by scHiCluster, topologically associating domain (TAD)-like structures (TLSs) can be identified within single cells, and their consensus boundaries were enriched at the TAD boundaries observed in bulk cell Hi-C samples. In summary, scHiCluster facilitates visualization and comparison of single-cell 3D genomes.

SUBMITTER: Zhou J 

PROVIDER: S-EPMC6628819 | biostudies-literature | 2019 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Robust single-cell Hi-C clustering by convolution- and random-walk-based imputation.

Zhou Jingtian J   Ma Jianzhu J   Chen Yusi Y   Cheng Chuankai C   Bao Bokan B   Peng Jian J   Sejnowski Terrence J TJ   Dixon Jesse R JR   Ecker Joseph R JR  

Proceedings of the National Academy of Sciences of the United States of America 20190624 28


Three-dimensional genome structure plays a pivotal role in gene regulation and cellular function. Single-cell analysis of genome architecture has been achieved using imaging and chromatin conformation capture methods such as Hi-C. To study variation in chromosome structure between different cell types, computational approaches are needed that can utilize sparse and heterogeneous single-cell Hi-C data. However, few methods exist that are able to accurately and efficiently cluster such data into c  ...[more]

Similar Datasets

| S-EPMC9502147 | biostudies-literature
| S-EPMC11364663 | biostudies-literature
| S-EPMC6427844 | biostudies-literature
| S-EPMC8481482 | biostudies-literature
| S-EPMC7804119 | biostudies-literature
| S-EPMC9110639 | biostudies-literature
| S-EPMC10686958 | biostudies-literature
| S-EPMC5371246 | biostudies-literature
| S-EPMC10140034 | biostudies-literature
| S-EPMC5843666 | biostudies-literature