Unknown

Dataset Information

0

ScHi-CSim: a flexible simulator that generates high-fidelity single-cell Hi-C data for benchmarking.


ABSTRACT: Single-cell Hi-C technology provides an unprecedented opportunity to reveal chromatin structure in individual cells. However, high sequencing cost impedes the generation of biological Hi-C data with high sequencing depths and multiple replicates for downstream analysis. Here, we developed a single-cell Hi-C simulator (scHi-CSim) that generates high-fidelity data for benchmarking. scHi-CSim merges neighboring cells to overcome the sparseness of data, samples interactions in distance-stratified chromosomes to maintain the heterogeneity of single cells, and estimates the empirical distribution of restriction fragments to generate simulated data. We demonstrated that scHi-CSim can generate high-fidelity data by comparing the performance of single-cell clustering and detection of chromosomal high-order structures with raw data. Furthermore, scHi-CSim is flexible to change sequencing depth and the number of simulated replicates. We showed that increasing sequencing depth could improve the accuracy of detecting topologically associating domains. We also used scHi-CSim to generate a series of simulated datasets with different sequencing depths to benchmark scHi-C clustering methods.

SUBMITTER: Fan S 

PROVIDER: S-EPMC10308180 | biostudies-literature | 2023 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

scHi-CSim: a flexible simulator that generates high-fidelity single-cell Hi-C data for benchmarking.

Fan Shichen S   Dang Dachang D   Ye Yusen Y   Zhang Shao-Wu SW   Gao Lin L   Zhang Shihua S  

Journal of molecular cell biology 20230601 1


Single-cell Hi-C technology provides an unprecedented opportunity to reveal chromatin structure in individual cells. However, high sequencing cost impedes the generation of biological Hi-C data with high sequencing depths and multiple replicates for downstream analysis. Here, we developed a single-cell Hi-C simulator (scHi-CSim) that generates high-fidelity data for benchmarking. scHi-CSim merges neighboring cells to overcome the sparseness of data, samples interactions in distance-stratified ch  ...[more]

Similar Datasets

| S-EPMC8147071 | biostudies-literature
| S-EPMC8136837 | biostudies-literature
| S-EPMC11770341 | biostudies-literature
| S-EPMC11328424 | biostudies-literature
| S-EPMC8748196 | biostudies-literature
| S-EPMC8337000 | biostudies-literature
| S-EPMC10359091 | biostudies-literature
| S-EPMC11163381 | biostudies-literature
| S-EPMC9792779 | biostudies-literature
| S-EPMC8162587 | biostudies-literature