Unknown

Dataset Information

0

PreciseTAD: A transfer learning framework for 3D domain boundary prediction at base-pair resolution.


ABSTRACT:

Motivation

Chromosome conformation capture technologies (Hi-C) revealed extensive DNA folding into discrete 3D domains, such as Topologically Associating Domains and chromatin loops. The correct binding of CTCF and cohesin at domain boundaries is integral in maintaining the proper structure and function of these 3D domains. 3D domains have been mapped at the resolutions of 1 kilobase and above. However, it has not been possible to define their boundaries at the resolution of boundary-forming proteins.

Results

To predict domain boundaries at base-pair resolution, we developed preciseTAD, an optimized transfer learning framework trained on high-resolution genome annotation data. In contrast to current TAD/loop callers, preciseTAD-predicted boundaries are strongly supported by experimental evidence. Importantly, this approach can accurately delineate boundaries in cells without Hi-C data. preciseTAD provides a powerful framework to improve our understanding of how genomic regulators are shaping the 3D structure of the genome at base-pair resolution.

Availability

preciseTAD is an R/Bioconductor package available at https://bioconductor.org/packages/preciseTAD/.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Stilianoudakis SC 

PROVIDER: S-EPMC8756196 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

2011-10-26 | E-GEOD-30551 | biostudies-arrayexpress
| S-EPMC3215028 | biostudies-literature
| S-EPMC3036623 | biostudies-literature
2021-04-16 | GSE144336 | GEO
2011-10-27 | GSE30551 | GEO
2013-08-27 | E-GEOD-43423 | biostudies-arrayexpress
| S-EPMC519116 | biostudies-literature
| S-EPMC3696636 | biostudies-literature
2021-04-07 | GSE171636 | GEO