Genomics

Dataset Information

0

Reproducible inference of transcription factor footprints in ATAC-seq and DNase-seq datasets via protocol-specific bias modeling


ABSTRACT: DNase-seq and ATAC-seq are broadly used methods to assay open chromatin regions genome-wide. The single nucleotide resolution of DNase-seq has been further exploited to infer transcription factor binding sites (TFBS) in regulatory regions via footprinting. Recent studies have demonstrated the sequence bias of DNase I and its adverse effects on footprinting efficiency. However, footprinting and the impact of sequence bias have not been extensively studied for ATAC-seq. Here, we undertake a systematic comparison of the two methods and show that a modification to the ATAC-seq protocol increases its yield and its agreement with DNase-seq data from the same cell line. We demonstrate that the two methods have distinct sequence biases and correct for these protocol-specific biases when performing footprinting. Despite differences in footprint shapes, the locations of the inferred footprints in ATAC-seq and DNase-seq are largely concordant. However, the protocol-specific sequence biases in conjunction with the sequence content of TFBSs impacts the discrimination of footprint from background, which leads to one method outperforming the other for some TFs. Finally, we address the depth required for reproducible identification of open chromatin regions and TF footprints.

ORGANISM(S): Homo sapiens

PROVIDER: GSE108513 | GEO | 2018/11/23

REPOSITORIES: GEO

Dataset's files

Source:
Action DRS
Other
Items per page:
1 - 1 of 1

Similar Datasets

2014-09-04 | E-GEOD-61105 | biostudies-arrayexpress
2014-09-04 | GSE61105 | GEO
2024-05-12 | GSE267154 | GEO
2012-07-26 | E-GEOD-37744 | biostudies-arrayexpress
| PRJNA427530 | ENA
2012-07-26 | GSE37744 | GEO
2014-11-19 | E-GEOD-51341 | biostudies-arrayexpress
2016-12-21 | GSE92674 | GEO
2023-03-29 | GSE216403 | GEO
2023-03-29 | GSE216462 | GEO