Unknown

Dataset Information

0

Bedshift: perturbation of genomic interval sets.


ABSTRACT: Functional genomics experiments, like ChIP-Seq or ATAC-Seq, produce results that are summarized as a region set. There is no way to objectively evaluate the effectiveness of region set similarity metrics. We present Bedshift, a tool for perturbing BED files by randomly shifting, adding, and dropping regions from a reference file. The perturbed files can be used to benchmark similarity metrics, as well as for other applications. We highlight differences in behavior between metrics, such as that the Jaccard score is most sensitive to added or dropped regions, while coverage score is most sensitive to shifted regions.

SUBMITTER: Gu A 

PROVIDER: S-EPMC8379854 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC6408804 | biostudies-literature
| S-EPMC3867968 | biostudies-literature
| S-EPMC8077120 | biostudies-literature
| S-EPMC3947791 | biostudies-other
| S-EPMC6901075 | biostudies-literature
| S-EPMC2375126 | biostudies-literature
| S-EPMC3333775 | biostudies-literature
| S-EPMC7249077 | biostudies-literature
| S-EPMC3740633 | biostudies-literature
| S-EPMC6434014 | biostudies-literature