Dataset Information

Underlying causes for prevalent false positives and false negatives in STARR-seq data.


ABSTRACT: Self-transcribing active regulatory region sequencing (STARR-seq) and its variants have been widely used to characterize enhancers. However, it has been reported that up to 87% of STARR-seq peaks are located in repressive chromatin and are not functional in the tested cells. While some of the STARR-seq peaks in repressive chromatin might be active in other cell/tissue types, some others might be false positives. Meanwhile, many active enhancers may not be identified by the current STARR-seq methods. Although methods have been proposed to mitigate systematic errors caused by the use of plasmid vectors, the artifacts due to the intrinsic limitations of current STARR-seq methods are still prevalent and the underlying causes are not fully understood. Based on predicted cis-regulatory modules (CRMs) and non-CRMs in the human genome as well as predicted active CRMs and non-active CRMs in a few human cell lines/tissues with STARR-seq data available, we reveal prevalent false positives and false negatives in STARR-seq peaks generated by major variants of STARR-seq methods and possible underlying causes. Our results will help design strategies to improve STARR-seq methods and interpret the results.

SUBMITTER: Ni P 

PROVIDER: S-EPMC10516709 | biostudies-literature | 2023 Sep

REPOSITORIES: biostudies-literature

Publications

Underlying causes for prevalent false positives and false negatives in STARR-seq data.

Ni Pengyu, Wu Siwen, Su Zhengchang

NAR Genomics and Bioinformatics, 2023-09-22, issue 3


Similar Datasets

| S-EPMC9677119 | biostudies-literature
| S-EPMC2747693 | biostudies-literature
| S-EPMC5082614 | biostudies-literature
| S-EPMC6104401 | biostudies-literature
| S-EPMC2643841 | biostudies-literature
| S-EPMC9392255 | biostudies-literature
| S-EPMC5583356 | biostudies-literature
| S-EPMC11526515 | biostudies-literature
| S-EPMC2628906 | biostudies-literature
| S-EPMC8092017 | biostudies-literature