Unknown

Dataset Information

0

Improved discovery of RNA-binding protein binding sites in eCLIP data using DEWSeq.


ABSTRACT: Enhanced crosslinking and immunoprecipitation (eCLIP) sequencing is a method for transcriptome-wide detection of binding sites of RNA-binding proteins (RBPs). However, identified crosslink sites can deviate from experimentally established functional elements of even well-studied RBPs. Current peak-calling strategies result in low replication and high false positive rates. Here, we present the R/Bioconductor package DEWSeq that makes use of replicate information and size-matched input controls. We benchmarked DEWSeq on 107 RBPs for which both eCLIP data and RNA sequence motifs are available and were able to more than double the number of motif-containing binding regions relative to standard eCLIP processing. The improvement not only relates to the number of binding sites (3.1-fold with known motifs for RBFOX2), but also their subcellular localization (1.9-fold of mitochondrial genes for FASTKD2) and structural targets (2.2-fold increase of stem-loop regions for SLBP. On several orthogonal CLIP-seq datasets, DEWSeq recovers a larger number of motif-containing binding sites (3.3-fold). DEWSeq is a well-documented R/Bioconductor package, scalable to adequate numbers of replicates, and tends to substantially increase the proportion and total number of RBP binding sites containing biologically relevant features.

SUBMITTER: Schwarzl T 

PROVIDER: S-EPMC10783507 | biostudies-literature | 2024 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Improved discovery of RNA-binding protein binding sites in eCLIP data using DEWSeq.

Schwarzl Thomas T   Sahadevan Sudeep S   Lang Benjamin B   Miladi Milad M   Backofen Rolf R   Huber Wolfgang W   Hentze Matthias W MW   Tartaglia Gian Gaetano GG  

Nucleic acids research 20240101 1


Enhanced crosslinking and immunoprecipitation (eCLIP) sequencing is a method for transcriptome-wide detection of binding sites of RNA-binding proteins (RBPs). However, identified crosslink sites can deviate from experimentally established functional elements of even well-studied RBPs. Current peak-calling strategies result in low replication and high false positive rates. Here, we present the R/Bioconductor package DEWSeq that makes use of replicate information and size-matched input controls. W  ...[more]

Similar Datasets

| S-EPMC9834051 | biostudies-literature
| S-EPMC4887338 | biostudies-literature
2016-03-28 | E-GEOD-77634 | biostudies-arrayexpress
| S-EPMC5219607 | biostudies-literature
2022-10-18 | GSE205536 | GEO
2020-06-23 | GSE144318 | GEO
| PRJNA603360 | ENA
2020-10-27 | E-MTAB-9031 | biostudies-arrayexpress
| PRJNA846294 | ENA
| S-EPMC10108518 | biostudies-literature