Dataset Information

An Annotation Agnostic Algorithm for Detecting Nascent RNA Transcripts in GRO-Seq.


ABSTRACT: We present a fast and simple algorithm to detect nascent RNA transcription in global nuclear run-on sequencing (GRO-seq). GRO-seq is a relatively new protocol that captures nascent transcripts from actively engaged polymerase, providing a direct readout of bona fide transcription. Most traditional assays, such as RNA-seq, measure steady-state RNA levels, which are affected by transcription, post-transcriptional processing, and RNA stability. GRO-seq data, however, present unique analysis challenges that are only beginning to be addressed. Here, we describe a new algorithm, Fast Read Stitcher (FStitch), that takes advantage of two popular machine-learning techniques, hidden Markov models and logistic regression, to classify which regions of the genome are transcribed. Given a small user-defined training set, our algorithm is accurate, robust to varying read depth, annotation agnostic, and fast. Analysis of GRO-seq data without any a priori need for annotation uncovers surprising new insights into several aspects of the transcription process.
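
As a rough illustration of how the two techniques named in the abstract can be combined, the sketch below scores each genomic bin with a logistic model and then smooths the per-bin scores with a two-state (inactive/active) Viterbi decode. It is not the FStitch implementation: the single log-coverage feature, the weights `weight` and `bias`, the `stay` transition probability, and all function names are assumptions made for this example, and the logistic output is used directly as an emission score for simplicity. In practice the weights would be fit from the user-defined training set rather than hard-coded.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def active_probability(coverage, weight=1.5, bias=-2.0):
    # Hypothetical logistic model on one feature, log(1 + read coverage).
    # weight and bias are illustrative values, not fitted parameters.
    return sigmoid(weight * math.log1p(coverage) + bias)


def viterbi_two_state(probs, stay=0.9):
    # State 0 = inactive, state 1 = active. `stay` is an assumed
    # self-transition probability shared by both states.
    if not probs:
        return []
    eps = 1e-12
    log_stay = math.log(stay)
    log_switch = math.log(1.0 - stay)

    def emit(p):
        # Log score of observing this bin under each state.
        return (math.log(max(1.0 - p, eps)), math.log(max(p, eps)))

    first = emit(probs[0])
    score = [math.log(0.5) + first[0], math.log(0.5) + first[1]]
    back = []
    for p in probs[1:]:
        e = emit(p)
        new_score = [0.0, 0.0]
        pointers = [0, 0]
        for s in (0, 1):
            cand = [
                score[prev] + (log_stay if prev == s else log_switch)
                for prev in (0, 1)
            ]
            best_prev = 0 if cand[0] >= cand[1] else 1
            pointers[s] = best_prev
            new_score[s] = cand[best_prev] + e[s]
        back.append(pointers)
        score = new_score

    # Trace back from the best final state.
    state = 0 if score[0] >= score[1] else 1
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path


def active_segments(path):
    # Merge runs of active bins into half-open (start, end) bin intervals.
    segments = []
    start = None
    for i, s in enumerate(path):
        if s == 1 and start is None:
            start = i
        elif s == 0 and start is not None:
            segments.append((start, i))
            start = None
    if start is not None:
        segments.append((start, len(path)))
    return segments


def call_transcribed_bins(coverage_per_bin):
    probs = [active_probability(c) for c in coverage_per_bin]
    return active_segments(viterbi_two_state(probs))
```

Calling `call_transcribed_bins` on a list of per-bin read counts returns half-open bin intervals labeled active. A high `stay` value makes the decode reluctant to split a region over a single low-coverage bin, which is the "stitching" intuition; no gene annotation enters the computation at any point.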

SUBMITTER: Azofeifa JG 

PROVIDER: S-EPMC5667649 | biostudies-literature | 2017 Sep-Oct

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC6213952 | biostudies-literature
| S-EPMC9463444 | biostudies-literature
| S-EPMC4054009 | biostudies-literature
| S-EPMC6269517 | biostudies-literature
| S-EPMC8097831 | biostudies-literature
| S-EPMC10723487 | biostudies-literature
| S-EPMC4066803 | biostudies-other
| S-EPMC3409466 | biostudies-literature
| S-EPMC9482196 | biostudies-literature
| S-EPMC3694665 | biostudies-literature