Unknown

Dataset Information

0

Simultaneous characterization of sense and antisense genomic processes by the double-stranded hidden Markov model.


ABSTRACT: Hidden Markov models (HMMs) have been extensively used to dissect the genome into functionally distinct regions using data such as RNA expression or DNA binding measurements. It is a challenge to disentangle processes occurring on complementary strands of the same genomic region. We present the double-stranded HMM (dsHMM), a model for the strand-specific analysis of genomic processes. We applied dsHMM to yeast using strand specific transcription data, nucleosome data, and protein binding data for a set of 11 factors associated with the regulation of transcription.The resulting annotation recovers the mRNA transcription cycle (initiation, elongation, termination) while correctly predicting strand-specificity and directionality of the transcription process. We find that pre-initiation complex formation is an essentially undirected process, giving rise to a large number of bidirectional promoters and to pervasive antisense transcription. Notably, 12% of all transcriptionally active positions showed simultaneous activity on both strands. Furthermore, dsHMM reveals that antisense transcription is specifically suppressed by Nrd1, a yeast termination factor.

SUBMITTER: Glas J 

PROVIDER: S-EPMC4797261 | biostudies-other | 2016 Mar

REPOSITORIES: biostudies-other

altmetric image

Publications

Simultaneous characterization of sense and antisense genomic processes by the double-stranded hidden Markov model.

Glas Julia J   Dümcke Sebastian S   Zacher Benedikt B   Poron Don D   Gagneur Julien J   Tresch Achim A  

Nucleic acids research 20151117 5


Hidden Markov models (HMMs) have been extensively used to dissect the genome into functionally distinct regions using data such as RNA expression or DNA binding measurements. It is a challenge to disentangle processes occurring on complementary strands of the same genomic region. We present the double-stranded HMM (dsHMM), a model for the strand-specific analysis of genomic processes. We applied dsHMM to yeast using strand specific transcription data, nucleosome data, and protein binding data fo  ...[more]

Similar Datasets

| S-EPMC4065698 | biostudies-literature
| S-EPMC8204269 | biostudies-literature
| S-EPMC2912474 | biostudies-literature
| S-EPMC7373196 | biostudies-literature
| S-EPMC6373422 | biostudies-literature
| S-EPMC2735038 | biostudies-literature
2012-10-18 | GSE34490 | GEO
| S-EPMC3114652 | biostudies-literature
| S-EPMC7896036 | biostudies-literature
| S-EPMC4834946 | biostudies-literature