Unknown

Dataset Information

0

Identifying splicing regulatory elements with de Bruijn graphs.


ABSTRACT: Splicing regulatory elements (SREs) are short, degenerate sequences on pre-mRNA molecules that enhance or inhibit the splicing process via the binding of splicing factors, proteins that regulate the functioning of the spliceosome. Existing methods for identifying SREs in a genome are either experimental or computational. Here, we propose a formalism based on de Bruijn graphs that combines genomic structure, word count enrichment analysis, and experimental evidence to identify SREs found in exons. In our approach, SREs are not restricted to a fixed length (i.e., k-mers, for a fixed k). As a result, we identify 2001 putative exonic enhancers and 3080 putative exonic silencers for human genes, with lengths varying from 6 to 15 nucleotides. Many of the predicted SREs overlap with experimentally verified binding sites. Our model provides a novel method to predict variable length putative regulatory elements computationally for further experimental investigation.

SUBMITTER: Badr E 

PROVIDER: S-EPMC4253301 | biostudies-literature | 2014 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Identifying splicing regulatory elements with de Bruijn graphs.

Badr Eman E   Heath Lenwood S LS  

Journal of computational biology : a journal of computational molecular cell biology 20141201 12


Splicing regulatory elements (SREs) are short, degenerate sequences on pre-mRNA molecules that enhance or inhibit the splicing process via the binding of splicing factors, proteins that regulate the functioning of the spliceosome. Existing methods for identifying SREs in a genome are either experimental or computational. Here, we propose a formalism based on de Bruijn graphs that combines genomic structure, word count enrichment analysis, and experimental evidence to identify SREs found in exons  ...[more]

Similar Datasets

| S-EPMC5872255 | biostudies-literature
| S-EPMC4120145 | biostudies-literature
| S-EPMC9528980 | biostudies-literature
| S-EPMC6612864 | biostudies-other
| S-EPMC8326735 | biostudies-literature
| S-EPMC3607606 | biostudies-literature
| S-EPMC3421212 | biostudies-literature
| S-EPMC6061703 | biostudies-literature
| S-EPMC8016496 | biostudies-literature
| S-EPMC3272472 | biostudies-literature