Unknown,Transcriptomics,Genomics,Proteomics

Dataset Information

0

Genome-wide discovery of human splicing branchpoints [RNAse]


ABSTRACT: Gene splicing requires three basal genetic elements; the 3M-bM-^@M-^Y and 5M-bM-^@M-^Y splice sites and the branchpoint to which the 5M-bM-^@M-^Y intron termini is ligated to form a closed lariat during the splicing reaction. The 5M-bM-^@M-^Y and 3M-bM-^@M-^Y splice sites that define exon boundaries have been widely identified, revealing pervasive transcription and splicing of human genes. However, the locations of the third requisite element, the branchpoint, are still largely unknown. Here we employ two complementary approaches, targeted RNA sequencing and exoribonuclease digestion, to distil sequenced reads that traverse the lariat junction and, via non-conventional alignment, locate human branchpoint nucleotides. Alignments identify 88,748 branchpoints that correspond to 20% of known introns, with 76% supported by diagnostic sequence mismatch errors. This affords a first genome-wide analysis of branchpoints, describing their distribution, selection, and the existence of a diverse array of overlapping sequence motifs with distinct usage, evolutionary histories, and co-variation with distal splicing elements. The overlap of branchpoints with noncoding human genetic variation also indicates a notable contribution to disease. This annotation and analysis incorporates branchpoints into transcriptomic research and reflects a core role for this element in the regulatory code that governs gene splicing and expression. RNaseR validation of branchpoint nucleotides

ORGANISM(S): Homo sapiens

SUBMITTER: Tim Mercer 

PROVIDER: E-GEOD-53327 | biostudies-arrayexpress |

REPOSITORIES: biostudies-arrayexpress

Similar Datasets

2014-10-01 | E-GEOD-53326 | biostudies-arrayexpress
2014-10-01 | GSE53327 | GEO
2014-10-01 | GSE53326 | GEO
2014-10-01 | E-GEOD-53328 | biostudies-arrayexpress
2019-04-17 | GSE117416 | GEO
2022-02-03 | GSE195586 | GEO
| PRJNA482052 | ENA
2006-10-16 | GSE5470 | GEO
2014-03-01 | E-GEOD-52503 | biostudies-arrayexpress
| PRJNA231721 | ENA