A compendium of conserved cleavage and polyadenylation events in mammalian genes
Ontology highlight
ABSTRACT: Cleavage and polyadenylation is essential for 3’ end processing of almost all eukaryotic mRNAs. Recent studies have shown widespread alternative cleavage and polyadenylation (APA) events leading to mRNA isoforms with different 3’UTRs and/or coding sequences. Here we present a compendium of conserved cleavage and polyadenylation sites (PASs) in mammalian genes, based on ~1.2 billion 3’ end sequencing reads from over 360 human, mouse and rat samples. We show that ~80% of mammalian mRNA genes contain at least one conserved PAS, and ~50% have conserved APA events. PAS conservation generally reduces promiscuous 3’ end processing, stabling gene expression levels across species. Conservation of APA correlates with gene age, gene expression features, and gene functions. Genes with certain functions, such as cell morphology, cell proliferation, and mRNA metabolism, are particularly enriched with APA events. While tissue-specific genes typically have a low APA rate, brain-specific genes tend to evolve APA. We show enrichment of mRNA destabilizing motifs in alternative 3’UTR sequences, leading to substantial differences in mRNA stability between 3’UTR APA isoforms. Using conserved PASs, we reveal sequence motifs surrounding APA sites and a preference of adenosine at the cleavage site. Mutations of the U-rich motif around the PAS often accompany APA profile changes between species. Analysis of lncRNA PASs indicates a mechanism of PAS fixation involving evolution of A-rich motifs. Taken together, our results present a comprehensive view of PAS evolution in mammals, and a phylogenic understanding of APA functions.
ORGANISM(S): Mus musculus Rattus norvegicus Homo sapiens
PROVIDER: GSE111134 | GEO | 2018/08/08
REPOSITORIES: GEO
ACCESS DATA