Species-specific factors mediate extensive heterogeneity of mRNA 3' ends in yeasts
Ontology highlight
ABSTRACT: Most eukaryotic genes express mRNAs with alternative polyadenylation sites at their 3’ ends. Here we show that polyadenylated 3’ termini in three yeast species (S. cerevisiae, K. lactis, D. hansenii) are remarkably heterogeneous. Instead of a few discrete 3’ ends, the average yeast gene has an “end zone”, a >200 bp window with >60 distinct poly(A) sites, the most utilized of which represents only 20% of the mRNA molecules. The pattern of polyadenylation within this zone varies across species, with D. hansenii possessing a higher focus on a single dominant point closer to the ORF terminus. Some polyadenylation occurs within mRNA coding regions with a strong bias towards the promoter. The polyadenylation pattern is determined by a highly degenerate sequence over a broad region and by a local sequence that relies on A residues after the cleavage point. Many dominant poly(A) sites are predicted to adopt a common secondary structure that may be recognized by the cleavage/polyadenylation machinery. We suggest that the end zone reflects a region permissive for polyadenylation, within which cleavage occurs preferentially at the A-rich sequence. In S. cerevisiae strains, D. hansenii genes adopt the S. cerevisiae polyadenylation profile, indicating that the polyadenylation pattern is mediated primarily by species-specific factors. Four sequencing lanes containing direct RNA sequence from S. cerevisiae (strains JGY2000 and two replicates of AB1380), K. lactis strain CLIB209, D. hansenii strain NCYC2572, and S. cerevisiae strains JYAC06 and JYAC07, each harboring D. hansenii sequences on a YAC. AB1380 is the non-YAC-containing S. cerevisiae parental strain for JYAC06 and JYAC07.
ORGANISM(S): Saccharomyces cerevisiae
SUBMITTER: Zarmik Moqtaderi
PROVIDER: E-GEOD-47661 | biostudies-arrayexpress |
REPOSITORIES: biostudies-arrayexpress
ACCESS DATA