Project description:We report widespread ChIP-seq bias at highly expressed genes in yeast that could lead to misinterpretation ChIP-seq for multiple transcription or chromatin-associated factors and negative controls
Project description:Usage of synonymous codons represents a characteristic pattern of preference in each organism. It has been inferred that such bias of codon usage has evolved as a result of adaptation for efficient synthesis of proteins. Here we examined synonymous codon usage in genes of the fission yeast Schizosaccharomyces pombe, and compared codon usage bias with expression levels of the gene. In this organism, synonymous codon usage bias was correlated with expression levels of the gene; the bias was most obvious in two-codon amino acids. A similar pattern of the codon usage bias was also observed in Saccharomyces cerevisiae, Arabidopsis thaliana, and Caenorhabditis elegans, but was not obvious in Oryza sativa, Drosophila melanogaster, Takifugu rubripes and Homo sapiens. As codons of the highly expressed genes have greater influence on translational efficiency than codons of genes expressed at lower levels, it is likely that codon usage in the S. pombe genome has been optimized by translational selection through evolution.
Project description:MicroRNAs (miRNAs) modulate diverse biological and pathological processes via post-transcriptional gene silencing. High-throughput small RNA sequencing (sRNA-seq) has been widely adopted to investigate the functions and regulatory mechanisms of miRNAs. However, accurate quantification of miRNAs has been limited owing to the severe ligation bias in conventional sRNA-seq methods. Here we quantify miRNAs and their variants (known as isomiRs) by an improved sRNA-seq protocol, termed AQ-seq (accurate quantification by sequencing), that utilizes adapters with terminal degenerate sequences and a high concentration of polyethylene glycol (PEG), which minimize the ligation bias during library preparation. Measurement using AQ-seq allows us to correct the previously misannotated 5′ end usage and strand preference in public databases. Importantly, the analysis of 5′ terminal heterogeneity reveals widespread alternative processing events which have been underestimated. We also identify highly uridylated miRNAs originating from the 3p strands, indicating regulations mediated by terminal uridylyl transferases at the pre-miRNA stage. Taken together, our study reveals the complexity of the miRNA isoform landscape, allowing us to refine miRNA annotation and to advance our understanding of miRNA regulation. Furthermore, AQ-seq can be adopted to improve other ligation-based sequencing methods including crosslinking-immunoprecipitation-sequencing (CLIP-seq) and ribosome profiling (Ribo-seq).
Project description:Usage of synonymous codons represents a characteristic pattern of preference in each organism. It has been inferred that such bias of codon usage has evolved as a result of adaptation for efficient synthesis of proteins. Here we examined synonymous codon usage in genes of the fission yeast Schizosaccharomyces pombe, and compared codon usage bias with expression levels of the gene. In this organism, synonymous codon usage bias was correlated with expression levels of the gene; the bias was most obvious in two-codon amino acids. A similar pattern of the codon usage bias was also observed in Saccharomyces cerevisiae, Arabidopsis thaliana, and Caenorhabditis elegans, but was not obvious in Oryza sativa, Drosophila melanogaster, Takifugu rubripes and Homo sapiens. As codons of the highly expressed genes have greater influence on translational efficiency than codons of genes expressed at lower levels, it is likely that codon usage in the S. pombe genome has been optimized by translational selection through evolution. Relative amounts of mRNA for each ORF were measured by DNA microarray using genomic DNA as a reference, and the copy number of mRNA was calculated using an estimate of the total mRNA number in the cell as 100,000 copies.
Project description:Chromatin immunoprecipitation followed by sequencing (ChIP-seq) is widely used to detect genome-wide interactions between a protein of interest and DNA in vivo. Loci showing strong enrichment over adjacent background regions are typically considered to be sites of binding. Insufficient attention has been given to systematic artifacts inherent to the ChIP-seq procedure that might generate a misleading picture of protein binding to certain loci. We show here that unrelated transcription factors appear to consistently bind to the gene bodies of highly transcribed genes in yeast. Strikingly, several types of negative control experiments, including a protein that is not expected to bind chromatin, also showed similar patterns of strong binding within gene bodies. These false positive signals were evident across sequencing platforms and immunoprecipitation protocols, as well as in previously published datasets from other labs. We show that these false positive signals derive from high rates of transcription, and are inherent to the ChIP procedure, although they are exacerbated by sequencing library construction procedures. This expression bias is strong enough that a known transcriptional repressor like Tup1 can erroneously appear to be an activator. Another type of background bias stems from the inherent nucleosomal structure of chromatin, and can potentially make it seem like certain factors bind nucleosomes even when they don't. Our analysis suggests that a mock ChIP sample offers a better normalization control for the expression bias, whereas the ChIP input is more appropriate for the nucleosomal periodicity bias. While these controls alleviate the effect of the biases to some extent, they are unable to eliminate it completely. Caution is therefore warranted regarding the interpretation of data that seemingly show the association of various transcription and chromatin factors with highly transcribed genes in yeast.
Project description:71 liver RNA samples from three mouse strains - DBA/2J, C57BL/6J and C3H/HeJ - were profiled using a custom-designed microarray monitoring exon and exon-junction expression of 1,020 genes representing 9,406 exons. Gene expression was calculated via two different methods, using the 3'-most exon probe ("3' gene expression profiling") and using all probes associated with the gene ("whole-transcript gene expression profiling"), while exon expression was determined using exon probes and flanking junction probes that spanned across the neighboring exons("exon expression profiling"). Widespread strain and sex influences were detected using a two-way Analysis of Variance (ANOVA) regardless of the profiling method used. However, over 90% of the genes identified in 3' gene expression profiling or whole transcript profiling were identified in exon profiling, along with 75% and 38% more genes, respectively, showing evidence of differential isoform expression. Overall, 55% and 32% of genes, respectively, exhibited strain- and sex-bias differential gene or exon expression. Keywords: Grouped