Project description:Purple-grain wheat are caused by anthocyanin accumulation in the seed coat. But little is known about molecular mechanism of anthocyanin biosynthesis. The anthocyanin biosynthesis and accumulation were affected by light in purple-grain wheat. The spikes of purple-grain wheat Luozhen No.1 were bagged with four-layer Kraft paper bags after pollination. To identify genes involved in the anthocyanin biosynthesis, we sequenced four pericarp cDNA libraries, D15 (15 DAP), D20 (20 DAP) of shading treatment, and L15 (15 DAP), L20 (20 DAP) of untreated control using an Illumina HiSeqTM 2000. After quality control, raw reads are filtered into clean reads which will be aligned to the reference sequences. The alignment data is utilized to calculate distribution of reads on reference genes and mapping ratio, and proceed with downstream analysis including gene and isoform expression, deep analysis based on gene expression (PCA/correlation/screening differentially expressed genes and so on),exon expression, gene structure refinement, alternative splicing, novel transcript prediction and annotation, SNP detection, Indel detection. Further, we also perform deep analysis based on different expression genes, including Gene Ontology (GO) enrichment analysis, Pathway enrichment analysis, cluster analysis, and finding transcriptor factor.
Project description:The Cancer Genome Atlas (TCGA) Isoform Expression Quantification Data is the largest ressource of isomiR level sequenced cancer data publicly available. Since the datasets were built up over years and through different contributing institutions, it is not free of batch effects. We evaluated different batch correction approaches to remove batch effects in the data, details of the best performing algorithm and batch variables are included in the supplementary file. Additionally, annotation of the chromosomal end position of each isomiR feature was corrected by the offset of 1 to account for exclusive annotation.
Project description:Gene regulatory elements such as enhancers have profound effects on cellular function, health and disease. Our understanding of mammalian enhancer function is limited by the lack of technology that would allow for a rapid and thorough test of their cell type-specific function. Here we describe a novel Cas9-effector system that enables rapid testing and functional annotation of native enhancers in embryonic stem cells. V6.5 mouse ES cells according to the protocol as described