Project description:<p><strong>BACKGROUND:</strong> Manchurian walnut (Juglans mandshurica Maxim.) is a tree with multiple industrial uses and medicinal properties in the Juglandaceae family (walnuts and hickories). J. mandshurica produces juglone, which is a toxic allelopathic agent and has potential utilization value. Furthermore, the seed of J. mandshurica is rich in various unsaturated fatty acids and has high nutritive value.</p><p><strong>FINDINGS:</strong> Here, we present a high-quality chromosome-scale reference genome assembly and annotation for J. mandshurica (n = 16) with a contig N50 of 21.4 Mb by combining PacBio high-fidelity reads with high-throughput chromosome conformation capture data. The assembled genome has an estimated sequence size of 548.7 Mb and consists of 657 contigs, 623 scaffolds and 40,453 protein-coding genes. In total, 60.99% of the assembled genome consists of repetitive sequences. Sixteen super-scaffolds corresponding to the 16 chromosomes were assembled, with a scaffold N50 length of 33.7 Mb and a BUSCO complete gene percentage of 98.3%. J. mandshurica displays a close sequence relationship with Juglans cathayensis, with a divergence time of 13.8 million years ago. Combining the high-quality genome, transcriptome and metabolomics data, we constructed a gene-to-metabolite network and identified 566 core and conserved differentially expressed genes, which may be involved in juglone biosynthesis. Five CYP450 genes were found that may contribute to juglone accumulation. NAC, bZip, NF-YA and NF-YC are positively correlated with the juglone content. Some candidate regulators (e.g., FUS3, ABI3, LEC2 and WRI1 transcription factors) involved in the regulation of lipid biosynthesis were also identified.</p><p><strong>CONCLUSIONS:</strong> Our genomic data provide new insights into the evolution of the walnut genome and create a new platform for accelerating molecular breeding and improving the comprehensive utilization of these economically important tree species.</p>
Project description:We performed RNA-Seq across multiple time points of development (stages 13, 19, 21, 23) in Parhyale hawaiensis and generated an improved genome annotation.
Project description:Genome/chromosome organization is highly ordered and controls nuclear events. Here, we show that the TATA box-binding protein (TBP) interacts with the Cnd2 kleisin subunit of condensin to mediate interphase and mitotic chromosome organization in fission yeast. TBP recruits condensin onto RNA polymerase III-transcribed (Pol III) genes and highly transcribed Pol II genes; condensin in turn associates these genes with centromeres. Inhibition of the Cnd2-TBP interaction disrupts condensin localization across the genome and the proper assembly of mitotic chromosomes, leading to severe defects in chromosome segregation and eventually causing cellular lethality. We propose that the Cnd2-TBP interaction coordinates transcription with chromosomal architecture by linking dispersed gene loci with centromeres. This chromosome arrangement can contribute to the efficient transmission of physical force at the kinetochore to chromosomal arms, thereby supporting the fidelity of chromosome segregation. Genome-wide distributions of condensin and Pol III factors in fission yeast.
Project description:More than 2x10E9 sequences made on Illumina platform derived from the genome of E14 embryonic stem cells cultured in our laboratory were used to build a database of about 2.7x10E6 single nucleotide variant. The database was validated using other two sequencing datasets from other laboratory and high overlap was observed. The identified variant are enriched on intergenic regions, but several thousands reside on gene exons and regulatory regions, such as promoters, enhancers, splicing site and untranslated regions of RNA, thus indicating high probability of an important functional impact on the molecular biology of this cells. We created a new E14 genome assembly including the new identified variants and used it to map reads from next generation sequencing data generated in our laboratory or in others on E14 cell line. We observed an increase in the number of mapped reads of about 5%. CpG dinucleotide showed the higher variation frequency, probably because of it could be target of DNA methylation. We performed a reduced representation bisulfite sequencing on E14 cell line to test our new genome assembly with respect to the mm9 genome reference. After mapping and methylation status calling, we obtained an increase of about 120,000 called CpG and we avoided about 20,000 wrong CpG calling. genotyping of E14 embryonic stem cells (ESCs) and Reduced representation Bisulfite Sequencing (RRBS) of E14 ESCs.
Project description:Centromeres are chromosomal regions that serve as platforms for kinetochore assembly and spindle attachments, ensuring accurate chromosome segregation during cell division. Despite functional conservation, centromeric sequences are diverse and usually repetitive across species, making them challenging to assemble and identify. Here, we describe centromeres in the model oomycete Phytophthora sojae by combining long-read sequencing-based genome assembly and chromatin immunoprecipitation for the centromeric histone CENP-A followed by high-throughput sequencing (ChIP-seq). P. sojae centromeres cluster at a single focus in the nucleus at different life stages and during nuclear division. We report a highly contiguous genome assembly of the P. sojae reference strain, which enabled identification of 15 highly enriched CENP-A binding regions as putative centromeres. By focusing on 10 intact regions, we demonstrate that centromeres in P. sojae are regional, spanning 211 to 356 kb. Most of these regions are transposon-rich, poorly transcribed, and lack the euchromatin mark H3K4me2 but are embedded within regions with the heterochromatin marks H3K9me3 and H3K27me3.
Project description:This is an auto-generated model with COBRA Matlab toolbox. The gadMorTrinigy de novo Trinity transcript assembly and peptide sequences are available at https://doi.org/10.6084/m9.figshare.c.5168303.v2