Project description:Primary objectives: The primary objective is to investigate circulating tumor DNA (ctDNA) via deep sequencing for mutation detection and by whole genome sequencing for copy number analyses before start (baseline) with regorafenib and at defined time points during administration of regorafenib for treatment efficacy in colorectal cancer patients in terms of overall survival (OS).
Primary endpoints: circulating tumor DNA (ctDNA) via deep sequencing for mutation detection and by whole genome sequencing for copy number analyses before start (baseline) with regorafenib and at defined time points during administration of regorafenib for treatment efficacy in colorectal cancer patients in terms of overall survival (OS).
Project description:Genome-wide maps of cytosine methylation, cytosine hydroxylmethylation and small non coding RNAs in mouse ES cells and upon guided differentiation to mesoendoderm cells. Mouse embryonic stem cells (E14) were guided differentiated into mesoendoderm lineages by activin-A induction. cells in three time points (day0, day4 and day6) were collected. The genome-wide studies on three cell types were summerized as following: cytosine methylation data were generated using methylated DNA immunoprecipitation followed by sequencing (MeDIP-seq) and DNA digestion by methyl-sensitive restriction enzymes followed by sequencing (MRE-seq); DNA product for 5-hmC_ChIP-seq is generated by a selctive chemical labeling method (Nat. Biotechnol. 2011, 29, 68-72). E14 Day0 data for MRE-seq and MeDIP-seq are released first in previous publication and included in prior series GSE36114 ChIP-seq, 5-hmC-seq, MeDIP-Seq, MRE-Seq, ncRNA-Seq, and RNA-seq on activin-induced differentiating ES cells at 3 time points.
Project description:To elucidate the molecular mechanisms that determine the differences in flowering times between two radish lines, NH-JS1 (late bolting) and NH-JS2 (early bolting) by high-throughput RNA sequencing analysis. In total, 71,188 unigenes were identified by reference-guided assembly, of which 309, 788, and 980 genes were differentially expressed between the two inbred lines after 0, 15, and 35 days of vernalization, respectively.
Project description:Accurate annotation of transcript isoforms is crucial to understand gene functions, but automated methods for reconstructing full-length transcripts from RNA sequencing (RNA-seq) data remain imprecise. We developed Bookend, a software package for transcript assembly that incorporates data from different RNA-seq techniques, with a focus on identifying and utilizing RNA 5′ and 3′ ends. Through end-guided assembly with Bookend we demonstrate that correct modeling of transcript start and end sites is essential for precise transcript assembly. Furthermore, we discovered that utilization of end-labeled reads present in full-length single-cell RNA-seq (scRNA-seq) datasets dramatically improves the precision of transcript assembly in single cells. Finally, we show that hybrid assembly across short-read, long-read, and end-capture RNA-seq datasets from Arabidopsis, as well as meta-assembly of RNA-seq from single mouse embryonic stem cells (mESCs) can produce end-to-end transcript annotations of comparable quality to reference annotations in these model organisms.
Project description:The naked mole-rat (NMR; Heterocephalus glaber) has recently gained considerable attention in the scientific community for its unique potential to unveil novel insights in the fields of medicine, biochemistry, and evolution. NMRs exhibit unique adaptations that include protracted fertility, cancer resistance, eusociality, and anoxia. This suite of adaptations is not found in other rodent species, suggesting that interrogating conserved and accelerated regions in the NMR genome will find regions of the NMR genome fundamental to their unique adaptations. However, the current NMR genome assembly has limits that make studying structural variations, heterozygosity, and non-coding adaptations challenging. We present a complete diploid naked-mole rat genome assembly by integrating long-read and 10X-linked read genome sequencing of a male NMR and its parents, and Hi-C sequencing in the NMR hypothalamus (N=2). Reads were identified as maternal, paternal or ambiguous (TrioCanu). We then polished genomes with Flye, Racon and Medaka. Assemblies were then scaffolded using the following tools in order: Scaff10X, Salsa2, 3d-DNA, Minimap2-alignment between assemblies, and the Juicebox Assembly Tools. We then subjected the assemblies to another round of polishing, including short-read polishing with Freebayes. We assembled the NMR mitochondrial genome with mitoVGP. Y chromosome contigs were identified by aligning male and female 10X linked reads to the paternal genome and finding male-biased contigs not present in the maternal genome. Contigs were assembled with publicly available male NMR Fibroblast Hi-C-seq data (SRR820318). Both assemblies have their sex chromosome haplotypes merged so that both assemblies have a high-quality X and Y chromosome. Finally, assemblies were evaluated with Quast, BUSCO, and Merqury, which all reported the base-pair quality and contiguity of both assemblies as high-quality. The assembly will next be annotated by Ensembl using public RNA-seq data from multiple tissues (SRP061363). Together, this assembly will provide a high-quality resource to the NMR and comparative genomics communities.