Project description:A chromosome-level nuclear genome and the organelle genomes of the alpine snow alga Chloromonas typhlos were sequenced and assembled by integrating short- and long-read sequencing with a proteogenomic strategy.
Project description:To profile the Daphnia methylome and to better understand the level of variation in the methylome across Daphnia species, we performed whole genome bisulfite sequencing (WGBSeq) of the adult Daphnia magna Bham2 strain and the Daphnia pulex Eloise Butler strains (EB45 and EB31). We also analysed the correlation between gene expression and methylation in the two species, using data generated in this study and RNA-seq data from Orsini et al. 2016. We found that methylation percentage across the genome of Daphnia spp. follows a bimodal distribution. Furthermore, CpG methylation in Daphnia predominantly occurs at coding regions. Although methylation levels decrease significantly towards the 3’ end of a gene, with a significant drop in methylation levels from one exon to the neighbouring intron, there is a clear spike in relative methylation levels at exon-intron boundaries, which may be linked to regulation of splicing. We further demonstrate that DNA methylation in Daphnia is responsive to intrinsic and extrinsic factors. We also compared the methylation and gene expression correlations found in Daphnia to publicly available datasets from two other invertebrate species (Apis mellifera and Nasonia vitripennis) and two vertebrate species (Homo sapiens and Mus musculus). We observed that, as in other invertebrates, Daphnia’s genome is sparsely methylated at a low level and the methylation is predominantly concentrated in gene bodies, whereas in vertebrate species the genome is heavily methylated (global methylation). Although the level and distribution of methylation across CpG sites differ between vertebrates and invertebrates, it is possible that methylation density at coding regions has the same function in both groups.
We demonstrate evolutionary conservation of a positive correlation between high methylation density at coding regions and gene expression across vertebrates and invertebrates, which potentially ensures continuous high expression of genes required throughout life in both vertebrates and invertebrates.
Project description:We produced an extensive transcript catalog for LCLs of 5 primate species by leveraging isoform sequencing and short-read RNA-seq. The curated transcriptomes were used to assist mass spectrometry protein identifications.
Project description:Primary objectives: The primary objective is to investigate circulating tumor DNA (ctDNA), by deep sequencing for mutation detection and by whole genome sequencing for copy number analyses, before the start of regorafenib treatment (baseline) and at defined time points during administration of regorafenib, with respect to treatment efficacy in colorectal cancer patients in terms of overall survival (OS).
Primary endpoints: Circulating tumor DNA (ctDNA), assessed by deep sequencing for mutation detection and by whole genome sequencing for copy number analyses, before the start of regorafenib treatment (baseline) and at defined time points during administration of regorafenib, with respect to treatment efficacy in colorectal cancer patients in terms of overall survival (OS).
Project description:Nematodes encompass over 24,000 described species, which have been found in almost every ecological habitat and make up over 80% of metazoan taxonomic diversity in soils. The last common ancestor of nematodes is believed to date back around 650–750 million years, generating a large and phylogenetically diverse group to be explored. However, for most species, high-quality gene annotations are incomplete or missing. Combining short-read RNA sequencing with mass spectrometry-based proteomics and machine learning quality control in an approach called proteotranscriptomics, we improve gene annotations for 9 genome-sequenced nematode species and provide new gene annotations for 3 additional species without genome assemblies. Emphasizing the sensitivity of our methodology, we provide evidence for two hitherto undescribed genes in the model organism Caenorhabditis elegans. Extensive phylogenetic systems analysis using this comprehensive proteome annotation provides new insights into the evolutionary processes of this metazoan group.
2022-11-12 | PXD034107 | Pride
Project description:Whole genome short read sequencing of the new genus and species, Mooraboolomyces wintlei
Project description:Adenovirus is a common human pathogen that relies on host cell processes for transcription and processing of viral RNA and for protein production. Although adenoviral promoters, splice junctions, and cleavage and polyadenylation sites have been characterized using low-throughput biochemical techniques or short-read cDNA-based sequencing, these technologies do not fully capture the complexity of the adenoviral transcriptome. By combining Illumina short-read and nanopore long-read direct RNA sequencing approaches, we mapped transcription start sites and cleavage and polyadenylation sites across the adenovirus genome. In addition to confirming the known canonical viral early and late RNA cassettes, our analysis of splice junctions within long RNA reads revealed an additional 35 novel viral transcripts. These RNAs include fourteen new splice junctions that lead to expression of canonical open reading frames (ORFs), six novel ORF-containing transcripts, and fifteen transcripts encoding messages that potentially alter protein functions through truncation or fusion of canonical ORFs. We also detect RNAs that bypass canonical cleavage sites and generate potential chimeric proteins by linking separate gene transcription units. Of these, an evolutionarily conserved protein was detected containing the N-terminus of E4orf6 fused to the downstream DBP/E2A ORF. Loss of this novel protein, E4orf6/DBP, was associated with aberrant viral replication center morphology and poor viral spread. Our work highlights how long-read sequencing technologies can reveal further complexity within viral transcriptomes.