Project description:a chromosome-level nuclear genome and organelle genomes of the alpine snow alga Chloromonas typhlos were sequenced and assembled by integrating short- and long-read sequencing and proteogenomic strategy
Project description:Primary objectives: The primary objective is to investigate circulating tumor DNA (ctDNA) via deep sequencing for mutation detection and by whole genome sequencing for copy number analyses before start (baseline) with regorafenib and at defined time points during administration of regorafenib for treatment efficacy in colorectal cancer patients in terms of overall survival (OS).
Primary endpoints: circulating tumor DNA (ctDNA) via deep sequencing for mutation detection and by whole genome sequencing for copy number analyses before start (baseline) with regorafenib and at defined time points during administration of regorafenib for treatment efficacy in colorectal cancer patients in terms of overall survival (OS).
Project description:We applied Single Molecule Real-Time long-read whole-genome sequencing in Dux knockout mouse and confirmed the success of our Dux knockout mouse model.
Project description:Adenovirus is a common human pathogen that relies on host cell processes for transcription and processing of viral RNA and protein production. Although adenoviral promoters, splice junctions, and cleavage and polyadenylation sites have been characterized using low-throughput biochemical techniques or short read cDNA-based sequencing, these technologies do not fully capture the complexity of the adenoviral transcriptome. By combining Illumina short-read and nanopore long-read direct RNA sequencing approaches, we mapped transcription start sites and cleavage and polyadenylation sites across the adenovirus genome. In addition to confirming the known canonical viral early and late RNA cassettes, our analysis of splice junctions within long RNA reads revealed an additional 35 novel viral transcripts. These RNAs include fourteen new splice junctions which lead to expression of canonical open reading frames (ORF), six novel ORF-containing transcripts, and fifteen transcripts encoding for messages that potentially alter protein functions through truncations or fusion of canonical ORFs. In addition, we also detect RNAs that bypass canonical cleavage sites and generate potential chimeric proteins by linking separate gene transcription units. Of these, an evolutionary conserved protein was detected containing the N-terminus of E4orf6 fused to the downstream DBP/E2A ORF. Loss of this novel protein, E4orf6/DBP, was associated with aberrant viral replication center morphology and poor viral spread. Our work highlights how long-read sequencing technologies can reveal further complexity within viral transcriptomes.
2022-08-30 | PXD034464 | Pride
Project description:Long-read whole genome sequencing of mulberry
Project description:Objectives: To perform long-read transcriptome and proteome profiling of pathogen-stimulated peripheral blood mononuclear cells (PBMCs) from healthy donors. We aim to discover new transcripts and protein isoforms expressed during immune responses to diverse pathogens. Methods: PBMCs were exposed to four microbial stimuli for 24 hours: the TLR4 ligand lipopolysaccharide (LPS), the TLR3 ligand Poly(I:C), heat-inactivated Staphylococcus aureus, Candida albicans, and RPMI medium as negative controls. Long-read sequencing (PacBio) of one donor and secretome proteomics and short-read sequencing of five donors were performed. IsoQuant was used for transcriptome construction, Metamorpheus/FlashLFQ for proteome analysis, and Illumina short-read 3’-end mRNA sequencing for transcript quantification. Results: Long-read transcriptome profiling reveals the expression of novel sequences and isoform switching induced upon pathogen stimulation, including transcripts that are difficult to detect using traditional short-read sequencing. We observe widespread loss of intron retention as a common result of all pathogen stimulations. We highlight novel transcripts of NFKB1 and CASP1 that may indicate novel immunological mechanisms. In general, RNA expression differences did not result in differences in the amounts of secreted proteins. Interindividual differences in the proteome were larger than the differences between stimulated and unstimulated PBMCs. Clustering analysis of secreted proteins revealed a correlation between chemokine (receptor) expression on the RNA and protein levels in C. albicans- and Poly(I:C)-stimulated PBMCs. Conclusion: Isoform aware long-read sequencing of pathogen-stimulated immune cells highlights the potential of these methods to identify novel transcripts, revealing a more complex transcriptome landscape than previously appreciated.
Project description:The Australian Acute Care Genomics program provides ultra-rapid diagnostic testing to critically ill infants and children with suspected genetic conditions. Over two years, we performed whole genome sequencing (WGS) in 290 families, with average time to result of 2.9 days, and diagnostic yield of 47%. We performed additional bioinformatic analyses and transcriptome sequencing in all patients who remained undiagnosed. Long-read sequencing and functional assays, ranging from clinically accredited enzyme analysis to bespoke quantitative proteomics, were deployed in selected cases. This resulted in an additional 19 diagnoses, and an overall diagnostic yield of 54%. Diagnostic variants ranged from structural chromosomal abnormalities through to an intronic retrotransposon, disrupting splicing. Critical care management changed in 120 diagnosed patients (77%). Results informed precision treatments; surgical and transplant decisions; and palliation in 94 (60%). We propose that integration of multi-omic approaches into mainstream diagnostic practice is necessary to realise the full potential of genomic testing.
Project description:The human neural retina is enriched for alternative splicing, and it is estimated that more than 10% of variants associated with inherited retinal diseases (IRDs) alter splicing. Previous research mainly used short-read RNA-sequencing techniques to investigate retina-specific splicing and splicing factors. However, this technique provides limited information about transcript isoforms. To gain a deeper understanding of the human neural retina and its isoforms, we generated a proteogenomic atlas that combined PacBio long-read RNA-sequencing data with mass-spectrometry and whole-genome sequencing data from three healthy human neural retina samples. RNA-sequencing revealed that one-third of all transcripts were novel, and for IRD-associated genes, even 43% were novel. The most common novel elements of these transcripts were alternative poly(A) sites, exon elongation, and intron retention. Some novel elements affect the non-coding region but for more than 50% of the novel transcripts a novel open reading frame was predicted. Using proteomics, ten novel peptides confirmed novel isoforms in five genes. Additionally, we found novel isoforms of IMPDH1, an IRD-associated gene, with supporting peptide evidence. This study provides a comprehensive overview of the transcript and protein isoforms expressed in the healthy human neural retina. Moreover, it highlights the importance of studying tissue specific transcriptomes in greater detail to better understand tissue-specific regulation and to identify disease-causing variants.