Project description:We apply hierarchical clustering (HC) of DNA k-mer counts on multiple Fastq files. The tree structures produced by HC may reflect experimental groups and thereby indicate experimental effects, but clustering of preparation groups indicates the presence of batch effects. Hence, HC of DNA k-mer counts may serve as a diagnostic device. In order to provide a simple applicable tool we implemented sequential analysis of Fastq reads with low memory usage in an R package (seqTools) available on Bioconductor. The approach is validated by analysis of Fastq file batches containing RNAseq data. Analysis of three Fastq batches downloaded from ArrayExpress indicated experimental effects. Analysis of RNAseq data from two cell types (dermal fibroblasts and Jurkat cells) sequenced in our facility indicate presence of batch effects. The observed batch effects were also present in reads mapped to the human genome and also in reads filtered for high quality (Phred > 30). We propose, that hierarchical clustering of DNA k-mer counts provides an unspecific diagnostic tool for RNAseq experiments. Further exploration is required once samples are identified as outliers in HC derived trees.
Project description:Kilian2024 - Immune cell dynamics in Cue-Induced Extended Human Colitis Model
Single-cell technologies such as scRNA-seq and flow cytometry provide critical insights into immune cell behavior in inflammatory bowel disease (IBD). However, integrating these datasets into computational models for dynamic analysis remains challenging. Here, Kilian et al., (2024) developed a deterministic ODE-based model that incorporates these technologies to study immune cell population changes in murine colitis. The model parameters were optimized to fit experimental data, ensuring an accurate representation of immune cell behavior over time. It was then validated by comparing simulations with experimental data using Pearson’s correlation and further tested on independent datasets to confirm its robustness. Additionally, the model was applied to clinical bulk RNA-seq data from human IBD patients, providing valuable insights into immune system dynamics and potential therapeutic strategies.
Figure 4c, obtained from the simulation of human colitis model is highlighted here.
This model is described in the article:
Kilian, C., Ulrich, H., Zouboulis, V.A. et al. Longitudinal single-cell data informs deterministic modelling of inflammatory bowel disease. npj Syst Biol Appl 10, 69 (2024). https://doi.org/10.1038/s41540-024-00395-9
Abstract:
Single-cell-based methods such as flow cytometry or single-cell mRNA sequencing (scRNA-seq) allow deep molecular and cellular profiling of immunological processes. Despite their high throughput, however, these measurements represent only a snapshot in time. Here, we explore how longitudinal single-cell-based datasets can be used for deterministic ordinary differential equation (ODE)-based modelling to mechanistically describe immune dynamics. We derived longitudinal changes in cell numbers of colonic cell types during inflammatory bowel disease (IBD) from flow cytometry and scRNA-seq data of murine colitis using ODE-based models. Our mathematical model generalised well across different protocols and experimental techniques, and we hypothesised that the estimated model parameters reflect biological processes. We validated this prediction of cellular turnover rates with KI-67 staining and with gene expression information from the scRNA-seq data not used for model fitting. Finally, we tested the translational relevance of the mathematical model by deconvolution of longitudinal bulk mRNA-sequencing data from a cohort of human IBD patients treated with olamkicept. We found that neutrophil depletion may contribute to IBD patients entering remission. The predictive power of IBD deterministic modelling highlights its potential to advance our understanding of immune dynamics in health and disease.
This model was curated during the Hackathon hosted by BioMed X GmbH in 2024.
Project description:Clear cell renal cell carcinoma (ccRCC) is the most common form of kidney cancer. Following primary tumour resection approximately 30% of patients experience disease recurrence associated with metastasis. To date, long-read RNA sequencing has not been applied to kidney cancer. Here, we used ONT long-read Direct RNA sequencing to profile the transcriptomes of ccRCC archival tumours, 6 of which were from patients who went on to relapse. Our results revealed a loss of immune infiltrate in tumours of patients who relapse. Moreover, thousands of novel isoforms were discovered, including a novel PD-L1 transcript encoding for the soluble version of the protein but having a longer 3'UTR than the currently annotated transcript. Finally, we have identified a novel non-coding gene that was over-expressed in patients who experience recurrence. Our data shows that DRS can be used in archival tumour samples to comprehensively characterise tumour transcriptomes, and to reveal novel features that would have been missed by short-read RNAseq.