Dataset Information

ABSTRACT: Inference of drowning sites using aquatic bacterial composition and random forest algorithm

PROVIDER: PRJNA962514 | ENA

REPOSITORIES: ENA

ACCESS DATA

Dataset's files

	File	Format
	SRR24322925_1.fastq.gz	Fastqsanger.gz
	SRR24322925_2.fastq.gz	Fastqsanger.gz
	SRR24322926_1.fastq.gz	Fastqsanger.gz
	SRR24322926_2.fastq.gz	Fastqsanger.gz
	SRR24322927_1.fastq.gz	Fastqsanger.gz

Showing files 1-5 of 88
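
The file names in the listing appear to follow a paired-end convention, with a run accession followed by `_1` or `_2`. As a minimal sketch, assuming that `<run>_<mate>.fastq.gz` pattern holds for every file, the listing can be grouped by run to check which runs have both mates. The helper names `group_by_run` and `incomplete_runs` are hypothetical and not part of any OmicsDI or ENA tooling.

```python
import re
from collections import defaultdict

# Assumed naming pattern, inferred from the listing: "<run>_<mate>.fastq.gz"
FASTQ_NAME = re.compile(r"^(?P<run>[A-Z]+\d+)_(?P<mate>[12])\.fastq\.gz$")


def group_by_run(filenames):
    """Map each run accession to a {mate_number: filename} dictionary."""
    runs = defaultdict(dict)
    for name in filenames:
        match = FASTQ_NAME.match(name)
        if match is None:
            continue  # skip names that do not fit the assumed pattern
        runs[match["run"]][int(match["mate"])] = name
    return dict(runs)


def incomplete_runs(runs):
    """Return the sorted run accessions that lack mate 1 or mate 2."""
    return sorted(run for run, mates in runs.items() if set(mates) != {1, 2})


# The five file names visible on the first page of the listing.
listed = [
    "SRR24322925_1.fastq.gz",
    "SRR24322925_2.fastq.gz",
    "SRR24322926_1.fastq.gz",
    "SRR24322926_2.fastq.gz",
    "SRR24322927_1.fastq.gz",
]

runs = group_by_run(listed)
missing = incomplete_runs(runs)
print(missing)
```

For the five names shown here, only `SRR24322927` is reported as incomplete, most likely because the page displays just the first 5 of 88 files.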

Similar Datasets

Project description:

Objectives: Our goal was to evaluate the diagnostic value of DNA methylation analysis in combination with machine learning to differentiate pleural mesothelioma (PM) from important histopathological mimics.

Material and methods: DNA methylation data of PM, lung adenocarcinomas, lung squamous cell carcinomas and chronic pleuritis were used to train a random forest as well as a support vector machine. These classifiers were validated using an independent validation cohort including pleural carcinosis and pleomorphic variants of lung adeno- and squamous cell carcinomas. Furthermore, we used a deconvolution method to estimate the composition of the tumor microenvironment.

Results: T-distributed stochastic neighbor embedding clearly separated PM from lung adenocarcinomas and squamous cell carcinomas, but there was a considerable overlap between chronic pleuritis specimens and PM with low tumor cell content. While both machine learning algorithms achieved comparable accuracies in a nested cross validation on the training cohort (random forest: 94.9%; support vector machine: 95.5%), the support vector machine outperformed the random forest in distinguishing PM from chronic pleuritis. Differential methylation analysis revealed promoter hypermethylation in PM specimens, including the tumor suppressor genes BCL11B, EBF1, FOXA1, and WNK2. Furthermore, we observed comparable accuracies for the support vector machine on the validation cohort (97.1%) while the random forest performed considerably worse (89.9%). Deconvolution of the stromal and immune cell composition revealed higher rates of regulatory T-cells and endothelial cells in tumor specimens and a heterogeneous inflammation including macrophages, B-cells and natural killer cells in chronic pleuritis.

Conclusion: DNA methylation in combination with machine learning is a promising tool to reliably differentiate PM from chronic pleuritis and lung cancer, including pleomorphic carcinomas. Furthermore, our study highlights new candidate genes for PM carcinogenesis and shows that deconvolution of DNA methylation data can provide reasonable insights into the composition of the tumor microenvironment.


OmicsDI is part of the ELIXIR infrastructure
