Project description:The paper "Metabolomic Machine Learning Predictor for Diagnosis and Prognosis of Gastric Cancer" addresses the need for non-invasive diagnostic tools for gastric cancer (GC). Traditional methods like endoscopy are invasive and expensive. The authors conducted a targeted metabolomics analysis of 702 plasma samples to develop machine learning models for GC diagnosis and prognosis. The diagnostic model, using 10 metabolites, achieved a sensitivity of 0.905, outperforming conventional protein marker-based methods. The prognostic model effectively stratified patients into risk groups, surpassing traditional clinical models.
I have successfully reproduced the diagnosis model from the paper. This machine learning-based system differentiates GC patients from non-GC controls using metabolomics data from plasma samples analyzed by liquid chromatography-mass spectrometry (LC-MS). The model focuses on 10 metabolites, including succinate, uridine, lactate, and serotonin. Employing LASSO regression and a random forest classifier, the model achieved an AUROC of 0.967, with a sensitivity of 0.854 and specificity of 0.926. This model significantly outperforms traditional diagnostic methods and underscores the potential of integrating machine learning with metabolomics for early GC detection and treatment.
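The three metrics quoted above (AUROC, sensitivity, specificity) can all be derived from true labels and predicted scores. The following is a minimal sketch in plain Python, not the paper's code. The function names, the 0.5 decision threshold, and the toy labels and scores are hypothetical illustrations and are unrelated to the study's data.

```python
from typing import Sequence, Tuple


def sensitivity_specificity(
    y_true: Sequence[int], y_score: Sequence[float], threshold: float = 0.5
) -> Tuple[float, float]:
    """Return (sensitivity, specificity) at a given score threshold.

    Labels are 1 for cases (e.g. GC) and 0 for controls; the threshold
    of 0.5 is an assumption made for illustration only.
    """
    tp = fn = tn = fp = 0
    for label, score in zip(y_true, y_score):
        predicted = 1 if score >= threshold else 0
        if label == 1:
            if predicted == 1:
                tp += 1
            else:
                fn += 1
        else:
            if predicted == 0:
                tn += 1
            else:
                fp += 1
    return tp / (tp + fn), tn / (tn + fp)


def auroc(y_true: Sequence[int], y_score: Sequence[float]) -> float:
    """AUROC as the probability that a random case outscores a random control.

    Ties between a case and a control count as half a win.
    """
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: four cases followed by four controls.
labels = [1, 1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.6, 0.3, 0.7, 0.4, 0.2, 0.1]

sens, spec = sensitivity_specificity(labels, scores)
print(f"sensitivity={sens:.3f} specificity={spec:.3f} AUROC={auroc(labels, scores):.4f}")
```

Sensitivity and specificity depend on the chosen threshold, whereas AUROC summarises ranking quality across all thresholds, which is why the two kinds of figures are reported side by side.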
Project description:We performed a retrospective study on CSF from 20 DMT-naïve MS patients to investigate the correlation between intrathecal immune proteins and clinical MS phenotype.
Project description:A total of 116 patients, 96 with treatment-naïve unresectable hepatocellular carcinoma (uHCC) and 20 with chronic liver disease without any cancer, were analysed for 17 serum cytokines and chemokines and for oncological features.
Project description:Purpose: We generated extensive transcriptional and proteomic profiles from a Her2-driven mouse model of breast cancer that closely recapitulates human breast cancer. This report makes these data publicly available in raw and processed forms, as a resource to the community. Importantly, we previously made biospecimens from this same mouse model freely available through a sample repository, so researchers can obtain samples to test biological hypotheses without the need of breeding animals and collecting biospecimens. Experimental design: Twelve datasets are available, encompassing 841 LC-MS/MS experiments (plasma and tissues) and 255 microarray analyses of multiple tissues (thymus, spleen, liver, blood cells, and breast). Cases and controls were rigorously paired to avoid bias. Results: In total, 18,880 unique peptides were identified (PeptideProphet peptide error rate ≤1%), with 3884 and 1659 non-redundant protein groups identified in plasma and tissue datasets, respectively. Sixty-one of these protein groups overlapped between cancer plasma and cancer tissue. Conclusions and clinical relevance: These data are of use for advancing our understanding of cancer biology, for software and quality control tool development, investigations of analytical variation in MS/MS data, and selection of proteotypic peptides for MRM-MS. The availability of these datasets will contribute positively to clinical proteomics.
Project description:A non-invasive diagnostic test does not exist for acute graft versus host disease (aGVHD). We therefore sought to identify biomarkers for aGVHD using antibody microarrays (Schleicher and Schuell Serum Biomarker Chips, Whatman) that simultaneously assayed 120 plasma proteins. We measured these proteins in a set of 42 patient plasma samples following an allogeneic bone marrow transplant (BMT): 21 patients with a diagnosis of aGVHD grade II-IV (+GVHD) and 21 patients without aGVHD (–GVHD) at similar times after transplant. We excluded data from 2 hybridizations that had very bright dots and appeared as outliers in preliminary principal components analysis, so that we finally compared 20 +GVHD to 20 –GVHD samples. Keywords: disease state analysis, antibody microarray
Project description:Purpose: We generated extensive transcriptional and proteomic profiles from a Her2-driven mouse model of breast cancer that closely recapitulates human breast cancer. This report makes these data publicly available in raw and processed forms, as a resource to the community. Importantly, we previously made biospecimens from this same mouse model freely available through a sample repository, so researchers can obtain samples to test biological hypotheses without the need of breeding animals and collecting biospecimens. Experimental design: Twelve datasets are available, encompassing 841 LC-MS/MS experiments (plasma and tissues) and 255 microarray analyses of multiple tissues (thymus, spleen, liver, blood cells, and breast). Cases and controls were rigorously paired to avoid bias. Results: In total, 18,880 unique peptides were identified (PeptideProphet peptide error rate ≤1%), with 3884 and 1659 non-redundant protein groups identified in plasma and tissue datasets, respectively. Sixty-one of these protein groups overlapped between cancer plasma and cancer tissue. Conclusions and clinical relevance: These data are of use for advancing our understanding of cancer biology, for software and quality control tool development, investigations of analytical variation in MS/MS data, and selection of proteotypic peptides for MRM-MS. The availability of these datasets will contribute positively to clinical proteomics. Custom Agilent 44K whole mouse genome expression oligonucleotide microarrays were used to profile breast tumors from three Her2/Neu mice compared to normal breast epithelium from two control mice transgenic for TetO-NeuNT only and littermates of the bitransgenic mice. All samples were laser-capture microdissected and total RNA isolated and amplified prior to hybridization against a reference pool of normal adult mouse tissues.
Project description:This dataset contains peptide array information from 120 patients from 5 different cancer types using a classic blinded test/train method. This array is library 1 (GPL17600). A 1:500 dilution of human serum is added to a peptide array (GPL17600). This array is a two-up design, with 10420 peptides printed on the top and bottom of a standard glass microscope slide. Samples were run in duplicate. The average of the duplicates is listed here. 20 train and 20 blinded test samples were run.
Project description:Purpose: We generated extensive transcriptional and proteomic profiles from a Her2-driven mouse model of breast cancer that closely recapitulates human breast cancer. This report makes these data publicly available in raw and processed forms, as a resource to the community. Importantly, we previously made biospecimens from this same mouse model freely available through a sample repository, so researchers can obtain samples to test biological hypotheses without the need of breeding animals and collecting biospecimens. Experimental design: Twelve datasets are available, encompassing 841 LC-MS/MS experiments (plasma and tissues) and 255 microarray analyses of multiple tissues (thymus, spleen, liver, blood cells, and breast). Cases and controls were rigorously paired to avoid bias. Results: In total, 18,880 unique peptides were identified (PeptideProphet peptide error rate ≤1%), with 3884 and 1659 non-redundant protein groups identified in plasma and tissue datasets, respectively. Sixty-one of these protein groups overlapped between cancer plasma and cancer tissue. Conclusions and clinical relevance: These data are of use for advancing our understanding of cancer biology, for software and quality control tool development, investigations of analytical variation in MS/MS data, and selection of proteotypic peptides for MRM-MS. The availability of these datasets will contribute positively to clinical proteomics. Affymetrix GeneChip Mouse Genome 430 2.0 microarrays were used to profile whole tissues from 5 different tissue types of 25 tumor-bearing and 25 control mice of the Her2/Neu breast cancer mouse model. The 5 tissues tested were from breast, liver, spleen, blood cell, and thymus. The tumor-bearing mice were bitransgenic for MMTV-rtTA/TetO-NeuNT, and the control mice were transgenic for TetO-NeuNT only. The control mice were age- and cage-matched to the tumor-bearing mice.
All samples were lysed and total RNA isolated and amplified prior to hybridization.
Project description:Small RNA and degradome sequencing was carried out on samples isolated from developing barley grains. The datasets were analysed to identify putative miRNAs and their target mRNAs.