Browse
Submit Data
Databases
API
Help

Dataset Information

0 Views

0 Connections

0 Citations

0 Reanalyses

0 Downloads

Omics score: 0

ABSTRACT: A Random Forest Algorithm Based on a cgMLST Scheme to Predict hvKP

PROVIDER: PRJEB34922 | ENA |

REPOSITORIES: ENA

ACCESS DATA

Json Xml

Dataset's files

Source:

			Action	DRS
		Other

Items per page:

1 - 1 of 1

Similar Datasets

Robust methylation based classification of brain tumors using nanopore sequencing

Project description:Using a public reference data set of 82 unique entities, 382 nanopore-sequenced brain tumor samples were classified based on their methylation status through an ad hoc random forest algorithm. As a measure of confidence, score recalibration was performed and platform-specific thresholds were defined.

2022-07-30 | GSE209865 | GEO

Patterson2022 - Tumour mutation data driven Random Forest model to predict immune checkpoint inhibitor therapy benefit in metastatic melanoma

Project description:A Random Forest model is developed to incorporate tumor mutation data within the context of the biological process known as leukocyte proliferation regulation. This model aims to predict a patient's response to anti-PD1 treatment. The authors conducted experiments using four different types of classifiers: Random Forest, Gradient Boosting, Feed Forward Neural Network, and Long Short-Term Memory (LSTM) recurrent neural network. Among these classifiers, the Random Forest algorithm yielded the best predictive performance when modeling gene mutation data associated with the 'leukocyte proliferation regulation' biological process. Hence, this curated version of the model focuses on the Random Forest model trained specifically on the 'Leukocyte Proliferation Regulation' process. In this model, a value of '0' is assigned to NonResponders, while a value of '1' is assigned to Responders. Please note that to obtain predictions, users should provide mutation data containing only the genes corresponding to the 'GO_REGULATION_OF_LEUKOCYTE_PROLIFERATION' process keyword, as specified in the 'GO_test_genes_dict_intersection' dictionary.

2023-07-03 | BIOMD0000001073 | BioModels

Deep Learning Predicts Peptide Transmission Profiles through FAIMS Directly from Sequence

Project description:Peptide ion mobility adds an extra dimension of separation to mass spectrometry-based proteomics. The ability to accurately pre-dict peptide ion mobility would be useful to expedite assay development and to discriminate true answers in database search. There are methods to accurately predict peptide ion mobility through drift tube devices, but methods to predict mobility through high-field asymmetric waveform ion mobility (FAIMS) are underexplored. Here, we successfully model peptide ions’ FAIMS mobility using a multi-label multi-output classification scheme to account for non-normal transmission distributions. We trained two models from over 100,000 human peptide precursors: a random forest and a long-term short-term memory (LSTM) neural network. Both models had different strengths, and the ensemble average of model predictions produced higher F2 score than either model alone. Finally, we explore cases where the models make mistakes, and demonstrate predictive performance of F2=0.66 (AUROC=0.928) on a new test dataset of nearly 40,000 different E. coli peptide ions.

2025-01-16 | PXD055252 | Pride

Liu2023 - Predicting the efficacy of immune checkpoint inhibitors monotherapy in advanced non-small cell lung cancer: a machine learning method based on multidimensional data

Project description:Immunotherapy has improved the prognosis of patients with advanced non-small cell lung cancer (NSCLC), but only a small subset of patients achieved clinical benefit. The purpose of our study was to integrate multidimensional data using a machine learning method to predict the therapeutic efficacy of immune checkpoint inhibitors (ICIs) monotherapy in patients with advanced NSCLC.The authors retrospectively enrolled 112 patients with stage IIIB-IV NSCLC receiving ICIs monotherapy. The random forest (RF) algorithm was used to establish efficacy prediction models based on five different input datasets, including precontrast computed tomography (CT) radiomic data, postcontrast CT radiomic data, combination of the two CT radiomic data, clinical data, and a combination of radiomic and clinical data. The 5-fold cross-validation was used to train and test the random forest classifier. The performance of the models was assessed according to the area under the curve (AUC) in the receiver operating characteristic (ROC) curve. Among these models(RF MLP LR XGBoost), our reproduced onnx models have better performance, especially for random forest. The response variable with a value (1/0) indicates the (efficacy/inefficacy) of PD-1/PD-L1 monotherapy in patients with advanced NSCLC

2023-07-11 | BIOMD0000001074 | BioModels

Deshpande2019 - Random Forest model to predict long non-coding RNAs from coding RNAs in Zea Mays plant transcriptomic data

Project description:This is a Random Forest algorithm-based machine learning model to predict lncRNAs from coding mRNAs in plant transcriptomic data. The model assigns 1 for coding sequences and 2 for long non-coding sequences. The prediction is performed using a combination of Open Reading Frame (ORF) based, Sequence-based and Codon-bias features. Users need to download the curated ONNX model and also need to convert the sequences into feature matrix as mentioned in PLIT paper (Deshpande et al. 2019) to make predictions on sequences from Zea Mays sequence data.

2024-12-09 | BIOMD0000001067 | BioModels

Homo sapiens

Project description:A Random-Forest Based Algorithm for Prediction of Enhancers From Histone Modifications

| PRJNA165163 | ENA

Pseudomonas aeruginosa cgMLST scheme development

Project description:Pseudomonas aeruginosa cgMLST scheme development

| PRJEB38241 | ENA

Transcription profiling of acute lymphoblastic leukaemia patient samples that represent six different subgroups defined by cytogenetic features and immunophenotype

Project description:We examined published microarray data from 104 acute lymphoblastic leukaemia patient specimens, that represent six different subgroups defined by cytogenetic features and immunophenotypes. Using the decision-tree based supervised learning algorithm Random Forest (RF), we determined a small set of genes for optimal subgroup distinction and subsequently validated their predictive power in an independent cohort of 68 specimens that were assessed using Affymetrix HG-U133A arrays.

2007-05-11 | E-TABM-125 | biostudies-arrayexpress

tRForest: a novel random forest-based algorithm for tRNA-derived fragment target prediction

Project description:tRForest: a novel random forest-based algorithm for tRNA-derived fragment target prediction

| PRJNA783302 | ENA

Chowell2022 - Random Forest model to predict efficacy of immune checkpoint blockade across multiple cancer patient cohorts

Project description:This is a Random Forest algorithm-based machine learning model called RF16, which incorporates a total of 16 genomic, molecular, demographic, and clinical features to predict the immunotherapy response for a patient. The model assigns a value of 0 for NonResponder and 1 for Responder. Please be aware that the column names in the GitHub code and the downloaded dataset from the publication may vary. Users are advised to make minor adjustments to either the code or the dataset to ensure compatibility. The curated version of the model has modified the column names in the training code to align with the dataset. GitHub repository: https://github.com/CCF-ChanLab/MSK-IMPACT-IO

2023-05-09 | BIOMD0000001066 | BioModels

OmicsDI is part of the ELIXIR infrastructure

OmicsDI is an Elixir interoperability service. Learn more ›

Tweets

OmicsDI Databases

PRIDE
PeptideAtlas
MassIVE
JPOST Repository
Physiome Model Repository

EGA
EVA
ENA
LINCS
PAXDB
Cell Collective

MetaboLights
Metabolomics Workbench
MetabolomeExpress
GNPS
BioModels
FAIRDOMHub

ArrayExpress
dbGaP
ExpressionAtlas
GEO
NODE

Information

Databases
Help
API
Contact us
Code on GitHub
Terms of use
Submit Data