Unknown

Dataset Information

0

Biomarker discovery studies for patient stratification using machine learning analysis of omics data: a scoping review.


ABSTRACT:

Objective

To review biomarker discovery studies using omics data for patient stratification which led to clinically validated FDA-cleared tests or laboratory developed tests, in order to identify common characteristics and derive recommendations for future biomarker projects.

Design

Scoping review.

Methods

We searched PubMed, EMBASE and Web of Science to obtain a comprehensive list of articles from the biomedical literature published between January 2000 and July 2021, describing clinically validated biomarker signatures for patient stratification, derived using statistical learning approaches. All documents were screened to retain only peer-reviewed research articles, review articles or opinion articles, covering supervised and unsupervised machine learning applications for omics-based patient stratification. Two reviewers independently confirmed the eligibility. Disagreements were solved by consensus. We focused the final analysis on omics-based biomarkers which achieved the highest level of validation, that is, clinical approval of the developed molecular signature as a laboratory developed test or FDA approved tests.

Results

Overall, 352 articles fulfilled the eligibility criteria. The analysis of validated biomarker signatures identified multiple common methodological and practical features that may explain the successful test development and guide future biomarker projects. These include study design choices to ensure sufficient statistical power for model building and external testing, suitable combinations of non-targeted and targeted measurement technologies, the integration of prior biological knowledge, strict filtering and inclusion/exclusion criteria, and the adequacy of statistical and machine learning methods for discovery and validation.

Conclusions

While most clinically validated biomarker models derived from omics data have been developed for personalised oncology, first applications for non-cancer diseases show the potential of multivariate omics biomarker design for other complex disorders. Distinctive characteristics of prior success stories, such as early filtering and robust discovery approaches, continuous improvements in assay design and experimental measurement technology, and rigorous multicohort validation approaches, enable the derivation of specific recommendations for future studies.

SUBMITTER: Glaab E 

PROVIDER: S-EPMC8650485 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

2020-09-01 | E-MTAB-9501 | biostudies-arrayexpress
| S-EPMC4315157 | biostudies-literature
| S-EPMC6438573 | biostudies-literature
| S-EPMC6731682 | biostudies-literature
| S-EPMC6052089 | biostudies-literature
| S-EPMC7683339 | biostudies-literature
| S-EPMC8450182 | biostudies-literature
| S-EPMC4512773 | biostudies-literature
| S-EPMC7689198 | biostudies-literature
| S-EPMC4281796 | biostudies-literature