Unknown

Dataset Information

0

Ensemble analyses improve signatures of tumour hypoxia and reveal inter-platform differences.


ABSTRACT:

Background

The reproducibility of transcriptomic biomarkers across datasets remains poor, limiting clinical application. We and others have suggested that this is in-part caused by differential error-structure between datasets, and their incomplete removal by pre-processing algorithms.

Methods

To test this hypothesis, we systematically assessed the effects of pre-processing on biomarker classification using 24 different pre-processing methods and 15 distinct signatures of tumour hypoxia in 10 datasets (2,143 patients).

Results

We confirm strong pre-processing effects for all datasets and signatures, and find that these differ between microarray versions. Importantly, exploiting different pre-processing techniques in an ensemble technique improved classification for a majority of signatures.

Conclusions

Assessing biomarkers using an ensemble of pre-processing techniques shows clear value across multiple diseases, datasets and biomarkers. Importantly, ensemble classification improves biomarkers with initially good results but does not result in spuriously improved performance for poor biomarkers. While further research is required, this approach has the potential to become a standard for transcriptomic biomarkers.

SUBMITTER: Fox NS 

PROVIDER: S-EPMC4061774 | biostudies-literature | 2014 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

Ensemble analyses improve signatures of tumour hypoxia and reveal inter-platform differences.

Fox Natalie S NS   Starmans Maud H W MH   Haider Syed S   Lambin Philippe P   Boutros Paul C PC  

BMC bioinformatics 20140606


<h4>Background</h4>The reproducibility of transcriptomic biomarkers across datasets remains poor, limiting clinical application. We and others have suggested that this is in-part caused by differential error-structure between datasets, and their incomplete removal by pre-processing algorithms.<h4>Methods</h4>To test this hypothesis, we systematically assessed the effects of pre-processing on biomarker classification using 24 different pre-processing methods and 15 distinct signatures of tumour h  ...[more]

Similar Datasets

| S-EPMC8146304 | biostudies-literature
| S-EPMC5153655 | biostudies-literature
| S-EPMC5841351 | biostudies-literature
| S-EPMC8104997 | biostudies-literature
| S-EPMC8012204 | biostudies-literature
| S-EPMC5659840 | biostudies-literature
| S-EPMC10227023 | biostudies-literature
| S-EPMC4569581 | biostudies-literature
| S-EPMC4075047 | biostudies-literature