Unknown

Dataset Information

0

An evaluation protocol for subtype-specific breast cancer event prediction.


ABSTRACT: In recent years increasing evidence appeared that breast cancer may not constitute a single disease at the molecular level, but comprises a heterogeneous set of subtypes. This suggests that instead of building a single monolithic predictor, better predictors might be constructed that solely target samples of a designated subtype, which are believed to represent more homogeneous sets of samples. An unavoidable drawback of developing subtype-specific predictors, however, is that a stratification by subtype drastically reduces the number of samples available for their construction. As numerous studies have indicated sample size to be an important factor in predictor construction, it is therefore questionable whether the potential benefit of subtyping can outweigh the drawback of a severe loss in sample size. Factors like unequal class distributions and differences in the number of samples per subtype, further complicate comparisons. We present a novel experimental protocol that facilitates a comprehensive comparison between subtype-specific predictors and predictors that do not take subtype information into account. Emphasis lies on careful control of sample size as well as class and subtype distributions. The methodology is applied to a large breast cancer compendium involving over 1500 arrays, using a state-of-the-art subtyping scheme. We show that the resulting subtype-specific predictors outperform those that do not take subtype information into account, especially when taking sample size considerations into account.

SUBMITTER: Sontrop HM 

PROVIDER: S-EPMC3132736 | biostudies-literature | 2011

REPOSITORIES: biostudies-literature

altmetric image

Publications

An evaluation protocol for subtype-specific breast cancer event prediction.

Sontrop Herman M J HM   Verhaegh Wim F J WF   Reinders Marcel J T MJ   Moerland Perry D PD  

PloS one 20110708 7


In recent years increasing evidence appeared that breast cancer may not constitute a single disease at the molecular level, but comprises a heterogeneous set of subtypes. This suggests that instead of building a single monolithic predictor, better predictors might be constructed that solely target samples of a designated subtype, which are believed to represent more homogeneous sets of samples. An unavoidable drawback of developing subtype-specific predictors, however, is that a stratification b  ...[more]

Similar Datasets

| S-EPMC8497829 | biostudies-literature
| S-EPMC7151639 | biostudies-literature
| S-ECPF-GEOD-37614 | biostudies-other
| S-EPMC3286973 | biostudies-literature
| S-EPMC8599309 | biostudies-literature
| S-EPMC6026876 | biostudies-literature
| S-EPMC3505468 | biostudies-literature
| S-EPMC6167771 | biostudies-literature
| S-EPMC3206184 | biostudies-literature
| S-EPMC7297326 | biostudies-literature