Dataset Information

A machine learned classifier that uses gene expression data to accurately predict estrogen receptor status.


ABSTRACT:

BACKGROUND: Selecting the appropriate treatment for breast cancer requires accurately determining the estrogen receptor (ER) status of the tumor. However, the standard for determining this status, immunohistochemical (IHC) analysis of formalin-fixed, paraffin-embedded samples, suffers from numerous technical and reproducibility issues. Assessment of ER-status based on RNA expression can provide more objective, quantitative and reproducible test results.

METHODS: To learn a parsimonious RNA-based classifier of hormone receptor status, we applied a machine learning tool to a training dataset of gene expression microarray data obtained from 176 frozen breast tumors, whose ER-status was determined by applying ASCO-CAP guidelines to standardized immunohistochemical testing of formalin-fixed tumor.

RESULTS: This produced a three-gene classifier that can predict the ER-status of a novel tumor, with a cross-validation accuracy of 93.17±2.44%. When applied to an independent validation set and to four other public databases, some on different platforms, this classifier obtained over 90% accuracy in each. In addition, we found that this prediction rule separated the patients' recurrence-free survival curves with a hazard ratio lower than the one based on the IHC analysis of ER-status.

CONCLUSIONS: Our efficient and parsimonious classifier lends itself to highly accurate and low-cost RNA-based assessments of ER-status, suitable for routine high-throughput clinical use. This analytic method provides a proof-of-principle that may be applicable to developing effective RNA-based tests for other biomarkers and conditions.
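To make the reported "cross-validation accuracy" figure concrete, the following is a minimal sketch of how a mean ± spread over folds can be computed. Everything in it is an assumption for illustration only: the synthetic one-feature data, the midpoint-threshold rule, the choice of 5 folds, the use of the sample standard deviation as the spread, and all helper names are hypothetical. It is not the authors' learning tool, their three genes, or their data; only the sample count of 176 is taken from the abstract.

```python
import random
import statistics


def make_synthetic(n, seed=0):
    """Hypothetical stand-in for expression data: one value per tumor,
    drawn around +2 for label 1 and around -2 for label 0."""
    rng = random.Random(seed)
    data = []
    for i in range(n):
        label = i % 2
        value = rng.gauss(2.0 if label else -2.0, 1.0)
        data.append((value, label))
    rng.shuffle(data)
    return data


def fit_threshold(train):
    """Toy classifier (an assumption, not the paper's method):
    the midpoint between the two class means."""
    pos = [x for x, y in train if y == 1]
    neg = [x for x, y in train if y == 0]
    return (statistics.mean(pos) + statistics.mean(neg)) / 2


def accuracy(threshold, test):
    """Fraction of test samples whose predicted label matches the true one."""
    correct = sum((x > threshold) == bool(y) for x, y in test)
    return correct / len(test)


def cross_validate(data, k=5):
    """k-fold cross-validation: train on k-1 folds, score the held-out fold,
    then report the mean and sample standard deviation of the fold scores."""
    folds = [data[i::k] for i in range(k)]
    scores = []
    for i in range(k):
        test = folds[i]
        train = [s for j, fold in enumerate(folds) if j != i for s in fold]
        scores.append(accuracy(fit_threshold(train), test))
    return statistics.mean(scores), statistics.stdev(scores)


mean_acc, sd_acc = cross_validate(make_synthetic(176))
print(f"CV accuracy: {100 * mean_acc:.2f} ± {100 * sd_acc:.2f}%")
```

The key point of the design is that each fold is scored by a classifier that never saw it during fitting, so the averaged figure estimates performance on new tumors rather than on the training samples.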

SUBMITTER: Bastani M 

PROVIDER: S-EPMC3846850 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

Publications

A machine learned classifier that uses gene expression data to accurately predict estrogen receptor status.

Meysam Bastani, Larissa Vos, Nasimeh Asgarian, Jean Deschenes, Kathryn Graham, John Mackey, Russell Greiner

PLoS One, 2013-12-02, issue 12


Similar Datasets

Date | Accession | Repository
| S-EPMC5759031 | biostudies-literature
2013-01-01 | E-GEOD-29210 | biostudies-arrayexpress
2013-01-01 | GSE29210 | GEO
| S-ECPF-GEOD-29210 | biostudies-other
| S-EPMC3716652 | biostudies-literature
| S-EPMC7327696 | biostudies-literature
2009-05-21 | GSE15885 | GEO
2010-05-18 | E-GEOD-15885 | biostudies-arrayexpress
| S-ECPF-GEOD-15885 | biostudies-other
| S-EPMC7855939 | biostudies-literature