Dataset Information

Identification of a minimum number of genes to predict triple negative breast cancer subgroups from gene expression profiles

ABSTRACT: Background: Triple-negative breast cancer (TNBC) is a very heterogeneous disease. Several gene expression and mutation profiling approaches were used to classify it and all converged to the identification of distinct molecular subtypes, with some overlapping across different approaches. However, a standardised tool to routinely classify TNBC in the clinics and guide personalised treatment is lacking. We aimed at defining a specific gene signature for each of the six TNBC subtypes proposed by Lehman et al. in 2011 (Basal-like 1 (BL1); Basal-like 2 (BL2); Mesenchymal (M); Immunomodulatory (IM); Mesenchymal Stem like (MSL) and Luminal androgen receptor (LAR)), to be able to accurately predict them. Methods: Lehman’s TNBCtype subtyping tool was applied to RNA-sequencing data from 482 TNBC (GSE164458) and a minimal subtype-specific gene signature was defined by combining two class comparison techniques with seven attribute selection methods. Several machine learning algorithms for subtype prediction were used and the best classifier was applied on microarray data from 72 Italian TNBC and on the TNBC subset of the BRCA-TCGA dataset. Results: we defined two signatures with the 120 and 81 top up- and down-regulated genes that define the six TNBC subtypes, with prediction accuracy ranging from 88,6% to 89,4%, and even improving after removal of the least important genes. Network analysis was used to identify highly interconnected genes within each subgroup. Two druggable matrix metalloproteinases were found in the BL1 and BL2 subsets, and several druggable targets complementary to androgen receptor or aromatase in the LAR subset. Several secondary drug-target interactions were found among the up-regulated genes in the M, IM and MSL subsets. Conclusions: Our study took full advantage of available TNBC datasets to stratify samples and genes into distinct subtypes, according to gene expression profiles. The development of a data mining approach to acquire a large amount of information from several datasets, has allowed us to identify a well-determined minimal number of genes that may help in the recognition of TNBC subtypes. These genes, most of which have been previously found to be associated with breast cancer, have the potential to become novel diagnostic markers and/or therapeutic targets for specific TNBC subsets.

ORGANISM(S): Homo sapiens

PROVIDER: GSE206912 | GEO | 2022/12/24

REPOSITORIES: GEO

ACCESS DATA

Dataset's files

Source:

			Action	DRS
		Other

Items per page:

1 - 1 of 1

Dataset Information

Identification of a minimum number of genes to predict triple negative breast cancer subgroups from gene expression profiles

Dataset's files

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Identifying High-Risk Triple-Negative Breast Cancer Patients by Molecular Subtyping
2021-07-31 | GSE167213 | GEO

Affymetrix SNP array data (Oncoscan CNV) for Fudan University Shanghai Cancer Center Triple Negative Breast Cancer (FUSCCTNBC) project
2019-03-09 | GSE118527 | GEO

Comprehensive genomic analysis identify novel subtypes and targets of triple-negative breast cancer (198 TNBC tumors)
2015-12-18 | E-GEOD-76124 | biostudies-arrayexpress

Identification of a Therapeutically Targetable JAK-STAT Enriched Androgen Receptor (AR) and AR Splice Variant Positive Triple Negative Breast Cancer Subtype [MDA-MB-453]
2023-10-31 | GSE245554 | GEO

Identification of a Therapeutically Targetable JAK-STAT Enriched Androgen Receptor (AR) and AR Splice Variant Positive Triple Negative Breast Cancer Subtype [project3]
2023-10-04 | GSE244272 | GEO

Identification of a Therapeutically Targetable JAK-STAT Enriched Androgen Receptor (AR) and AR Splice Variant Positive Triple Negative Breast Cancer Subtype [project4]
2023-10-04 | GSE244282 | GEO

Identification of a Therapeutically Targetable JAK-STAT Enriched Androgen Receptor (AR) and AR Splice Variant Positive Triple Negative Breast Cancer Subtype [Spacial]
2023-10-31 | GSE245202 | GEO

Identification of a Therapeutically Targetable JAK-STAT Enriched Androgen Receptor (AR) and AR Splice Variant Positive Triple Negative Breast Cancer Subtype [project2]
2023-10-04 | GSE244271 | GEO

Comprehensive genomic analysis identify novel subtypes and targets of triple-negative breast cancer (198 TNBC tumors)
2015-12-18 | GSE76124 | GEO

Comprehensive genomic analysis identify novel subtypes and targets of triple-negative breast cancer (67 not triple-negative tumors)
2015-12-23 | GSE76274 | GEO