Project description:Background: Breast cancer is a heterogeneous disease at the clinical and molecular level. In this study we integrate classifications extracted from five different molecular levels in order to identify integrated subtypes. Methods: Tumor tissue from 425 primary breast cancer patients of the Oslo2 study was cut and blended before being divided into fractions for DNA-, RNA- and protein isolation, and metabolomics, allowing for representative and comparable molecular data. Patients were stratified into groups based on their tumor characteristics from five different molecular levels, using various clustering methods. Finally, all previously identified and newly determined subgroups were combined in a multilevel classification using a “cluster-of-clusters” approach with consensus clustering. Results: Based on DNA copy number data, tumors were scored into three groups according to the complex arm aberration index. mRNA expression profiles divided tumors into five molecular subgroups according to PAM50 subtyping, and clustering based on microRNA expression revealed four subgroups. Reverse-phase protein array data divided tumors into five subgroups. Hierarchical clustering of tumor metabolic profiles revealed three clusters. Combining DNA copy number and mRNA expression classified tumors into seven clusters based on pathway activity levels, and using integrative clustering, tumors were classified into ten subtypes. The final consensus clustering incorporating all the aforementioned subtypes revealed six major groups. Five corresponded well with the mRNA subtypes, while a sixth group resulted from a split of the luminal A subtype where these tumors belonged to distinct microRNA clusters. Gain-of-function studies using MCF-7 cells showed that microRNAs differentially expressed between the luminal A clusters were important for cancer cell survival. These microRNAs were used to validate the split in luminal A tumors in four independent breast cancer cohorts. In two cohorts the microRNAs divided tumors into subgroups with significant different outcome, and in another a trend was observed. Conclusions: The herein identified six integrated subtypes confirm the heterogeneity of breast cancer and show that finer subdivisions of subtypes are evident. Increasing the knowledge of the luminal A subtype heterogeneity may add pivotal information guiding therapeutic choices, evidently bringing us closer to more personalized treatment for this largest subgroup of breast cancer.
Project description:Background: Breast cancer is a heterogeneous disease at the clinical and molecular level. In this study we integrate classifications extracted from five different molecular levels in order to identify integrated subtypes. Methods: Tumor tissue from 425 primary breast cancer patients of the Oslo2 study was cut and blended before being divided into fractions for DNA-, RNA- and protein isolation, and metabolomics, allowing for representative and comparable molecular data. Patients were stratified into groups based on their tumor characteristics from five different molecular levels, using various clustering methods. Finally, all previously identified and newly determined subgroups were combined in a multilevel classification using a “cluster-of-clusters” approach with consensus clustering. Results: Based on DNA copy number data, tumors were scored into three groups according to the complex arm aberration index. mRNA expression profiles divided tumors into five molecular subgroups according to PAM50 subtyping, and clustering based on microRNA expression revealed four subgroups. Reverse-phase protein array data divided tumors into five subgroups. Hierarchical clustering of tumor metabolic profiles revealed three clusters. Combining DNA copy number and mRNA expression classified tumors into seven clusters based on pathway activity levels, and using integrative clustering, tumors were classified into ten subtypes. The final consensus clustering incorporating all the aforementioned subtypes revealed six major groups. Five corresponded well with the mRNA subtypes, while a sixth group resulted from a split of the luminal A subtype where these tumors belonged to distinct microRNA clusters. Gain-of-function studies using MCF-7 cells showed that microRNAs differentially expressed between the luminal A clusters were important for cancer cell survival. These microRNAs were used to validate the split in luminal A tumors in four independent breast cancer cohorts. In two cohorts the microRNAs divided tumors into subgroups with significant different outcome, and in another a trend was observed. Conclusions: The herein identified six integrated subtypes confirm the heterogeneity of breast cancer and show that finer subdivisions of subtypes are evident. Increasing the knowledge of the luminal A subtype heterogeneity may add pivotal information guiding therapeutic choices, evidently bringing us closer to more personalized treatment for this largest subgroup of breast cancer.
Project description:Accurate characterization and understanding of the breast cancer subtypes is of crucial clinical importance to the heterogeneity of this disease. Several layers of information, including immunohistochemical markers, mRNA and microRNA expression profiles, and pathway analysis have been used for such purpose in several studies. However, a comprehensively integrative approach is currently missing. This paper provides microRNA and mRNA expression profiles, characterizing four breast tumor subtypes, as defined by four immunohistochemical markers. The defined sets of features were validated in two independent data sets at multiple levels, including unsupervised clustering and supervised classification. Moreover, the gene expression signatures of the tumor subtypes were screened by in-depth analysis of 12 cancer core pathways. We successfully identified and validated a novel breast cancer subtypes gene expression signature composed of 976 mRNAs and 69 miRNAs. Luminal and non-luminal tumors are shown to significantly differ both at the mRNA and miRNA levels. HER2 positive tumors are more closely related to triple negative tumors by mRNA profiling than by miRNA expression. Closely related miRNAs sharing the same targets may exert opposite roles during tumor progression. Besides the core cancer pathways, other pathways such as those controling biomass synthesis are shown to be important to enable the core basal subtype with additional progressive nature compared with the other triple negative tumors. Some therapeutic strategies are proposed for breast cancer treatment, including the combined blockage of MAPK/ERK and PI3K/Pten signalings for tumors with poor clinical outcome, and targeting Wnt and JAK/STAT and/or Hedgehog, depending on tumor subtypes, together with conventional chemotherapy with the purpose of achieving an eradicative outcome. The pathway analysis also reveals that the clinical strategy and pivotal targets need to be tuned according to different tumor subtypes. This study is the first attempt to elucidate breast cancer subtypes by combining microRNA and mRNA expression, immunohistochemical markers, and cancer core pathways. The results can avail the functional studies of the etiology of breast cancer and translated for clinical use given their intrinsic link in terms of immunohistochemistry and survival. This submission consists of microRNA profiles of 115 breast cancer tumors of several subtypes only.
Project description:Breast cancer is a profoundly heterogeneous disease with respect to biological and clinical behavior. Gene expression profiling has been used to dissect this complexity and stratify tumors into intrinsic gene expression subtypes associated with distinct biology, patient outcome and different genomic alterations. Additionally, breast tumors occurring in individuals with germline BRCA1 or BRCA2 mutations typically fall into distinct subtypes. We applied global DNA copy number and gene expression profiling in 359 breast tumors. All tumors were classified according to intrinsic gene expression subtypes and included cases from genetically predisposed women. The Genomic Identification of Significant Targets in Cancer (GISTIC) algorithm was used to identify significant DNA copy number aberrations and genomic subgroups of breast cancer. We identified 31 genomic regions that were highly amplified in >1% of the 359 breast tumors. Several amplicons were found to co-occur, the 8p12 and 11q13.3 regions being the most frequent combination besides amplicons on the same chromosomal arm. Unsupervised hierarchical clustering with 133 significant GISTIC regions (66 and 67 with DNA copy number gain and loss, respectively) revealed six genomic subtypes, termed: 17q12, basal-complex, luminal-simple, luminal-complex, amplifier and mixed subtype. Four of them had striking similarity to intrinsic gene expression subtypes and showed association to conventional tumor biomarkers and clinical outcome. However, luminal A-classified tumors were distributed in two main genomic subtypes, luminal-simple and luminal-complex, the former group having better prognosis while the latter group included also luminal B and the majority of BRCA2-mutated tumors. The basal-complex subtype displayed extensive genomic homogeneity and harbored the majority of BRCA1-mutated tumors. The 17q12 subtype comprised mostly HER2-amplified and HER2-enriched subtype tumors and had the worst prognosis. The amplifier and mixed subtypes contained tumors from all gene expression subtypes, the former being enriched for 8p12-amplified cases while the mixed subtype included many tumors with predominantly DNA copy number losses and poor prognosis. Genomic profiling of 359 breast tumors using tiling BAC aCGH. A number of cases were hybridized as replicates or replicate as dye-swaps. Gene expression profiling of 359 breast tumors using 55K oligonucleotide microarrays.
Project description:A microarray targeting promoters of cancer-related genes was used to evaluate DNA methylation at 935 CpG sites in 517 invasive breast tumors from the Carolina Breast Cancer Study (CBCS), a population-based study of invasive breast cancer. Concensus clustering using methylation (β) values for the 167 most variant CpG loci defined 4 clusters differing most distinctly in hormone receptor (HR) status, intrinsic subtype (luminal versus basal-like) and p53 mutation status. Supervised analyses for HR status, subtype, and p53 status identified differentially methylated CpG loci with considerable overlap (n=266). Concensus clustering also defined a hypermethylated luminal-enriched tumor cluster 3; gene ontology analysis of cluster 3 hypermethylated loci revealed enrichment for developmental genes, including homeobox domain genes (HOXB13, PAX6, IPF1, EYA4, DLK1, IHH, ISL1, TBX1, SOX1, SOX17). The hypermethylated luminal-enriched cluster 3 independently predicted poorer survival in multivariate Cox proportional hazard analysis, and this finding was confirmed in analysis of luminal A tumors. This study demonstrates epigenetic heterogeneity among breast tumors of a single intrinsic subtype, and shows that epigenetic patterns are strongly associated with HR status, subtype, and p53 mutation status. Among HR+ tumors, a gene signature characterized by hypermethylation of developmental genes may have prognostic value. Genes differentially methylated between clinically-important tumor subsets have roles in differentiation, development, and tumor growth and may be critical to inducing and maintaining tumor phenotypes and clinical outcomes. 517 breast tumors, 9 normal breast tissues
Project description:Breast cancer is a profoundly heterogeneous disease with respect to biological and clinical behavior. Gene expression profiling has been used to dissect this complexity and stratify tumors into intrinsic gene expression subtypes associated with distinct biology, patient outcome and different genomic alterations. Additionally, breast tumors occurring in individuals with germline BRCA1 or BRCA2 mutations typically fall into distinct subtypes. We applied global DNA copy number and gene expression profiling in 359 breast tumors. All tumors were classified according to intrinsic gene expression subtypes and included cases from genetically predisposed women. The Genomic Identification of Significant Targets in Cancer (GISTIC) algorithm was used to identify significant DNA copy number aberrations and genomic subgroups of breast cancer. We identified 31 genomic regions that were highly amplified in >1% of the 359 breast tumors. Several amplicons were found to co-occur, the 8p12 and 11q13.3 regions being the most frequent combination besides amplicons on the same chromosomal arm. Unsupervised hierarchical clustering with 133 significant GISTIC regions (66 and 67 with DNA copy number gain and loss, respectively) revealed six genomic subtypes, termed: 17q12, basal-complex, luminal-simple, luminal-complex, amplifier and mixed subtype. Four of them had striking similarity to intrinsic gene expression subtypes and showed association to conventional tumor biomarkers and clinical outcome. However, luminal A-classified tumors were distributed in two main genomic subtypes, luminal-simple and luminal-complex, the former group having better prognosis while the latter group included also luminal B and the majority of BRCA2-mutated tumors. The basal-complex subtype displayed extensive genomic homogeneity and harbored the majority of BRCA1-mutated tumors. The 17q12 subtype comprised mostly HER2-amplified and HER2-enriched subtype tumors and had the worst prognosis. The amplifier and mixed subtypes contained tumors from all gene expression subtypes, the former being enriched for 8p12-amplified cases while the mixed subtype included many tumors with predominantly DNA copy number losses and poor prognosis.
Project description:Summary: Breast cancer cell lines have been used widely to investigate breast cancer pathobiology and new therapies. Breast cancer is a molecularly heterogeneous disease, and it is important to understand how well and which cell lines best model that diversity. In particular, microarray studies have identified molecular subtypes (luminal A, luminal B, ERBB2-associated, basal-like and normal-like) with characteristic gene-expression patterns and underlying DNA copy number alterations (CNAs). Here, we studied a collection of breast cancer cell lines to catalog molecular profiles and to assess their relation to breast cancer subtypes. Whole-genome DNA microarrays were used to profile gene expression and CNAs in a collection of 52 widely-used breast cancer cell lines, and comparisons were made to existing profiles of primary breast tumors. Hierarchical clustering was used to identify gene-expression subtypes, and Gene Set Enrichment Analysis (GSEA) to discover biological features of those subtypes. Genomic and transcriptional profiles were integrated to discover within high-amplitude CNAs candidate cancer genes with coordinately altered gene copy number and expression. Transcriptional profiling of breast cancer cell lines identified one luminal and two basal-like (A and B) subtypes. Luminal lines displayed an estrogen receptor (ER) signature and resembled luminal-A/B tumors, basal-A lines were associated with ETS-pathway and BRCA1 signatures and resembled basal-like tumors, and basal-B lines displayed mesenchymal and stem-cell characteristics. Compared to tumors, cell lines exhibited similar patterns of CNA, but an overall higher complexity of CNA (genetically simple luminal-A tumors were not represented), and only partial conservation of subtype-specific CNAs. We identified 80 high-level DNA amplifications and 13 presumptive homozygous deletions, and the resident genes with concomitantly altered gene-expression, highlighting known and novel candidate breast cancer genes. Overall, breast cancer cell lines were genetically more complex than tumors, but retained expression patterns with relevance to the luminal-basal subtype distinction. The compendium of molecular profiles defines cell lines suitable for investigations of subtype-specific pathobiology, biomarkers and therapies, and provides a resource for discovery of new breast cancer genes. HEEBO oligonucleotide microarrays from the Stanford Functional Genomics Facility were used to perform gene expression profiling of 50 human breast epithelial cell lines, in comparison to a universal RNA reference. Expression data were analyzed by hierarchical clustering to identify subgroups, and gene set enrichment analysis to identify subgroup-specific gene pathways.