ABSTRACT: A data set of normal epithelium, serous ovarian surface epithelial-stromal tumors (benign and type II malignancies), stroma distal to tumor, and stroma adjacent to tumor (50 samples total). Additional cel files are included which represent replicate sampling from patients, and cel files that failed quality control but may be bioinformatically interesting. Additional replicate or failed cel files were not included in the final analysis (and so these samples were not included in the matrix). Background: Ovarian cancer is the most lethal gynecologic cancer in the United States. If caught in early stages, patient survival rate can reach 94%, when diagnosed at late stages survival rates drop to 28%. Correct diagnosis depends on the presence of definite symptoms: while ~90% of diagnosed ovarian cancers have symptoms, they tend to be unfocused and subacute. A definitive and early molecular signature of disease is thus desired. To further progress towards this goal, we present an Affymetrix™ human exon array data set measuring ovarian tumor expression, assembled using best practices. Method: Samples were collected from patients with benign and malignant (type II) serous ovarian surface epithelial-stromal tumors. Normal epithelium, tumor, stroma adjacent to tumor, and distal stroma were selected based on histopathology, and isolated using laser capture microdissection. Nugen products were used to perform random-primed mRNA amplification procedures (for full transcript capture) before hybridization to Affymetrix exon chips. Tumor expression and paracrine signaling was assessed using GC-RMA and a two-way Model I ANOVA. Single enrichment ontological analysis and gene network construction were performed to guide inferences about biological context. Results: In total, across 50 microarrays, ~270 million measurements were obtained. Based on comparisons to known ovarian cancer properties as established in molecular genetics literature, the initial analysis presented emphasizes data quality. Major trends between sample classes included: apical surface and tight junction activity, mitotic activity, benign tumor suppression, epithelial-mesenchymal transitioning, tumor oncogene activity, and paracrine signaling. A list of differentially expressed transcripts has been produced to enable rapid comparisons with published biomarker lists, but it is expected that detailed alternative transcript analysis will refine these predictions. Conclusions: A data set of 50 arrays, from carefully dissected serous ovarian surface epithelial-stromal tumors, has been produced, from which high quality measurements were obtained. While relatively small in number, this represents an important addition to the community pool of ovarian tumor samples, and the chosen platform enables bridging between 3' expression and exome sequencing data sets. This represents a significant contribution to the ovarian cancer genetics community.