Project description:Gene expression profiles were generated from RNA isolated from 199 primary breast cancer patients. Samples 1-176 were also used in another study (GEO Series GSE22820) and form the training set in this study; samples 200-222 form the validation set. The data were used to build a machine learning classifier that predicts estrogen receptor (ER) status using only three gene features.
Project description:Large-scale serum miRNomics in combination with machine learning could lead to the development of a blood-based cancer classification system.
Project description:CD34+ haematopoietic stem cells were differentiated ex vivo to generate ChIP-seq data for machine learning of the rules underlying open chromatin dynamics.
Project description:The RNA polymerase II core promoter is the site of convergence of the signals that lead to the initiation of transcription. Here, we perform a comparative analysis of the downstream core promoter region (DPR) in Drosophila and humans by using machine learning. These studies revealed a distinct human-specific version of the DPR and led to the use of the machine learning models for the identification of synthetic extreme DPR motifs with specificity for human transcription factors relative to Drosophila factors, and vice versa. More generally, machine learning models could be analogously used to design synthetic promoter elements with customized functional properties.
Project description:CD34+ haematopoietic stem cells were differentiated under two ex vivo protocols to generate ATAC-seq data for machine learning of the rules underlying open chromatin dynamics.