Project description:The NIH Roadmap Epigenomics Mapping Consortium aims to produce a public resource of epigenomic maps for stem cells and primary ex vivo tissues selected to represent the normal counterparts of tissues and organ systems frequently involved in human disease. Study of chromatin accessibility and expression using exon arrays. **************** For data usage terms and conditions, please refer to: http://www.drugabuse.gov/funding/funding-opportunities/nih-common-fund/epigenomics-data-access-policies ****************
Project description:This experiment contains a subset of data from the BLUEPRINT Epigenome project ( http://www.blueprint-epigenome.eu ), which aims to produce reference haemopoietic epigenomes for the research community. 4 samples of primary cells from tonsil with cell surface markers CD20med/CD38high in young individuals (3 to 10 years old) are included in this experiment. This ArrayExpress record contains only meta-data. Raw data files have been archived at the European Genome-Phenome Archive (EGA, www.ebi.ac.uk/ega) by the consortium, with restricted access to protect sample donors' identity. The relevant EGA data set accession is EGAD00001001523. Details on how to apply for data access via the BLUEPRINT data access committee are on the EGA data set pages. The mapping of samples to this EGA accession can be found in the 'Sample Data Relationship Format' file of this ArrayExpress record. Information on individual samples and sequencing libraries can also be found on the BLUEPRINT data coordination centre (DCC) website: http://dcc.blueprint-epigenome.eu
Project description:Development of free/libre open source software is usually done by a community of people with an interest in the tool. For scientific software, however, this is less often the case. Most scientific software is written by only a few authors, often a student working on a thesis. Once the paper describing the tool has been published, the tool is no longer developed further and is left to its own devices. Here we describe the broad, multidisciplinary community we formed around a set of tools for statistical genomics. The GenABEL project for statistical omics actively promotes open interdisciplinary development of statistical methodology and its implementation in efficient and user-friendly software under an open source licence. The software tools developed within the project collectively make up the GenABEL suite, which currently consists of eleven tools. The open framework of the project actively encourages involvement of the community in all stages, from formulation of methodological ideas to application of software to specific data sets. A web forum is used to channel user questions and discussions, further promoting the use of the GenABEL suite. Developer discussions take place on a dedicated mailing list, and development is further supported by robust development practices including use of public version control, code review and continuous integration. Use of this open science model attracts contributions from users and developers outside the "core team", facilitating agile statistical omics methodology development and fast dissemination.
Project description:This experiment contains a subset of data from the BLUEPRINT Epigenome project ( http://www.blueprint-epigenome.eu ), which aims to produce reference haemopoietic epigenomes for the research community. 29 samples of primary cells or cultured primary cells of different haemopoietic lineages from cord blood are included in this experiment. This ArrayExpress record contains only meta-data. Raw data files have been archived at the European Genome-Phenome Archive (EGA, www.ebi.ac.uk/ega) by the consortium, with restricted access to protect sample donors' identity. The relevant EGA data set accession is EGAD00001001165. Details on how to apply for data access via the BLUEPRINT data access committee are on the EGA data set pages. The mapping of samples to this EGA accession can be found in the 'Sample Data Relationship Format' file of this ArrayExpress record. Information on individual samples and sequencing libraries can also be found on the BLUEPRINT data coordination centre (DCC) website: http://dcc.blueprint-epigenome.eu
Project description:We comprehensively analyzed clinical, genomic and transcriptomic data from a cohort of 465 primary triple-negative breast cancers (TNBCs). PIK3CA mutations and copy number gains of chromosome 22q11 were more frequent in our Chinese cohort than in The Cancer Genome Atlas. We classified TNBCs into four transcriptome-based subtypes: 1) luminal androgen receptor (LAR), 2) immunomodulatory (IM), 3) basal-like immune-suppressed (BLIS), and 4) mesenchymal (MES). Putative therapeutic targets or biomarkers were identified for each subtype. Importantly, the LAR subtype showed more ERBB2 somatic mutations, infrequent mutational signature 3 and frequent CDKN2A loss. The comprehensive profile of TNBCs provided here will serve as a reference to further advance the understanding and precision treatment of TNBC.
Project description:This experiment contains a subset of data from the BLUEPRINT Epigenome project ( http://www.blueprint-epigenome.eu ), which aims to produce reference haemopoietic epigenomes for the research community. 74 samples of primary cells or cultured primary cells of different haemopoietic lineages from cord blood, venous blood, bone marrow and thymus are included in this experiment. This ArrayExpress record contains only meta-data. Raw data files have been archived at the European Genome-Phenome Archive (EGA, www.ebi.ac.uk/ega) by the consortium, with restricted access to protect sample donors' identity. There are 32 EGA data set accessions, which can be found under the Comment[EGA_DATA_SET] column in the 'Sample Data Relationship Format' (SDRF) file of this ArrayExpress record (http://www.ebi.ac.uk/arrayexpress/files/E-MTAB-3827/E-MTAB-3827.sdrf.txt). Details on how to apply for data access via the BLUEPRINT data access committee are on the EGA data set pages. Likewise, the mapping of samples to these EGA accessions can be found in the SDRF file. Please note that the raw data files for 11 sequencing runs have not yet been deposited at EGA; they are marked "not available" under the Comment[SUBMITTED_FILE_NAME] field in the SDRF file and were included for the sake of completeness. Further information on individual samples and sequencing libraries can also be found on the BLUEPRINT data coordination centre (DCC) website: http://dcc.blueprint-epigenome.eu