Project description:Higher order chromatin structure is important for regulation of genes by distal regulatory sequences. Structural variants that alter 3D genome organization can lead to enhancer-promoter rewiring and human disease, particularly in the context of cancer. However, only a small minority of structural variants are associated with altered gene expression and it remains unclear why certain structural variants lead to changes in distal gene expression and others do not. To address these questions, we used a combination of genomic profiling and genome engineering to identify sites of recurrent changes in 3D genome structure in cancer and determine the effects of specific rearrangements on oncogene activation. By analyzing Hi-C data from 92 cancer cell lines and patient samples, we identified loci affected by recurrent alterations to 3D genome structure, including oncogenes such as MYC, TERT, and CCND1. Using CRISPR/Cas9 genome engineering to generate de novo structural variants, we show that oncogene activity can be predicted using “Activity-by-Contact” models that consider partner region chromatin contacts and enhancer activity. However, Activity-by-Contact models are only predictive of specific subsets of genes in the genome, suggesting that different classes of genes engage in distinct modes of regulation by distal regulatory elements. These results indicate that structural variants that alter 3D genome organization are widespread in cancer genomes and begin to illustrate predictive rules for the consequences of structural variants on oncogene activation.
Project description:Genetic variation amongst individual humans occurs on many different scales, ranging from gross alterations in the human karyotype to single-nucleotide changes. In this manuscript we explore variation on an intermediate scale-particularly insertions, deletions, and inversions affecting from a few thousand to a few million base pairs. We employed a clone-based method to interrogate this intermediate structural variation in eight individuals of diverse geographic ancestry. Our analysis provides a comprehensive overview of the normal pattern of structural variation present in these genomes, refining the location of 1695 structural variants. We find that 50% were seen in more than one individual and that nearly half lay outside regions of the genome previously described as structurally variant. We discover 525 new insertion sequences that are not present in the human reference genome and show that many of these are variable in copy number among individuals. Sequencing of a subset of structural variants reveals considerable locus complexity and provides insights into the different mutational processes that have shaped the human genome. These data provide the first high-resolution sequence-map of human structural variation-an important standard for genotyping platforms and a prelude to future individual genome sequencing projects. Keywords: comparitive genomic hybridization, copy number variation, structural variation, fosmid end sequencing CGH analysis targeted against sites identified by fosmid end sequencing. 8 HapMap samples (sources of libraries ABC7-ABC14) are hybed against NA15510 (source of fosmid library G248).
Project description:Genetic variation amongst individual humans occurs on many different scales, ranging from gross alterations in the human karyotype to single-nucleotide changes. In this manuscript we explore variation on an intermediate scale-particularly insertions, deletions, and inversions affecting from a few thousand to a few million base pairs. We employed a clone-based method to interrogate this intermediate structural variation in eight individuals of diverse geographic ancestry. Our analysis provides a comprehensive overview of the normal pattern of structural variation present in these genomes, refining the location of 1695 structural variants. We find that 50% were seen in more than one individual and that nearly half lay outside regions of the genome previously described as structurally variant. We discover 525 new insertion sequences that are not present in the human reference genome and show that many of these are variable in copy number among individuals. Sequencing of a subset of structural variants reveals considerable locus complexity and provides insights into the different mutational processes that have shaped the human genome. These data provide the first high-resolution sequence-map of human structural variation-an important standard for genotyping platforms and a prelude to future individual genome sequencing projects. Keywords: comparitive genomic hybridization, copy number variation, structural variation, fosmid end sequencing
Project description:Genetic variation amongst individual humans occurs on many different scales, ranging from gross alterations in the human karyotype to single-nucleotide changes. In this manuscript we explore variation on an intermediate scale-particularly insertions, deletions, and inversions affecting from a few thousand to a few million base pairs. We employed a clone-based method to interrogate this intermediate structural variation in eight individuals of diverse geographic ancestry. Our analysis provides a comprehensive overview of the normal pattern of structural variation present in these genomes, refining the location of 1695 structural variants. We find that 50% were seen in more than one individual and that nearly half lay outside regions of the genome previously described as structurally variant. We discover 525 new insertion sequences that are not present in the human reference genome and show that many of these are variable in copy number among individuals. Sequencing of a subset of structural variants reveals considerable locus complexity and provides insights into the different mutational processes that have shaped the human genome. These data provide the first high-resolution sequence-map of human structural variation-an important standard for genotyping platforms and a prelude to future individual genome sequencing projects. Keywords: comparative genomic hybridization
Project description:Genetic variation amongst individual humans occurs on many different scales, ranging from gross alterations in the human karyotype to single-nucleotide changes. In this manuscript we explore variation on an intermediate scale-particularly insertions, deletions, and inversions affecting from a few thousand to a few million base pairs. We employed a clone-based method to interrogate this intermediate structural variation in eight individuals of diverse geographic ancestry. Our analysis provides a comprehensive overview of the normal pattern of structural variation present in these genomes, refining the location of 1695 structural variants. We find that 50% were seen in more than one individual and that nearly half lay outside regions of the genome previously described as structurally variant. We discover 525 new insertion sequences that are not present in the human reference genome and show that many of these are variable in copy number among individuals. Sequencing of a subset of structural variants reveals considerable locus complexity and provides insights into the different mutational processes that have shaped the human genome. These data provide the first high-resolution sequence-map of human structural variation-an important standard for genotyping platforms and a prelude to future individual genome sequencing projects. Keywords: comparative genomic hybridization The DNA samples are a panel of 8 Hapmap samples, described by E. Eichler et al. (2007, Nature 447, 161-165). This set of 7 female, and one male samples are from from the Coriell Cell Repository, and is comprised of samples from four populations: four Yoruban, two CEPH, one Chinese, and one Japanese. The reference sample, NA15510, is female and also from the Corriel Cell Repository. This sample has been extensively characterized, (for example in Tuzan et al. 2005, Nature Genetics 10, p1038) and has been recommended for use in CNV detection programs to allow meaningful comparison of data between studies (discussed in Scherer, et al. 2007, Nature Genetics Supplement 39: S7-S15). Each of these samples was hybridized in pairs with the reversed labeling polarities. Additionally, 3 self-self control hybridizations were carried out for the reference sample, NA15510, one on each hybridization date.
Project description:SCN1A, encoding the sodium channel protein type 1 alpha subunit, is the most implicated gene in epilepsy. Pathogenic loss-of-function variants that result in SCN1A haploinsufficiency cause the most common DEE, known as Dravet syndrome (DS). Pathogenic gain-of-function variants have been found to cause a more severe, early-onset epilepsy syndrome that is distinct from DS. Here, we investigated DNA methylation patterns in these individuals with SCN1A variants.