Project description:Copy-number variants (CNVs) are large-scale amplifications or deletions of DNA that can drive rapid adaptive evolution and result in large-scale changes in gene expression. Whereas alterations in the copy number of one or more genes within a CNV can confer a selective advantage, other genes within a CNV can decrease fitness when their dosage is changed. Dosage compensation - in which the gene expression output from multiple gene copies is less than expected - is one means by which an organism can mitigate the fitness costs of deleterious gene amplification. Previous research has shown evidence for dosage compensation at both the transcriptional level and at the level of protein expression; however, the extent of compensation differs substantially between genes, strains, and studies. Here, we investigated sources of dosage compensation at multiple levels of gene expression regulation by defining the transcriptome, translatome and proteome of experimentally evolved yeast (Saccharomyces cerevisiae) strains containing adaptive CNVs.
Project description:The human UGT2B17 gene varies in copy number from zero to two per individual and also differs in mean number between populations from Africa, Europe, and East Asia. We show that such a high degree of geographical variation is unusual and investigate its evolutionary history. This required first reinterpreting the reference sequence in this region of the genome, which is misassembled from the two different alleles separated by an artifactual gap. A corrected assembly identifies the polymorphism as a 117 kb deletion arising by nonallelic homologous recombination between approximately 4.9 kb segmental duplications and allows the deletion breakpoint to be identified. We resequenced approximately 12 kb of DNA spanning the breakpoint in 91 humans from three HapMap and one extended HapMap populations and one chimpanzee. Diversity was unusually high and the time to the most recent common ancestor was estimated at approximately 2.4 or approximately 3.0 million years by two different methods, with evidence of balancing selection in Europe. In contrast, diversity was low in East Asia where a single haplotype predominated, suggesting positive selection for the deletion in this part of the world.
Project description:Background: Detecting genetic variation is a critical step in elucidating the molecular mechanisms underlying phenotypic diversity. Until recently, such detection has mostly focused on single nucleotide polymorphisms (SNPs) because of the ease in screening complete genomes. Another type of variant, copy number variation (CNV), is emerging as a significant contributor to phenotypic variation in many species. Here we describe a genome-wide CNV study using array comparative genomic hybridization (aCGH) in a wide variety of chicken breeds. Results: We identified 3,154 CNVs, grouped into 1,556 CNV regions (CNVRs). Thirty percent of the CNVs were detected in at least 2 individuals. The average size of the CNVs detected was 46.3 kb with the largest CNV, located on GGAZ, being 4.3 Mb. Approximately 75% of the CNVs are copy number losses relatively to the Red Jungle Fowl reference genome. The genome coverage of CNVRs in this study is 60 Mb, which represents almost 5.4% of the chicken genome. In particular large gene families such as the keratin gene family and the MHC show extensive CNV. Conclusions: A relative large group of the CNVs are line-specific, several of which were previously shown to be related to the causative mutation for a number of phenotypic variants. The chance that inter-specific CNVs fall into CNVRs detected in chicken is related to the evolutionary distance between the species. Our results provide a valuable resource for the study of genetic and phenotypic variation in this phenotypically diverse species.
Project description:Background: Detecting genetic variation is a critical step in elucidating the molecular mechanisms underlying phenotypic diversity. Until recently, such detection has mostly focused on single nucleotide polymorphisms (SNPs) because of the ease in screening complete genomes. Another type of variant, copy number variation (CNV), is emerging as a significant contributor to phenotypic variation in many species. Here we describe a genome-wide CNV study using array comparative genomic hybridization (aCGH) in a wide variety of chicken breeds. Results: We identified 3,154 CNVs, grouped into 1,556 CNV regions (CNVRs). Thirty percent of the CNVs were detected in at least 2 individuals. The average size of the CNVs detected was 46.3 kb with the largest CNV, located on GGAZ, being 4.3 Mb. Approximately 75% of the CNVs are copy number losses relatively to the Red Jungle Fowl reference genome. The genome coverage of CNVRs in this study is 60 Mb, which represents almost 5.4% of the chicken genome. In particular large gene families such as the keratin gene family and the MHC show extensive CNV. Conclusions: A relative large group of the CNVs are line-specific, several of which were previously shown to be related to the causative mutation for a number of phenotypic variants. The chance that inter-specific CNVs fall into CNVRs detected in chicken is related to the evolutionary distance between the species. Our results provide a valuable resource for the study of genetic and phenotypic variation in this phenotypically diverse species. In total 62 chicken DNA samples (derived from 15 lines) were analyzed against the chicken reference animal UCD001 (the same induvidual that was used to generate the chicken genome reference sequence (ICGSC, 2004)
Project description:Intra-specific polymorphism in copy number is documented in many organisms, including human and chimpanzee, but very little is known for other great apes. This study aims to provide CNVs data for orangutan, gorilla, bonobo and chimpanzee, and compare the CNV patterns among these species, as well as with human CNVs and segmental duplications from public databases.
Project description:Copy-number variants (CNVs) are large-scale amplifications or deletions of DNA that can drive rapid adaptive evolution and result in large-scale changes in gene expression. Whereas alterations in the copy number of one or more genes within a CNV can confer a selective advantage, other genes within a CNV can decrease fitness when their dosage is changed. Dosage compensation - in which the gene expression output from multiple gene copies is less than expected - is one means by which an organism can mitigate the fitness costs of deleterious gene amplification. Previous research has shown evidence for dosage compensation at both the transcriptional level and at the level of protein expression; however, the extent of compensation differs substantially between genes, strains, and studies. Here, we investigated sources of dosage compensation at multiple levels of gene expression regulation by defining the transcriptome, translatome and proteome of experimentally evolved yeast (Saccharomyces cerevisiae) strains containing adaptive CNVs. We quantified the gene expression output at each step and found evidence of widespread dosage compensation at the protein abundance (~47%) level. By contrast we find only limited evidence for dosage compensation at the transcriptional (~8%) and translational (~3%) level. We also find substantial divergence in the expression of unamplified genes in evolved strains that could be due to either the presence of a CNV or adaptation to the environment. Detailed analysis of 82 amplified and 411 unamplified genes with significantly discrepant relationships between RNA and protein abundances identified enrichment for upstream open reading frames (uORFs). These uORFs are enriched for binding site motifs for SSD1, an RNA binding protein that has previously been associated with tolerance of aneuploidy. Our findings suggest that, in the presence of CNVs, SSD1 may act to alter the expression of specific genes by potentiating uORF mediated translational regulation.
Project description:Copy-number variants (CNVs) are large-scale amplifications or deletions of DNA that can drive rapid adaptive evolution and result in large-scale changes in gene expression. Whereas alterations in the copy number of one or more genes within a CNV can confer a selective advantage, other genes within a CNV can decrease fitness when their dosage is changed. Dosage compensation - in which the gene expression output from multiple gene copies is less than expected - is one means by which an organism can mitigate the fitness costs of deleterious gene amplification. Previous research has shown evidence for dosage compensation at both the transcriptional level and at the level of protein expression; however, the extent of compensation differs substantially between genes, strains, and studies. Here, we investigated sources of dosage compensation at multiple levels of gene expression regulation by defining the transcriptome, translatome and proteome of experimentally evolved yeast (Saccharomyces cerevisiae) strains containing adaptive CNVs. We quantified the gene expression output at each step and found evidence of widespread dosage compensation at the protein abundance (~47%) level. By contrast we find only limited evidence for dosage compensation at the transcriptional (~8%) and translational (~3%) level. We also find substantial divergence in the expression of unamplified genes in evolved strains that could be due to either the presence of a CNV or adaptation to the environment. Detailed analysis of 82 amplified and 411 unamplified genes with significantly discrepant relationships between RNA and protein abundances identified enrichment for upstream open reading frames (uORFs). These uORFs are enriched for binding site motifs for SSD1, an RNA binding protein that has previously been associated with tolerance of aneuploidy. Our findings suggest that, in the presence of CNVs, SSD1 may act to alter the expression of specific genes by potentiating uORF mediated translational regulation.
Project description:Copy number variations (CNVs), which represent a significant source of genetic diversity in mammals, are currently being associated with phenotypes of clinical relevance, mostly in humans and mice. Notwithstanding, little is known about the extent of CNV that contributes to genetic variation in farm animals, including pig. This Nimblegen experiment reports a genome-wide high resolution map of copy number variation in the porcine genome. After remapping the initial CNV sequences to the latest genome assembly (Sus scrofa v.9), 84 CNV regions (CNVRs) were identified among the genomes of 21 related porcine samples from Duroc breed. We used a set of NimbleGen CGH arrays that tile across the assayable portion of the pig genome with approximately 2.1 million probes, at a 502 bp average probe spacing (Sus scrofa pre assembly version 6). These CNVRs covered 2 Mb of the genome, and ranged in size from 4 to 352 kb (median size of 12 kb). Together, this analysis provides a useful resource to assist with the assessment of CNVs in the contexts of porcine variation, health and productive efficiency.
Project description:Copy number variations (CNVs), which represent a significant source of genetic diversity in mammals, are currently being associated with phenotypes of clinical relevance, mostly in humans and mice. Notwithstanding, little is known about the extent of CNV that contributes to genetic variation in cattle. This study reports the highest resolution map of copy number variation in the cattle genome, with 304 CNV regions (CNVRs) being identified among the genomes of 20 bovine samples from 4 dairy and beef breeds. We used a set of NimbleGen CGH arrays that tile across the assayable portion of the cattle genome with approximately 6.3 million probes, at a 301 bp median probe spacing. These CNVRs covered 0.68% (23 Mb) of the genome, and ranged in size from 1.7 to 2,031 kb (median size 16.7 kb). About 20% of the CNVs colocalized with segmental duplications while 30% encompassed genes, mostly related with environmental response. About 10% of the orthologous CNV cow-human genes are related with human disease susceptibility and, hence, may have important phenotypic consequences. Together, this analysis provides a useful resource to assist with the assessment of CNVs in the contexts of bovine variation, health and productive efficiency.
Project description:Recents studies in mammalian genomes have uncovered the extent of copy number variation (CNV) that contributes to phenotypic diversity, including health and disease status. Here we report the first glimpse of CNVs in the pig genome covering part of the chromosomes 4, 7, 14 and 17 already sequenced and assembled. We used a custom tiling oligonucleotide array with a median probe spacing of 409 bp to screen 12 unrelated Duroc boar founders of a vast-family material. After a strict CNV calling pipeline it was identified 40 copy number variable regions covering all the four chromosomes, with some overlapping segmental duplications and pig unigenes. This CNV snapshot analysis lays the groundwork for a better understanding of porcine phenotypes and genotypes for the identification of important economic traits. Keywords: comparative genome hybridization, CNV, Sus Scrofa, Nimblegen tiling array