A Map of Copy Number Variations in Chinese Populations
Ontology highlight
ABSTRACT: It has been shown that the human genome contains extensive copy number variations (CNVs). Investigating the medical and evolutionary impacts of CNVs requires the knowledge of locations, sizes and frequency distribution of CNVs within and between populations. However, CNV study of Chinese populations has been underrepresented considering the same efforts in other populations. Here we constructed a Chinese CNV map by using Affymetrix SNP 6 array. We did population analysis with other HapMap populations and identified population specific CNVs as well as candidate CNV regions under selection. Our results serve as a useful resource in further evolutionary and medical studies.
Project description:It has been shown that the human genome contains extensive copy number variations (CNVs). Investigating the medical and evolutionary impacts of CNVs requires the knowledge of locations, sizes and frequency distribution of CNVs within and between populations. However, CNV study of Chinese populations has been underrepresented considering the same efforts in other populations. Here we constructed a Chinese CNV map by using Affymetrix SNP 6 array. We did population analysis with other HapMap populations and identified population specific CNVs as well as candidate CNV regions under selection. Our results serve as a useful resource in further evolutionary and medical studies. There are 155 samples included in the analysis
Project description:Copy number variations (CNVs) have been demonstrated as crucial substrates for evolution, adaptation and breed formation. Chinese indigenous cattle breeds exhibit a broad geographical distribution and diverse environmental adaptability. Here, we analyzed the population structure and adaptation to high altitude of Chinese indigenous cattle based on genome-wide CNVs derived from the high-density BovineHD SNP array. We successfully detected the genome-wide CNVs of 318 individuals from 24 Chinese indigenous cattle breeds and 37 yaks as outgroups. A total of 5,818 autosomal CNV regions (683 bp - 4,477,860 bp in size), covering ~14.34% of the bovine genome (UMD3.1), were identified, showing abundant CNV resources. Neighbor-joining clustering, principal component analysis (PCA), and population admixture analysis based on these CNVs support that most Chinese cattle breeds are hybrids of Bos taurus taurus (hereinafter to be referred as Bos taurus) and Bos taurus indicus (Bos indicus). The distribution patterns of the CNVs could to some extent be related to the geographical backgrounds of the habitat of the breeds, and admixture among cattle breeds from different districts. We analyzed the selective signatures of CNVs positively involved in high-altitude adaptation using pairwise Fst analysis within breeds with a strong Bos taurus background (taurine-type breeds) and within Bos taurus×Bos indicus hybrids, respectively. CNV-overlapping genes with strong selection signatures (at top 0.5% of Fst value), including LETM1 (Fst = 0.490), TXNRD2 (Fst=0.440) and STUB1 (Fst=0.420) within taurine-type breeds, and NOXA1 (Fst = 0.233), RUVBL1 (Fst=0.222) and SLC4A3 (Fst=0.154) within hybrids, were potentially involved in the adaptation to hypoxia. Thus, we provide a new profile of population structure from the CNV aspects of Chinese indigenous cattle and new insights into high-altitude adaptation in cattle.
Project description:Despite considerable excitement over the potential functional significance of copy number variants (CNVs), we still lack knowledge of the fine-scale architecture of the large majority of CNV regions in the human genome. In this study, we used a high-resolution array-based comparative genomic hybridization (aCGH) platform that targeted known CNV regions of the human genome at ~1 kb resolution to interrogate the genomic DNAs of 30 individuals from four HapMap populations. Our results revealed that 1,020 of 1,153 CNV loci (88%) were actually smaller in size than what is recorded in the Database of Genomic Variants based on previously published studies. A reduction in size of more than 50% was observed for 876 CNV regions (76%). We conclude that the total genomic content of common human CNVs may be smaller than previously thought. In addition, approximately 8% of the CNV regions observed in multiple individuals exhibited genomic architectural complexity in the form of smaller CNVs within larger ones and CNVs with inter-individual variation in breakpoints. Future association studies that aim to capture the potential influences of CNVs on disease phenotypes will need to consider how to best ascertain this previously underappreciated complexity. The DNA samples are a panel of 30 Hapmap samples, and a female sample NA15510 of undetermined geographic origin, which has been well characterized by fosmid-end sequencing (Tuzun et al. 2005, Nature Genetics 10, p1038). The DNA for this set of 16 female and 15 male samples are all supplied by the Coriell Cell Repository, and is comprised of samples from four populations: 10 unrelated Yoruban, 10 unrelated European-American individuals from Utah (CEPH), 5 unrelated Chinese, and 5 unrelated Japanese. The reference sample, NA10851 is also a male CEPH also from the HapMap Sample set (Corriel Cell Repository). Each of these samples was hybridized in pairs with the reversed labeling polarities. Additionally, 3 self-self control hybridizations were carried out for the reference sample, NA10851, one on each hybridization date. In the Title for each of the samples, each hybridization session is indicated by the letters A, B or C.
Project description:To reveal the risk CNVs of SZ in Chinese population, we recrited and enrolled 100 Chinese family trios with a schizophrenia affected children and both of their father and mother. SZ was diagnosed according to DSM-IV criteria by two independent psychiatrists. There gDNA we screen the genome-wide CNV using Agilent SurePrint G3 Human CGH Microarray Kit (1x1M) and Agilent sex-matched human DNA was used as reference. The CNV were called by ADM-2 statistical algorithms with a threshold of 6.0. We compared the burden of large rare CNVs and found that SZ probands carried more duplications and less deletions. Furtherly, we performed familial inheritance analysis of transmission disequilibrium and de novo CNV detection, validated several associated CNV loci with SZ susceptibility and also identify eight novel loci conferring risk of SZ
Project description:Despite considerable excitement over the potential functional significance of copy number variants (CNVs), we still lack knowledge of the fine-scale architecture of the large majority of CNV regions in the human genome. In this study, we used a high-resolution array-based comparative genomic hybridization (aCGH) platform that targeted known CNV regions of the human genome at ~1 kb resolution to interrogate the genomic DNAs of 30 individuals from four HapMap populations. Our results revealed that 1,020 of 1,153 CNV loci (88%) were actually smaller in size than what is recorded in the Database of Genomic Variants based on previously published studies. A reduction in size of more than 50% was observed for 876 CNV regions (76%). We conclude that the total genomic content of common human CNVs may be smaller than previously thought. In addition, approximately 8% of the CNV regions observed in multiple individuals exhibited genomic architectural complexity in the form of smaller CNVs within larger ones and CNVs with inter-individual variation in breakpoints. Future association studies that aim to capture the potential influences of CNVs on disease phenotypes will need to consider how to best ascertain this previously underappreciated complexity. Keywords: comparative genomic hybridization
Project description:The sheep (Ovis aries) plays a major socio-economic role in the world. Copy number variations (CNVs) are increasingly recognized as a key and potent source of genetic variation and phenotypic diversity, but little is known about the extent to which CNVs contribute to genetic variation in Chinese sheep breeds. Analyses of CNVs in the genomes of eight sheep breeds were performed using the sheep SNP50 BeadChip genotyping array. A total of 111 CNV regions (CNVRs) were obtained from 160 Chinese sheep breeds. These CNVRs covered 13.75 Mb of the sheep genome sequence. A total of 22 Go terms and 17 candidate genes were obtained from the functional analysis. Ten CNVRs were selected for validation, of which 7 CNVRs were further experimentally confirmed by quantitative PCR. Four candidate genes were selected to confirm the results of the functional analysis. These results provide a resource for furthering understanding of ruminant biology, and for further improving the genetic quality of sheep breeds.
Project description:Here we describe a genome-wide analysis of copy number variations (CNVs) in Chinese domestic cattle by using array comparative genomic hybridization (array CGH) and quantitative PCR (qPCR). We conducted array CGH analysis on 30 male cattle individuals, animals from consisting of 12 breeds of Bos taurus/Bos indicus, 1 Bos grunniens and and two ones of Bubalus bubalis breeds for with beef, and/or dairy or dual purpose. We identified over 470 candidate CNV regions (CNVRs) in Bos B. taurus/B. indicus; 118 candidate CNV regions (CNVRs) in B. grunniens, 139 CNVRs in B. bubalis. Furthermore, based on the Y haplotypes of B. taurus/ B. indicus, Wwe also identified 69, 337, and 251 candidate CNV regions (CNVRs) in the sub-groups of Y1, Y2 and Y3 haplotypes.
Project description:Tetralogy of Fallot (TOF), the most frequent cyanotic congenital heart disease, occurs as a simplex trait of unknown etiology in the majority of cases. Studies of non-Asian populations suggest that approximately 10% of TOF cases carry a de novo rare copy number variant (CNV) thought to underlie the malformation. A genome-wide CNV analysis was performed in 303 TOF and 302 controls of Han Chinese as well as compared to 1,000 common Chinese database and revealed 166 rare CNVs identified in TOF patients with 119 CNVs further evaluated as potential “TOF-specific CNVs”; 98 were validated by qPCR, and 44 CNVs showed positive results on validation (positive rate 46.9%, 44/98). The genes related to the clinical phenotypes (subpulmonary VSD, bicuspid pulmonary valve, aortic valve overriding more than 75%, and right aortic arch) and the specific CNVs were included in integrating gene-gene interaction network analysis that identified the genes covering the specific CNVs directly or indirectly correlated to TOF and the signaling pathways. Thus, this study identified novel TOF-specific/associate CNVs in the Han Chinese that occurring at higher frequency in the Han Chinese (28.4% in Chr 1-20 and 42.9% in all chromosomes) is reflective of the increased prevalence of TOF in China. These novel CNVs identify new candidate genes for TOF. Our findings provide new insight into the contribution of CNVs to the genetic basis of TOF and identify new genetic loci potentially important to the pathogenesis of TOF
Project description:Chromosomal segmental copy number variation (CNV) has been recently recognized as a very important source of genetic variability. Some CNV loci involve genes or conserved regulatory regions. Compelling evidence indicates that CNVs impact genome functions. The chicken is a very important farm animal species which has also served as model animal for biological and biomedical research for hundreds of years. A map of CNVs in chickens could facilitate the identification of chromosome regions that segregate for important agricultural and disease phenotypes. NimbleGen 385k whole genome tiling arrays were used to map CNVs in the chicken. This study has identified 96 CNVs in three lines of chickens (broiler, Leghorn and Rhode Island red). These CNVs encompass 16 Mb (1.3%) of the chicken genome. Twenty six CNVs were found in two or more animals. Smaller sized CNVs mostly affect none coding sequences while larger CNV regions involve genes, for example prolactin receptor, aldose reductase and zinc finger proteins, suggesting chicken CNVs potentially affect agricultural or disease related traits.
Project description:A CNV map in pigs could facilitate the identification of chromosomal regions that segregate for important economic and disease phenotypes. The goal of this study was to identify CNV regions (CNVRs) in pigs based on a custom array comparative genome hybridization (aCGH). We carried out a custom-made array comparative genome hybridization (aCGH) experiment in order to identify copy number variations (CNVs) in the pig genome analysing animals of diverse pig breeds (White Duroc, Yangxin, Erhualian, Tongcheng, Large White, Pietrain, Landrace and Chinese new pig line DIV ) using a tiling oligonucleotide array with ~720,000 probes designed on the pig genome (Sus scrofa genome version 9.0).