Unknown

Dataset Information

0

Large genomic region free of GWAS-based common variants contains fertility-related genes.


ABSTRACT: DNA variants, such as single nucleotide polymorphisms (SNPs) and copy number variants (CNVs), are unevenly distributed across the human genome. Currently, dbSNP contains more than 6 million human SNPs, and whole-genome genotyping arrays can assay more than 4 million of them simultaneously. In our study, we first questioned whether published genome-wide association studies (GWASs) assays cover all regions well in the genome. Using dbSNP build 135 data, we identified 50 genomic regions longer than 100 Kb that do not contain any common SNPs, i.e., those with minor allele frequency (MAF)? 1%. Secondly, because conserved regions are generally of functional importance, we tested genes in those large genomic regions without common SNPs. We found 97 genes and were enriched for reproduction function. In addition, we further filtered out regions with CNVs listed in the Database of Genomic Variants (DGV), segmental duplications from Human Genome Project and common variants identified by personal genome sequencing (UCSC). No region survived after those filtering. Our analysis suggests that, while there may not be many large genomic regions free of common variants, there are still some "holes" in the current human genomic map for common SNPs. Because GWAS only focused on common SNPs, interpretation of GWAS results should take this limitation into account. Particularly, two recent GWAS of fertility may be incomplete due to the map deficit. Additional SNP discovery efforts should pay close attention to these regions.

SUBMITTER: Qiu R 

PROVIDER: S-EPMC3629113 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

altmetric image

Publications

Large genomic region free of GWAS-based common variants contains fertility-related genes.

Qiu Rong R   Chen Chao C   Jiang Hong H   Shen Libing L   Wu Min M   Liu Chunyu C  

PloS one 20130417 4


DNA variants, such as single nucleotide polymorphisms (SNPs) and copy number variants (CNVs), are unevenly distributed across the human genome. Currently, dbSNP contains more than 6 million human SNPs, and whole-genome genotyping arrays can assay more than 4 million of them simultaneously. In our study, we first questioned whether published genome-wide association studies (GWASs) assays cover all regions well in the genome. Using dbSNP build 135 data, we identified 50 genomic regions longer than  ...[more]

Similar Datasets

| S-EPMC3166197 | biostudies-literature
| S-EPMC3681663 | biostudies-literature
| S-EPMC5070898 | biostudies-literature
| S-EPMC2933424 | biostudies-literature
| S-EPMC4863559 | biostudies-literature
| S-EPMC7061873 | biostudies-literature
| S-EPMC6777396 | biostudies-literature
| S-EPMC4745342 | biostudies-literature
| S-EPMC2728932 | biostudies-literature
| S-EPMC4614854 | biostudies-other