Diversity of Human Copy Number Variation and Multicopy Genes
Ontology highlight
ABSTRACT: Copy number variants (CNVs) affect both disease and normal phenotypic variation but those lying within heavily duplicated, highly identical sequence have been difficult to assay. By analyzing short-read mapping depth for 159 human genomes, we demonstrate accurate estimation of absolute copy number for duplications as small as 1.9 kbp, ranging from 0-48 copies. We identified 4.1 million ‘singly unique nucleotide’ (SUN) positions informative in distinguishing specific copies, and use them to genotype the copy and content of specific paralogs within highly duplicated gene families. These data identify human-specific expansions in genes associated with brain development, reveal extensive population genetic diversity, and detect signatures consistent with gene conversion in the human species. Our approach makes ~1000 genes accessible to genetic studies of disease association.
ORGANISM(S): Homo sapiens
PROVIDER: GSE24334 | GEO | 2010/09/24
SECONDARY ACCESSION(S): PRJNA132747
REPOSITORIES: GEO
ACCESS DATA