Unknown

Dataset Information

0

A Novel Framework for Characterizing Genomic Haplotype Diversity in the Human Immunoglobulin Heavy Chain Locus.


ABSTRACT: An incomplete ascertainment of genetic variation within the highly polymorphic immunoglobulin heavy chain locus (IGH) has hindered our ability to define genetic factors that influence antibody-mediated processes. Due to locus complexity, standard high-throughput approaches have failed to accurately and comprehensively capture IGH polymorphism. As a result, the locus has only been fully characterized two times, severely limiting our knowledge of human IGH diversity. Here, we combine targeted long-read sequencing with a novel bioinformatics tool, IGenotyper, to fully characterize IGH variation in a haplotype-specific manner. We apply this approach to eight human samples, including a haploid cell line and two mother-father-child trios, and demonstrate the ability to generate high-quality assemblies (>98% complete and >99% accurate), genotypes, and gene annotations, identifying 2 novel structural variants and 15 novel IGH alleles. We show multiplexing allows for scaling of the approach without impacting data quality, and that our genotype call sets are more accurate than short-read (>35% increase in true positives and >97% decrease in false-positives) and array/imputation-based datasets. This framework establishes a desperately needed foundation for leveraging IG genomic data to study population-level variation in antibody-mediated immunity, critical for bettering our understanding of disease risk, and responses to vaccines and therapeutics.

SUBMITTER: Rodriguez OL 

PROVIDER: S-EPMC7539625 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

altmetric image

Publications


An incomplete ascertainment of genetic variation within the highly polymorphic immunoglobulin heavy chain locus (IGH) has hindered our ability to define genetic factors that influence antibody-mediated processes. Due to locus complexity, standard high-throughput approaches have failed to accurately and comprehensively capture IGH polymorphism. As a result, the locus has only been fully characterized two times, severely limiting our knowledge of human IGH diversity. Here, we combine targeted long  ...[more]

Similar Datasets

2013-06-01 | E-GEOD-47129 | biostudies-arrayexpress
2013-06-01 | GSE47129 | GEO
| S-EPMC2771211 | biostudies-literature
| S-EPMC7287348 | biostudies-literature
| S-EPMC3672396 | biostudies-literature
| S-EPMC3404511 | biostudies-literature
2018-12-21 | GSE113938 | GEO
| S-EPMC10362067 | biostudies-literature
| S-EPMC6380546 | biostudies-literature
| S-EPMC2212390 | biostudies-literature