Dataset Information

Impact of three Illumina library construction methods on GC bias and HLA genotype calling.


ABSTRACT: Next-generation sequencing (NGS) is increasingly recognized for its ability to overcome allele ambiguity and deliver high-resolution typing in the HLA system. With this technology, however, non-uniform read distribution can reduce the reliability of variant detection, which makes high-confidence genotype calling particularly difficult to achieve in the polymorphic HLA complex. Recently, library construction has been implicated as the dominant source of coverage bias. To study the impact of this phenomenon on HLA genotyping, we performed long-range PCR on 12 samples to amplify HLA-A, -B, -C, -DRB1, and -DQB1, and compared the relative contribution of three Illumina library construction methods (TruSeq Nano, Nextera, Nextera XT) to downstream bias. Here, we show that high GC% is a good predictor of low sequencing depth. Compared to standard TruSeq Nano, GC bias was more prominent in transposase-based protocols, particularly Nextera XT, likely through a combination of transposase insertion bias and a high number of PCR enrichment cycles. Importantly, our findings demonstrate that non-uniform read depth can have a direct and negative impact on the robustness of HLA genotyping, which has clinical implications for users choosing a library construction strategy that balances cost and throughput with data quality.
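
The relationship between GC% and sequencing depth described in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names, the window size, the toy amplicon sequence, and the depth values are all hypothetical and chosen only to show how windowed GC content might be correlated with mean depth.

```python
from math import sqrt


def gc_percent(seq):
    """Return the percentage of G and C bases in a sequence."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def windowed_gc(seq, window):
    """GC% for consecutive, non-overlapping windows of a fixed size."""
    return [
        gc_percent(seq[i:i + window])
        for i in range(0, len(seq) - window + 1, window)
    ]


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Toy amplicon (hypothetical): AT-rich, mixed, and GC-rich 10-base windows.
amplicon = "ATATATATAT" + "ATGCATGCAT" + "GCGCGCGCGC"

# Made-up mean read depth per window, for illustration only.
mean_depth = [900.0, 600.0, 150.0]

gc = windowed_gc(amplicon, window=10)
r = pearson(gc, mean_depth)

for gc_value, depth in zip(gc, mean_depth):
    print(f"GC% = {gc_value:5.1f}  mean depth = {depth:6.1f}")
print(f"Pearson r = {r:.3f}")
```

A strongly negative r on real per-window data would be consistent with high GC% predicting low depth. In practice the depth values would come from an alignment (for example, per-base coverage averaged over each window) rather than from a hand-written list.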

SUBMITTER: Lan JH 

PROVIDER: S-EPMC5089167 | biostudies-literature | 2015 Mar

REPOSITORIES: biostudies-literature

Publications

Impact of three Illumina library construction methods on GC bias and HLA genotype calling.

Lan JH, Yin Y, Reed EF, Moua K, Thomas K, Zhang Q

Human Immunology, 2014-12-25, issue 2-3


Similar Datasets

| S-EPMC2732271 | biostudies-literature
| S-EPMC5808798 | biostudies-literature
| S-EPMC3535703 | biostudies-literature
| S-EPMC6389247 | biostudies-literature
| S-EPMC2946280 | biostudies-literature
2018-02-01 | GSE100127 | GEO
2015-05-15 | E-GEOD-67053 | biostudies-arrayexpress
| S-EPMC6848508 | biostudies-literature
2015-05-15 | GSE67053 | GEO
| S-EPMC4237435 | biostudies-literature