Unknown

Dataset Information

0

Efficient and cost effective population resequencing by pooling and in-solution hybridization.


ABSTRACT: High-throughput sequencing of targeted genomic loci in large populations is an effective approach for evaluating the contribution of rare variants to disease risk. We evaluated the feasibility of using in-solution hybridization-based target capture on pooled DNA samples to enable cost-efficient population sequencing studies. For this, we performed pooled sequencing of 100 HapMap samples across ? 600 kb of DNA sequence using the Illumina GAIIx. Using our accurate variant calling method for pooled sequence data, we were able to not only identify single nucleotide variants with a low false discovery rate (<1%) but also accurately detect short insertion/deletion variants. In addition, with sufficient coverage per individual in each pool (30-fold) we detected 97.2% of the total variants and 93.6% of variants below 5% in frequency. Finally, allele frequencies for single nucleotide variants (SNVs) estimated from the pooled data and the HapMap genotype data were tightly correlated (correlation coefficient >? =? 0.995).

SUBMITTER: Bansal V 

PROVIDER: S-EPMC3068187 | biostudies-literature | 2011 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Efficient and cost effective population resequencing by pooling and in-solution hybridization.

Bansal Vikas V   Tewhey Ryan R   Leproust Emily M EM   Schork Nicholas J NJ  

PloS one 20110330 3


High-throughput sequencing of targeted genomic loci in large populations is an effective approach for evaluating the contribution of rare variants to disease risk. We evaluated the feasibility of using in-solution hybridization-based target capture on pooled DNA samples to enable cost-efficient population sequencing studies. For this, we performed pooled sequencing of 100 HapMap samples across ∼ 600 kb of DNA sequence using the Illumina GAIIx. Using our accurate variant calling method for pooled  ...[more]

Similar Datasets

| S-EPMC9477502 | biostudies-literature
2020-10-22 | GSE149666 | GEO
2020-01-14 | PXD012703 | Pride
2018-02-27 | GSE89663 | GEO
| S-EPMC4978437 | biostudies-literature
| S-EPMC6214898 | biostudies-literature
| S-EPMC9468466 | biostudies-literature
| S-EPMC7703793 | biostudies-literature
| S-EPMC9873549 | biostudies-literature
2016-04-18 | E-MTAB-4553 | biostudies-arrayexpress