Unknown

Dataset Information

0

Enriching targeted sequencing experiments for rare disease alleles.


ABSTRACT:

Motivation

Next-generation targeted resequencing of genome-wide association study (GWAS)-associated genomic regions is a common approach for follow-up of indirect association of common alleles. However, it is prohibitively expensive to sequence all the samples from a well-powered GWAS study with sufficient depth of coverage to accurately call rare genotypes. As a result, many studies may use next-generation sequencing for single nucleotide polymorphism (SNP) discovery in a smaller number of samples, with the intent to genotype candidate SNPs with rare alleles captured by resequencing. This approach is reasonable, but may be inefficient for rare alleles if samples are not carefully selected for the resequencing experiment.

Results

We have developed a probability-based approach, SampleSeq, to select samples for a targeted resequencing experiment that increases the yield of rare disease alleles substantially over random sampling of cases or controls or sampling based on genotypes at associated SNPs from GWAS data. This technique allows for smaller sample sizes for resequencing experiments, or allows the capture of rarer risk alleles. When following up multiple regions, SampleSeq selects subjects with an even representation of all the regions. SampleSeq also can be used to calculate the sample size needed for the resequencing to increase the chance of successful capture of rare alleles of desired frequencies.

Software

http://biostat.mc.vanderbilt.edu/SampleSeq

SUBMITTER: Edwards TL 

PROVIDER: S-EPMC3137214 | biostudies-literature | 2011 Aug

REPOSITORIES: biostudies-literature

altmetric image

Publications

Enriching targeted sequencing experiments for rare disease alleles.

Edwards Todd L TL   Song Zhuo Z   Li Chun C  

Bioinformatics (Oxford, England) 20110623 15


<h4>Motivation</h4>Next-generation targeted resequencing of genome-wide association study (GWAS)-associated genomic regions is a common approach for follow-up of indirect association of common alleles. However, it is prohibitively expensive to sequence all the samples from a well-powered GWAS study with sufficient depth of coverage to accurately call rare genotypes. As a result, many studies may use next-generation sequencing for single nucleotide polymorphism (SNP) discovery in a smaller number  ...[more]

Similar Datasets

| PRJNA507828 | ENA
| PRJNA507829 | ENA
| S-EPMC4319990 | biostudies-literature
| S-EPMC6947341 | biostudies-literature
| S-EPMC1952108 | biostudies-literature
| S-EPMC6071590 | biostudies-other
| S-EPMC4497921 | biostudies-literature
| S-EPMC6900361 | biostudies-literature
| S-EPMC6602470 | biostudies-literature
2019-08-01 | E-MTAB-3513 | biostudies-arrayexpress