Unknown

Dataset Information

0

Efficient study design for next generation sequencing.


ABSTRACT: Next Generation Sequencing represents a powerful tool for detecting genetic variation associated with human disease. Because of the high cost of this technology, it is critical that we develop efficient study designs that consider the trade-off between the number of subjects (n) and the coverage depth (µ). How we divide our resources between the two can greatly impact study success, particularly in pilot studies. We propose a strategy for selecting the optimal combination of n and µ for studies aimed at detecting rare variants and for studies aimed at detecting associations between rare or uncommon variants and disease. For detecting rare variants, we find the optimal coverage depth to be between 2 and 8 reads when using the likelihood ratio test. For association studies, we find the strategy of sequencing all available subjects to be preferable. In deriving these combinations, we provide a detailed analysis describing the distribution of depth across a genome and the depth needed to identify a minor allele in an individual. The optimal coverage depth depends on the aims of the study, and the chosen depth can have a large impact on study success.

SUBMITTER: Sampson J 

PROVIDER: S-EPMC3291790 | biostudies-literature | 2011 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Efficient study design for next generation sequencing.

Sampson Joshua J   Jacobs Kevin K   Yeager Meredith M   Chanock Stephen S   Chatterjee Nilanjan N  

Genetic epidemiology 20110501 4


Next Generation Sequencing represents a powerful tool for detecting genetic variation associated with human disease. Because of the high cost of this technology, it is critical that we develop efficient study designs that consider the trade-off between the number of subjects (n) and the coverage depth (µ). How we divide our resources between the two can greatly impact study success, particularly in pilot studies. We propose a strategy for selecting the optimal combination of n and µ for studies  ...[more]

Similar Datasets

2017-04-03 | PXD003804 | Pride
| S-EPMC4460749 | biostudies-literature
| S-EPMC3382444 | biostudies-literature
| S-EPMC4115663 | biostudies-literature
| S-EPMC9891242 | biostudies-literature
| S-EPMC5808213 | biostudies-literature
| S-EPMC4914850 | biostudies-literature
| S-EPMC5974784 | biostudies-other
| S-EPMC3526293 | biostudies-literature
| S-EPMC6035725 | biostudies-literature