Dataset Information

Improved Use of Small Reference Panels for Conditional and Joint Analysis with GWAS Summary Statistics.


ABSTRACT: Due to issues of practicality and confidentiality of genomic data sharing on a large scale, typically only meta- or mega-analyzed genome-wide association study (GWAS) summary data, not individual-level data, are publicly available. Reanalyses of such GWAS summary data for a wide range of applications have become increasingly common and useful; they often require an external reference panel with individual-level genotypic data to infer linkage disequilibrium (LD) among genetic variants. However, with a sample size of only hundreds, as for the most popular 1000 Genomes Project European sample, estimation errors for LD are not negligible, often leading to dramatically increased numbers of false positives in subsequent analyses of GWAS summary data. To alleviate the problem in the context of association testing for a group of SNPs, we propose an alternative estimator of the covariance matrix with an idea similar to multiple imputation. We use numerical examples based on both simulated and real data to demonstrate the severe problem with the use of the 1000 Genomes Project reference panels, and the improved performance of our new approach.
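
The sampling-error problem described in the abstract can be illustrated with a minimal simulation. The sketch below is not the authors' estimator (the abstract does not specify it); it only shows, under stated assumptions, how the error of a plain sample LD correlation depends on reference panel size. The haplotype frequencies, panel sizes, replicate count, and function names are all hypothetical choices made for illustration.

```python
import math
import random

# Hypothetical two-SNP haplotype frequencies (an assumption for illustration):
# each alternate allele has frequency 0.5 and D = 0.4 - 0.25 = 0.15,
# so the true LD correlation is r = 0.15 / 0.25 = 0.6.
HAPLOTYPES = [(0, 0), (0, 1), (1, 0), (1, 1)]
FREQS = [0.4, 0.1, 0.1, 0.4]
TRUE_R = 0.6


def simulate_panel(n, rng):
    """Draw 0/1/2 genotypes at two SNPs for n individuals (random mating)."""
    g1, g2 = [], []
    for _ in range(n):
        hap_a, hap_b = rng.choices(HAPLOTYPES, weights=FREQS, k=2)
        g1.append(hap_a[0] + hap_b[0])
        g2.append(hap_a[1] + hap_b[1])
    return g1, g2


def pearson(x, y):
    """Sample Pearson correlation, used here as the LD estimate."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def rmse_of_ld_estimate(n, replicates, seed):
    """Root-mean-square error of the LD estimate over repeated panels of size n."""
    rng = random.Random(seed)
    squared_errors = [
        (pearson(*simulate_panel(n, rng)) - TRUE_R) ** 2
        for _ in range(replicates)
    ]
    return math.sqrt(sum(squared_errors) / replicates)


# Panel sizes are illustrative: "hundreds" versus "thousands" of individuals.
rmse_small = rmse_of_ld_estimate(n=400, replicates=50, seed=1)
rmse_large = rmse_of_ld_estimate(n=4000, replicates=50, seed=1)
print(f"RMSE of r-hat with n=400:  {rmse_small:.4f}")
print(f"RMSE of r-hat with n=4000: {rmse_large:.4f}")
```

In this toy setting the error of the pairwise estimate shrinks roughly with the square root of the panel size. For a group of many SNPs, such errors accumulate across every entry of the estimated covariance matrix, which is the setting the abstract is concerned with.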

SUBMITTER: Deng Y 

PROVIDER: S-EPMC5972416 | biostudies-literature | 2018 Jun

REPOSITORIES: biostudies-literature

Publications

Improved Use of Small Reference Panels for Conditional and Joint Analysis with GWAS Summary Statistics.

Deng Yangqing, Pan Wei

Genetics, 2018-04-19, issue 2


Similar Datasets

| S-EPMC3593158 | biostudies-literature
| S-EPMC5536980 | biostudies-literature
| S-EPMC5714448 | biostudies-literature
| S-EPMC8654883 | biostudies-literature
| S-EPMC9884206 | biostudies-literature
| S-EPMC10435383 | biostudies-literature
| S-EPMC10461826 | biostudies-literature
| S-EPMC6417431 | biostudies-literature
| S-EPMC8419981 | biostudies-literature
| S-EPMC7332650 | biostudies-literature