Unknown

Dataset Information

0

Association Analysis and Meta-Analysis of Multi-Allelic Variants for Large-Scale Sequence Data.


ABSTRACT: There is great interest in understanding the impact of rare variants in human diseases using large sequence datasets. In deep sequence datasets of >10,000 samples, ~10% of the variant sites are observed to be multi-allelic. Many of the multi-allelic variants have been shown to be functional and disease-relevant. Proper analysis of multi-allelic variants is critical to the success of a sequencing study, but existing methods do not properly handle multi-allelic variants and can produce highly misleading association results. We discuss practical issues and methods to encode multi-allelic sites, conduct single-variant and gene-level association analyses, and perform meta-analysis for multi-allelic variants. We evaluated these methods through extensive simulations and the study of a large meta-analysis of ~18,000 samples on the cigarettes-per-day phenotype. We showed that our joint modeling approach provided an unbiased estimate of genetic effects, greatly improved the power of single-variant association tests among methods that can properly estimate allele effects, and enhanced gene-level tests over existing approaches. Software packages implementing these methods are available online.

SUBMITTER: Jiang Y 

PROVIDER: S-EPMC7288273 | biostudies-literature | 2020 May

REPOSITORIES: biostudies-literature

altmetric image

Publications


There is great interest in understanding the impact of rare variants in human diseases using large sequence datasets. In deep sequence datasets of >10,000 samples, ~10% of the variant sites are observed to be multi-allelic. Many of the multi-allelic variants have been shown to be functional and disease-relevant. Proper analysis of multi-allelic variants is critical to the success of a sequencing study, but existing methods do not properly handle multi-allelic variants and can produce highly misl  ...[more]

Similar Datasets

| S-EPMC6461877 | biostudies-literature
| S-EPMC4067555 | biostudies-literature
| S-EPMC4146673 | biostudies-literature
| S-EPMC5458077 | biostudies-literature
2020-11-18 | GSE156074 | GEO
| S-EPMC3282142 | biostudies-literature
| S-EPMC5501866 | biostudies-literature
| PRJNA377193 | ENA
| S-EPMC2795912 | biostudies-literature
| S-EPMC2672416 | biostudies-literature