Unknown

Dataset Information

0

FastSKAT: Sequence kernel association tests for very large sets of markers.


ABSTRACT: The sequence kernel association test (SKAT) is widely used to test for associations between a phenotype and a set of genetic variants that are usually rare. Evaluating tail probabilities or quantiles of the null distribution for SKAT requires computing the eigenvalues of a matrix related to the genotype covariance between markers. Extracting the full set of eigenvalues of this matrix (an n×n matrix, for n subjects) has computational complexity proportional to n3 . As SKAT is often used when n>104 , this step becomes a major bottleneck in its use in practice. We therefore propose fastSKAT, a new computationally inexpensive but accurate approximations to the tail probabilities, in which the k largest eigenvalues of a weighted genotype covariance matrix or the largest singular values of a weighted genotype matrix are extracted, and a single term based on the Satterthwaite approximation is used for the remaining eigenvalues. While the method is not particularly sensitive to the choice of k, we also describe how to choose its value, and show how fastSKAT can automatically alert users to the rare cases where the choice may affect results. As well as providing faster implementation of SKAT, the new method also enables entirely new applications of SKAT that were not possible before; we give examples grouping variants by topologically associating domains, and comparing chromosome-wide association by class of histone marker.

SUBMITTER: Lumley T 

PROVIDER: S-EPMC6129408 | biostudies-literature | 2018 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

FastSKAT: Sequence kernel association tests for very large sets of markers.

Lumley Thomas T   Brody Jennifer J   Peloso Gina G   Morrison Alanna A   Rice Kenneth K  

Genetic epidemiology 20180622 6


The sequence kernel association test (SKAT) is widely used to test for associations between a phenotype and a set of genetic variants that are usually rare. Evaluating tail probabilities or quantiles of the null distribution for SKAT requires computing the eigenvalues of a matrix related to the genotype covariance between markers. Extracting the full set of eigenvalues of this matrix (an n×n matrix, for n subjects) has computational complexity proportional to n<sup>3</sup> . As SKAT is often use  ...[more]

Similar Datasets

| S-EPMC3675243 | biostudies-literature
| S-EPMC4158946 | biostudies-other
| S-EPMC4679685 | biostudies-literature
| S-EPMC8495888 | biostudies-literature
| S-EPMC8278388 | biostudies-literature
| S-EPMC7015965 | biostudies-literature
| S-EPMC4724299 | biostudies-literature
| S-EPMC3272581 | biostudies-literature
| S-EPMC4221116 | biostudies-literature
| S-EPMC3778346 | biostudies-literature