Unknown

Dataset Information

0

A novel support vector machine-based approach for rare variant detection.


ABSTRACT: Advances in next-generation sequencing technologies have enabled the identification of multiple rare single nucleotide polymorphisms involved in diseases or traits. Several strategies for identifying rare variants that contribute to disease susceptibility have recently been proposed. An important feature of many of these statistical methods is the pooling or collapsing of multiple rare single nucleotide variants to achieve a reasonably high frequency and effect. However, if the pooled rare variants are associated with the trait in different directions, then the pooling may weaken the signal, thereby reducing its statistical power. In the present paper, we propose a backward support vector machine (BSVM)-based variant selection procedure to identify informative disease-associated rare variants. In the selection procedure, the rare variants are weighted and collapsed according to their positive or negative associations with the disease, which may be associated with common variants and rare variants with protective, deleterious, or neutral effects. This nonparametric variant selection procedure is able to account for confounding factors and can also be adopted in other regression frameworks. The results of a simulation study and a data example show that the proposed BSVM approach is more powerful than four other approaches under the considered scenarios, while maintaining valid type I errors.

SUBMITTER: Fang YH 

PROVIDER: S-EPMC3737136 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

altmetric image

Publications

A novel support vector machine-based approach for rare variant detection.

Fang Yao-Hwei YH   Chiu Yen-Feng YF  

PloS one 20130807 8


Advances in next-generation sequencing technologies have enabled the identification of multiple rare single nucleotide polymorphisms involved in diseases or traits. Several strategies for identifying rare variants that contribute to disease susceptibility have recently been proposed. An important feature of many of these statistical methods is the pooling or collapsing of multiple rare single nucleotide variants to achieve a reasonably high frequency and effect. However, if the pooled rare varia  ...[more]

Similar Datasets

| S-EPMC4143758 | biostudies-literature
| S-EPMC8894221 | biostudies-literature
| S-EPMC4537225 | biostudies-literature
| S-EPMC6205516 | biostudies-literature
| S-EPMC7202572 | biostudies-literature
| S-EPMC9189781 | biostudies-literature
| S-EPMC6311282 | biostudies-literature
| S-EPMC2923853 | biostudies-literature
| S-EPMC7451646 | biostudies-literature
| S-EPMC4058169 | biostudies-other