Unknown

Dataset Information

0

Genetic risk prediction using a spatial autoregressive model with adaptive lasso.


ABSTRACT: With rapidly evolving high-throughput technologies, studies are being initiated to accelerate the process toward precision medicine. The collection of the vast amounts of sequencing data provides us with great opportunities to systematically study the role of a deep catalog of sequencing variants in risk prediction. Nevertheless, the massive amount of noise signals and low frequencies of rare variants in sequencing data pose great analytical challenges on risk prediction modeling. Motivated by the development in spatial statistics, we propose a spatial autoregressive model with adaptive lasso (SARAL) for risk prediction modeling using high-dimensional sequencing data. The SARAL is a set-based approach, and thus, it reduces the data dimension and accumulates genetic effects within a single-nucleotide variant (SNV) set. Moreover, it allows different SNV sets having various magnitudes and directions of effect sizes, which reflects the nature of complex diseases. With the adaptive lasso implemented, SARAL can shrink the effects of noise SNV sets to be zero and, thus, further improve prediction accuracy. Through simulation studies, we demonstrate that, overall, SARAL is comparable to, if not better than, the genomic best linear unbiased prediction method. The method is further illustrated by an application to the sequencing data from the Alzheimer's Disease Neuroimaging Initiative.

SUBMITTER: Wen Y 

PROVIDER: S-EPMC6492943 | biostudies-literature | 2018 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Genetic risk prediction using a spatial autoregressive model with adaptive lasso.

Wen Yalu Y   Shen Xiaoxi X   Lu Qing Q  

Statistics in medicine 20180531 26


With rapidly evolving high-throughput technologies, studies are being initiated to accelerate the process toward precision medicine. The collection of the vast amounts of sequencing data provides us with great opportunities to systematically study the role of a deep catalog of sequencing variants in risk prediction. Nevertheless, the massive amount of noise signals and low frequencies of rare variants in sequencing data pose great analytical challenges on risk prediction modeling. Motivated by t  ...[more]

Similar Datasets

| S-EPMC11207171 | biostudies-literature
| S-EPMC8082466 | biostudies-literature
| S-EPMC9140222 | biostudies-literature
| S-EPMC7065407 | biostudies-literature
| S-EPMC7523642 | biostudies-literature
| S-EPMC10948277 | biostudies-literature
| S-EPMC8065141 | biostudies-literature
| S-EPMC3470705 | biostudies-literature
| S-EPMC6364846 | biostudies-literature
| S-EPMC4188068 | biostudies-literature