
Dataset Information


SR-TWAS: leveraging multiple reference panels to improve transcriptome-wide association study power by ensemble machine learning.


ABSTRACT: Multiple reference panels of a given tissue or multiple tissues often exist, and multiple regression methods could be used for training gene expression imputation models for transcriptome-wide association studies (TWAS). To leverage expression imputation models (i.e., base models) trained with multiple reference panels, regression methods, and tissues, we develop a Stacked Regression based TWAS (SR-TWAS) tool which can obtain optimal linear combinations of base models for a given validation transcriptomic dataset. Both simulation and real studies show that SR-TWAS improves power, due to increased training sample sizes and borrowed strength across multiple regression methods and tissues. Leveraging base models across multiple reference panels, tissues, and regression methods, our real studies identify 6 independent significant risk genes for Alzheimer's disease (AD) dementia for supplementary motor area tissue and 9 independent significant risk genes for Parkinson's disease (PD) for substantia nigra tissue. Relevant biological interpretations are found for these significant risk genes.
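The stacking step described in the abstract, finding a linear combination of base-model predictions that best fits a validation expression dataset, can be illustrated with a minimal sketch. This is not the SR-TWAS implementation. It assumes, for illustration only, that the weights are non-negative and sum to one, that the criterion is mean squared error, and that a plain projected-gradient solver is good enough. The function names `project_to_simplex` and `stack_weights` and the toy data are hypothetical.

```python
def project_to_simplex(v):
    """Euclidean projection of v onto {w : w_k >= 0, sum(w) = 1}."""
    u = sorted(v, reverse=True)
    cumulative, theta = 0.0, 0.0
    for i, u_i in enumerate(u, start=1):
        cumulative += u_i
        t = (cumulative - 1.0) / i
        if u_i - t > 0:
            theta = t  # keep the threshold from the last index that qualifies
    return [max(x - theta, 0.0) for x in v]


def stack_weights(base_preds, observed, steps=500):
    """Estimate stacking weights for K base models (illustrative assumption).

    base_preds: list of K lists, each holding one base model's predicted
                expression for the n validation samples.
    observed:   list of n observed expression values in the validation data.
    Minimizes mean squared error by projected gradient descent, with the
    weights constrained to be non-negative and to sum to one.
    """
    k, n = len(base_preds), len(observed)
    w = [1.0 / k] * k
    # Conservative step size: the sum of squared predictions bounds the
    # largest eigenvalue of P^T P, so 1 / lipschitz is a safe step.
    lipschitz = 2.0 / n * sum(p * p for preds in base_preds for p in preds)
    step = 1.0 / (lipschitz or 1.0)
    for _ in range(steps):
        fitted = [sum(w[j] * base_preds[j][i] for j in range(k)) for i in range(n)]
        resid = [fitted[i] - observed[i] for i in range(n)]
        grad = [2.0 / n * sum(base_preds[j][i] * resid[i] for i in range(n))
                for j in range(k)]
        w = project_to_simplex([w[j] - step * grad[j] for j in range(k)])
    return w


# Toy data: two hypothetical base models and a validation target built as
# 0.7 * model_a + 0.3 * model_b, so the recovered weights should be close
# to those values.
model_a = [1.0, 0.0, 2.0, -1.0, 0.5, -2.0]
model_b = [0.0, 1.0, -1.0, 2.0, 1.0, 0.5]
validation = [0.7 * a + 0.3 * b for a, b in zip(model_a, model_b)]

weights = stack_weights([model_a, model_b], validation)
print([round(x, 4) for x in weights])
```

In this sketch, a base model that contributes nothing to the validation fit receives a weight near zero, while base models that carry complementary signal share the weight between them.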

SUBMITTER: Parrish RL 

PROVIDER: S-EPMC11300466 | biostudies-literature | 2024 Aug

REPOSITORIES: biostudies-literature


Publications

SR-TWAS: leveraging multiple reference panels to improve transcriptome-wide association study power by ensemble machine learning.

Randy L Parrish, Aron S Buchman, Shinya Tasaki, Yanling Wang, Denis Avey, Jishu Xu, Philip L De Jager, David A Bennett, Michael P Epstein, Jingjing Yang

Nature Communications, 2024 Aug 5, issue 1



Similar Datasets

| S-EPMC10327185 | biostudies-literature
| S-EPMC9992663 | biostudies-literature
| S-EPMC10764714 | biostudies-literature
| S-EPMC6323418 | biostudies-literature
| S-EPMC6324190 | biostudies-literature
| S-EPMC8532302 | biostudies-literature
| S-EPMC7815257 | biostudies-literature
| S-EPMC9487600 | biostudies-literature
| S-EPMC8887641 | biostudies-literature
| S-EPMC9825460 | biostudies-literature