Unknown

Dataset Information

0

Efficient and accurate frailty model approach for genome-wide survival association analysis in large-scale biobanks.


ABSTRACT: With decades of electronic health records linked to genetic data, large biobanks provide unprecedented opportunities for systematically understanding the genetics of the natural history of complex diseases. Genome-wide survival association analysis can identify genetic variants associated with ages of onset, disease progression and lifespan. We propose an efficient and accurate frailty model approach for genome-wide survival association analysis of censored time-to-event (TTE) phenotypes by accounting for both population structure and relatedness. Our method utilizes state-of-the-art optimization strategies to reduce the computational cost. The saddlepoint approximation is used to allow for analysis of heavily censored phenotypes (>90%) and low frequency variants (down to minor allele count 20). We demonstrate the performance of our method through extensive simulation studies and analysis of five TTE phenotypes, including lifespan, with heavy censoring rates (90.9% to 99.8%) on ~400,000 UK Biobank participants with white British ancestry and ~180,000 individuals in FinnGen. We further analyzed 871 TTE phenotypes in the UK Biobank and presented the genome-wide scale phenome-wide association results with the PheWeb browser.

SUBMITTER: Dey R 

PROVIDER: S-EPMC9481565 | biostudies-literature | 2022 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Efficient and accurate frailty model approach for genome-wide survival association analysis in large-scale biobanks.

Dey Rounak R   Zhou Wei W   Kiiskinen Tuomo T   Havulinna Aki A   Elliott Amanda A   Karjalainen Juha J   Kurki Mitja M   Qin Ashley A   Lee Seunggeun S   Palotie Aarno A   Neale Benjamin B   Daly Mark M   Lin Xihong X  

Nature communications 20220916 1


With decades of electronic health records linked to genetic data, large biobanks provide unprecedented opportunities for systematically understanding the genetics of the natural history of complex diseases. Genome-wide survival association analysis can identify genetic variants associated with ages of onset, disease progression and lifespan. We propose an efficient and accurate frailty model approach for genome-wide survival association analysis of censored time-to-event (TTE) phenotypes by acco  ...[more]

Similar Datasets

| S-EPMC8206161 | biostudies-literature
| S-EPMC1766938 | biostudies-literature
| S-EPMC5681697 | biostudies-literature
| S-EPMC4767558 | biostudies-other
| S-EPMC6508500 | biostudies-literature
| S-EPMC5449614 | biostudies-literature
| S-EPMC7806647 | biostudies-literature
| S-EPMC11020996 | biostudies-literature
| S-EPMC7261120 | biostudies-literature
| S-EPMC6527781 | biostudies-literature