Project description:AimsRecent studies have revealed that the interplay between polygenic risk scores (PRS) and large copy number variants (CNV; >500kb) is essential for the etiology of schizophrenia (SCZ). To replicate previous findings, including those for smaller CNV (>10kb), the PRS between SCZ patients with and without CNV were compared.MethodsThe PRS were calculated for 724 patients with SCZ and 1178 healthy controls (HC), genotyped using array-based comparative genomic hybridization and single nucleotide polymorphisms chips, and comparisons were made between cases and HC, or between subjects with and without 'clinically significant' CNV.ResultsFirst, we replicated the higher PRS in patients with SCZ compared to that in HC (without taking into account the CNV). For clinically significant CNV, as defined by the American College of Medical Genetics ('pathogenic' and 'uncertain clinical significance, likely pathogenic' CNV), 66 patients with SCZ carried clinically significant CNV, whereas 658 SCZ patients had no such CNV. In the comparison of PRS between cases with/without the CNV, despite no significant difference in PRS, significant enrichment of the well-established risk CNV (22q11.2 deletion and 47,XXY/47,XXX) was observed in the lowest decile of PRS in SCZ patients with the CNV.ConclusionAlthough the present study failed to replicate the significant difference in PRS between SCZ patients with and without clinically significant CNV, SCZ patients with well-established risk CNV tended to have a lower PRS. Therefore, we speculate that the CNV in SCZ patients with lower PRS may contain 'genuine' risk; PRS is a possible tool for prioritizing clinically significant CNV because the power of the CNV association analysis is limited due to their rarity.
Project description:Genome-wide association studies (GWASs) have identified at least 10 single-nucleotide polymorphisms (SNPs) associated with papillary thyroid cancer (PTC) risk. Most of these SNPs are common variants with small to moderate effect sizes. Here we assessed the combined genetic effects of these variants on PTC risk by using summarized GWAS results to build polygenic risk score (PRS) models in three PTC study groups from Ohio (1,544 patients and 1,593 controls), Iceland (723 patients and 129,556 controls), and the United Kingdom (534 patients and 407,945 controls). A PRS based on the 10 established PTC SNPs showed a stronger predictive power compared with the clinical factors model, with a minimum increase of area under the receiver-operating curve of 5.4 percentage points (P ≤ 1.0 × 10-9). Adding an extended PRS based on 592,475 common variants did not significantly improve the prediction power compared with the 10-SNP model, suggesting that most of the remaining undiscovered genetic risk in thyroid cancer is due to rare, moderate- to high-penetrance variants rather than to common low-penetrance variants. Based on the 10-SNP PRS, individuals in the top decile group of PRSs have a close to sevenfold greater risk (95% CI, 5.4-8.8) compared with the bottom decile group. In conclusion, PRSs based on a small number of common germline variants emphasize the importance of heritable low-penetrance markers in PTC.
Project description:Polygenic risk scores (PRS) can provide useful information for personalized risk stratification and disease risk assessment, especially when combined with non-genetic risk factors. However, their construction depends on the availability of summary statistics from genome-wide association studies (GWAS) independent from the target sample. For best compatibility, it was reported that GWAS and the target sample should match in terms of ancestries. Yet, GWAS, especially in the field of cancer, often lack diversity and are predominated by European ancestry. This bias is a limiting factor in PRS research. By using electronic health records and genetic data from the UK Biobank, we contrast the utility of breast and prostate cancer PRS derived from external European-ancestry-based GWAS across African, East Asian, European, and South Asian ancestry groups. We highlight differences in the PRS distributions of these groups that are amplified when PRS methods condense hundreds of thousands of variants into a single score. While European-GWAS-derived PRS were not directly transferrable across ancestries on an absolute scale, we establish their predictive potential when considering them separately within each group. For example, the top 10% of the breast cancer PRS distributions within each ancestry group each revealed significant enrichments of breast cancer cases compared to the bottom 90% (odds ratio of 2.81 [95%CI: 2.69,2.93] in European, 2.88 [1.85, 4.48] in African, 2.60 [1.25, 5.40] in East Asian, and 2.33 [1.55, 3.51] in South Asian individuals). Our findings highlight a compromise solution for PRS research to compensate for the lack of diversity in well-powered European GWAS efforts while recruitment of diverse participants in the field catches up.
Project description:BackgroundPolygenic risk scores (PRS) are valuable to translate the results of genome-wide association studies (GWAS) into clinical practice. To date, most GWAS have been based on individuals of European-ancestry leading to poor performance in populations of non-European ancestry.ResultsWe introduce the polygenic transcriptome risk score (PTRS), which is based on predicted transcript levels (rather than SNPs), and explore the portability of PTRS across populations using UK Biobank data.ConclusionsWe show that PTRS has a significantly higher portability (Wilcoxon p=0.013) in the African-descent samples where the loss of performance is most acute with better performance than PRS when used in combination.
Project description:Recently, polygenic risk scores (PRS) have been shown to be associated with certain complex diseases. The approach has been based on the contribution of counting multiple alleles associated with disease across independent loci, without requiring compelling evidence that every locus had already achieved definitive genome-wide statistical significance. Whether PRS assist in the prediction of risk of common cancers is unknown. We built PRS from lists of genetic markers prioritized by their association with breast cancer (BCa) or prostate cancer (PCa) in a training data set and evaluated whether these scores could improve current genetic prediction of these specific cancers in independent test samples. We used genome-wide association data on 1,145 BCa cases and 1,142 controls from the Nurses' Health Study and 1,164 PCa cases and 1,113 controls from the Prostate Lung Colorectal and Ovarian Cancer Screening Trial. Ten-fold cross validation was used to build and evaluate PRS with 10 to 60,000 independent single nucleotide polymorphisms (SNPs). For both BCa and PCa, the models that included only published risk alleles maximized the cross-validation estimate of the area under the ROC curve (0.53 for breast and 0.57 for prostate). We found no significant evidence that PRS using common variants improved risk prediction for BCa and PCa over replicated SNP scores.
Project description:PurposeNon-European populations are under-represented in genetics studies, hindering clinical implementation of breast cancer polygenic risk scores (PRSs). We aimed to develop PRSs using the largest available studies of Asian ancestry and to assess the transferability of PRS across ethnic subgroups.MethodsThe development data set comprised 138,309 women from 17 case-control studies. PRSs were generated using a clumping and thresholding method, lasso penalized regression, an Empirical Bayes approach, a Bayesian polygenic prediction approach, or linear combinations of multiple PRSs. These PRSs were evaluated in 89,898 women from 3 prospective studies (1592 incident cases).ResultsThe best performing PRS (genome-wide set of single-nucleotide variations [formerly single-nucleotide polymorphism]) had a hazard ratio per unit SD of 1.62 (95% CI = 1.46-1.80) and an area under the receiver operating curve of 0.635 (95% CI = 0.622-0.649). Combined Asian and European PRSs (333 single-nucleotide variations) had a hazard ratio per SD of 1.53 (95% CI = 1.37-1.71) and an area under the receiver operating curve of 0.621 (95% CI = 0.608-0.635). The distribution of the latter PRS was different across ethnic subgroups, confirming the importance of population-specific calibration for valid estimation of breast cancer risk.ConclusionPRSs developed in this study, from association data from multiple ancestries, can enhance risk stratification for women of Asian ancestry.
Project description:PurposePolygenic risk scores (PRS) summarize genome-wide genotype data into a single variable that produces an individual-level risk score for genetic liability. PRS has been used for prediction of chronic diseases and some risk factors. As PRS has been studied less for physical activity (PA), we constructed PRS for PA and studied how much variation in PA can be explained by this PRS in independent population samples.MethodsWe calculated PRS for self-reported and objectively measured PA using UK Biobank genome-wide association study summary statistics, and analyzed how much of the variation in self-reported (MET-hours per day) and measured (steps and moderate-to-vigorous PA minutes per day) PA could be accounted for by the PRS in the Finnish Twin Cohorts (FTC; N = 759-11,528) and the Northern Finland Birth Cohort 1966 (NFBC1966; N = 3263-4061). Objective measurement of PA was done with wrist-worn accelerometer in UK Biobank and NFBC1966 studies, and with hip-worn accelerometer in the FTC.ResultsThe PRS accounted from 0.07% to 1.44% of the variation (R) in the self-reported and objectively measured PA volumes (P value range = 0.023 to <0.0001) in the FTC and NFBC1966. For both self-reported and objectively measured PA, individuals in the highest PRS deciles had significantly (11%-28%) higher PA volumes compared with the lowest PRS deciles (P value range = 0.017 to <0.0001).ConclusionsPA is a multifactorial phenotype, and the PRS constructed based on UK Biobank results accounted for statistically significant but overall small proportion of the variation in PA in the Finnish cohorts. Using identical methods to assess PA and including less common and rare variants in the construction of PRS may increase the proportion of PA explained by the PRS.
Project description:Stratification of women according to their risk of breast cancer based on polygenic risk scores (PRSs) could improve screening and prevention strategies. Our aim was to develop PRSs, optimized for prediction of estrogen receptor (ER)-specific disease, from the largest available genome-wide association dataset and to empirically validate the PRSs in prospective studies. The development dataset comprised 94,075 case subjects and 75,017 control subjects of European ancestry from 69 studies, divided into training and validation sets. Samples were genotyped using genome-wide arrays, and single-nucleotide polymorphisms (SNPs) were selected by stepwise regression or lasso penalized regression. The best performing PRSs were validated in an independent test set comprising 11,428 case subjects and 18,323 control subjects from 10 prospective studies and 190,040 women from UK Biobank (3,215 incident breast cancers). For the best PRSs (313 SNPs), the odds ratio for overall disease per 1 standard deviation in ten prospective studies was 1.61 (95%CI: 1.57-1.65) with area under receiver-operator curve (AUC) = 0.630 (95%CI: 0.628-0.651). The lifetime risk of overall breast cancer in the top centile of the PRSs was 32.6%. Compared with women in the middle quintile, those in the highest 1% of risk had 4.37- and 2.78-fold risks, and those in the lowest 1% of risk had 0.16- and 0.27-fold risks, of developing ER-positive and ER-negative disease, respectively. Goodness-of-fit tests indicated that this PRS was well calibrated and predicts disease risk accurately in the tails of the distribution. This PRS is a powerful and reliable predictor of breast cancer risk that may improve breast cancer prevention programs.
Project description:To realize the goals of precision medicine in complex disease, discriminative clinical risk models are needed. One approach that has been proposed is polygenic risk scores (PRSs). PRSs incorporate information about inherited genetic risk for cancer, specifically those genetic variants that are common in the population. While PRSs are clearly associated with risk of cancer, there is an on-going debate on whether integrating PRSs into clinical practice have utility. Here, we present this important discussion to the cancer clinic. We argue that in cancer, the clinical utility of PRSs will depend on their actionability, or how such a score may guide clinical practice. In turn, the actionability depends on several factors. First, actionability depends on the discriminative power of the score, or how well it predicts who is at risk of the disease. Second, it depends on their comparative performance with respect to existing practice, as a score with good discriminative power will not be useful if there are better predictors used in the clinic. Finally, for a PRS to be useful there must also be available preventive actions. We discuss the strengths and challenges of utilizing a PRS in the context of each of these criteria, and provide insights on what is needed towards moving forward in translating PRSs into the cancer clinic. We further argue that in future studies, beyond predicting cancer risk, similarly developed PRS models may be of utility in predicting prognosis or treatment resistance.
Project description:BackgroundThe mutational profile of cancer reflects the activity of the mutagenic processes which have been operative throughout the lineage of the cancer cell. These processes leave characteristic profiles of somatic mutations called mutational signatures. Mutational signatures, including single-base substitution (SBS) signatures, may reflect the effects of exogenous or endogenous exposures.MethodsWe used polygenic risk scores (PRS) to summarize common germline variation associated with cancer risk and other cancer-related traits and examined the association between somatic mutational profiles and germline PRS in 12 cancer types from The Cancer Genome Atlas. Somatic mutational profiles were constructed from whole-exome sequencing data of primary tumors. PRS were calculated for the 12 selected cancer types and 9 non-cancer traits, including cancer risk determinants, hormonal factors, and immune-mediated inflammatory diseases, using germline genetic data and published summary statistics from genome-wide association studies.ResultsWe found 17 statistically significant associations between somatic mutational profiles and germline PRS after Bonferroni correction (p < 3.15 × 10-5), including positive associations between germline inflammatory bowel disease PRS and number of somatic mutations attributed to signature SBS1 in prostate cancer and APOBEC-related signatures in breast cancer. Positive associations were also found between age at menarche PRS and mutation counts of SBS1 in overall and estrogen receptor-positive breast cancer. Consistent with prior studies that found an inverse association between the pubertal development PRS and risk of prostate cancer, likely reflecting hormone-related mechanisms, we found an inverse association between age at menarche PRS and mutation counts of SBS1 in prostate cancer. Inverse associations were also found between several cancer PRS and tumor mutation counts.ConclusionsOur analysis suggests that there are robust associations between tumor somatic mutational profiles and germline PRS. These may reflect the mechanisms through hormone regulation and immune responses that contribute to cancer etiology and drive cancer progression.