Project description:Multi-task deep learning (DL) models can accurately predict diverse genomic marks from sequence, but whether these models learn the causal relationships between genomic marks is unknown. Here, we describe Deep Mendelian Randomization (DeepMR), a method for estimating causal relationships between genomic marks learned by genomic DL models. By combining Mendelian randomization with in silico mutagenesis, DeepMR obtains local (locus specific) and global estimates of (an assumed) linear causal relationship between marks. In a simulation designed to test recovery of pairwise causal relations between transcription factors (TFs), DeepMR gives accurate and unbiased estimates of the 'true' global causal effect, but its coverage decays in the presence of sequence-dependent confounding. We then apply DeepMR to examine the global relationships learned by a state-of-the-art DL model, BPNet, between TFs involved in reprogramming. DeepMR's causal effect estimates validate previously hypothesized relationships between TFs and suggest new relationships for future investigation.
Project description:With the rapidly increasing availability of large genetic data sets in recent years, Mendelian Randomization (MR) has quickly gained popularity as a novel secondary analysis method. Leveraging genetic variants as instrumental variables, MR can be used to estimate the causal effects of one phenotype on another even when experimental research is not feasible, and therefore has the potential to be highly informative. It is dependent on strong assumptions however, often producing biased results if these are not met. It is therefore imperative that these assumptions are well-understood by researchers aiming to use MR, in order to evaluate their validity in the context of their analyses and data. The aim of this perspective is therefore to further elucidate these assumptions and the role they play in MR, as well as how different kinds of data can be used to further support them.
Project description:BACKGROUND: Vitamin D deficiency is associated with increased cardiovascular disease risk in observational studies. Whether these associations are causal is not clear. Loss-of-function mutations in the filaggrin gene result in up to 10% higher serum vitamin D concentrations, supposedly due to a decreased UV-protection of the keratinocytes. We used a Mendelian randomization approach to estimate the causal effect of vitamin D status on serum lipids, blood pressure, body mass index, waist circumference, and the metabolic syndrome. METHODS: Three population based studies were included, Monica10 (2,656 individuals aged 40-71 years), Inter99 (6,784 individuals aged 30-60 years), and Health2006 (3,471 individuals aged 18-69 years) conducted in 1993-94, 1999-2001, and 2006-2008, respectively. Participants were genotyped for the two most common filaggrin gene mutations in European descendants R501X and 2282del4, in all three studies and further for the R2447X mutation in the Inter99 and Health2006 studies. Filaggrin genotype was used as instrumental variable for vitamin D status. Baseline measurements of serum 25-hydroxyvitamin D were performed in all three studies. RESULTS: Instrumental variable analyses showed a 23.8% (95% confidence interval, CI 3.0, 48.6) higher HDL cholesterol level and a 30.5% (95% CI: 0.8, 51.3) lower serum level of triglycerides per doubling of vitamin D. These associations were, however, not statistically significant when applying the Bonferroni adjusted significance level. The remaining lipids showed non-significant changes in a favorable direction. Doubling of vitamin D gave a non-significantly lower odds ratio = 0.26 (95% CI: 0.06, 1.17) of the metabolic syndrome. There were no statistically significant causal effects of vitamin D status on blood pressure, body mass index, or waist circumference. CONCLUSION: Our results support a causal effect of higher vitamin D status on a more favorable lipid profile, although more studies in other populations are needed to confirm our results.
Project description:Alzheimer's disease (AD) is mainly a neurodegenerative sickness. The primary characteristics are neuronal atrophy, amyloid deposition, and cognitive, behavioral, and psychiatric disorders. Numerous machine learning (ML) algorithms have been investigated and applied to AD identification over the past decades, emphasizing the subtle prodromal stage of mild cognitive impairment (MCI) to assess critical features that distinguish the disease's early manifestation and instruction for early detection and treatment. Identifying early MCI (EMCI) remains challenging due to the difficulty in distinguishing patients with cognitive normality from those with MCI. As a result, most classification algorithms for these two groups perform poorly. This paper proposes a hybrid Deep Learning Approach for the early detection of Alzheimer's disease. A method for early AD detection using multimodal imaging and Convolutional Neural Network with the Long Short-term memory algorithm combines magnetic resonance imaging (MRI), positron emission tomography (PET), and standard neuropsychological test scores. The proposed methodology updates the learning weights, and Adam's optimization is used to increase accuracy. The system has an unparalleled accuracy of 98.5% in classifying cognitively normal controls from EMCI. These results imply that deep neural networks may be trained to automatically discover imaging biomarkers indicative of AD and use them to identify the illness accurately.
Project description:The objective of this study was to explore the potential genetic link between Crohn's disease and breast cancer, with a focus on identifying druggable genes that may have therapeutic relevance. We assessed the causal relationship between these diseases through Mendelian randomization and investigated gene-drug interactions using computational predictions. This study sought to identify common genetic pathways possibly involved in immune responses and cancer progression, providing a foundation for future targeted treatment research. The dataset comprises single nucleotide polymorphisms used as instrumental variables for Crohn's disease, analyzed to explore their possible impact on breast cancer risk. Gene ontology and pathway enrichment analyses were conducted to identify genes shared between the two conditions, supported by protein-protein interaction networks, colocalization analyses, and deep learning-based predictions of gene-drug interactions. The identified hub genes and predicted gene-drug interactions offer preliminary insights into possible therapeutic targets for breast cancer and immune-related conditions. This dataset may be valuable for researchers studying genetic links between autoimmune diseases and cancer and for those interested in the early identification of potential drug targets.
Project description:ObjectiveEfforts to prevent depression, the leading cause of disability worldwide, have focused on a limited number of candidate factors. Using phenotypic and genomic data from over 100,000 UK Biobank participants, the authors sought to systematically screen and validate a wide range of potential modifiable factors for depression.MethodsBaseline data were extracted for 106 modifiable factors, including lifestyle (e.g., exercise, sleep, media, diet), social (e.g., support, engagement), and environmental (e.g., green space, pollution) variables. Incident depression was defined as minimal depressive symptoms at baseline and clinically significant depression at follow-up. At-risk individuals for incident depression were identified by polygenic risk scores or by reported traumatic life events. An exposure-wide association scan was conducted to identify factors associated with incident depression in the full sample and among at-risk individuals. Two-sample Mendelian randomization was then used to validate potentially causal relationships between identified factors and depression.ResultsNumerous factors across social, sleep, media, dietary, and exercise-related domains were prospectively associated with depression, even among at-risk individuals. However, only a subset of factors was supported by Mendelian randomization evidence, including confiding in others (odds ratio=0.76, 95% CI=0.67, 0.86), television watching time (odds ratio=1.09, 95% CI=1.05, 1.13), and daytime napping (odds ratio=1.34, 95% CI=1.17, 1.53).ConclusionsUsing a two-stage approach, this study validates several actionable targets for preventing depression. It also demonstrates that not all factors associated with depression in observational research may translate into robust targets for prevention. A large-scale exposure-wide approach combined with genetically informed methods for causal inference may help prioritize strategies for multimodal prevention in psychiatry.
Project description:Stroke and Alzheimer's disease (AD) are common neurological diseases. Several exiting studies indicated that late onset-AD and ischemic stroke have shared genetic links. Different kinds of stroke have different mechanisms. However, it remains unclear whether there is a causal relationship between different types of strokes, including any stroke (AS), any ischemic stroke (AIS), large-artery atherosclerotic stroke (LAS), and cardio-embolic stroke (CES), and AD. Herein, we conducted several Mendelian randomization (MR) studies to explore genetically causal link of different kinds of strokes and AD. The results for inverse-variance weighted (IVW) meta-analysis (β = -0.039, OR = 0.9618, and P-value = 0.750) and weighted median regression (WMR) (β = -0.156, OR = 0.8556, and P-value = 0.274) demonstrated that AS is not causally associated with AD risk. The result of MR-Egger regression (β = -1.312, P-value = 0.098) and intercept term (P-value = 0.105) illustrated no pleiotropy in this MR study. According to the results for IVW (P-value = 0.305, β = -0.103, and OR = 0.9021) and WMR (P-value = 0.487, β = -0.092, and OR = 0.9121) in the MR study between AIS and AD, there is no causal association between AIS and AD risk. In addition, the MR-Egger regression (P-value = 0.290 and β = -0.512) and intercept term (P-value = 0.387) showed no potential pleiotropy. LAS is not causally associated with AD risk according to the MR results (IVW: P-value = 0.568, β = 0.037, and OR = 1.0377; WMR: P-value = 0.793, β = -0.022, and OR = 0.9782). Additionally, the results of MR-Egger regression (P-value = 0.122 and β = -1.220) and intercept term (P-value = 0.110) showed no potential pleiotropy. Our results [IVW: P-value = 0.245, β = -0.064, and OR = 0.938; WMR: P-value = 0.331, β = -0.057, and OR = 0.9446; MR-Egger: P-value = 0.673 and β = -0.062, and intercept term (P-value = 0.985)] further demonstrated there is no causal link between CES and AD and no pleiotropy in this MR study. In conclusion, different types of stroke, including AS, AIS, LAS, and CES, would not be causally associated with AD risk.
Project description:Background The convolutional neural network (CNN) is a mainstream deep learning (DL) algorithm, and it has gained great fame in solving problems from clinical examination and diagnosis, such as Alzheimer's disease (AD). AD is a degenerative disease difficult to clinical diagnosis due to its unclear underlying pathological mechanism. Previous studies have primarily focused on investigating structural abnormalities in the brain's functional networks related to the AD or proposing different deep learning approaches for AD classification. Objective The aim of this study is to leverage the advantages of combining brain topological features extracted from functional network exploration and deep features extracted by the CNN. We establish a novel fMRI-based classification framework that utilizes Resting-state functional magnetic resonance imaging (rs-fMRI) with the phase synchronization index (PSI) and 2D-CNN to detect abnormal brain functional connectivity in AD. Methods First, PSI was applied to construct the brain network by region of interest (ROI) signals obtained from data preprocessing stage, and eight topological features were extracted. Subsequently, the 2D-CNN was applied to the PSI matrix to explore the local and global patterns of the network connectivity by extracting eight deep features from the 2D-CNN convolutional layer. Results Finally, classification analysis was carried out on the combined PSI and 2D-CNN methods to recognize AD by using support vector machine (SVM) with 5-fold cross-validation strategy. It was found that the classification accuracy of combined method achieved 98.869%. Conclusion These findings show that our framework can adaptively combine the best brain network features to explore network synchronization, functional connections, and characterize brain functional abnormalities, which could effectively detect AD anomalies by the extracted features that may provide new insights into exploring the underlying pathogenesis of AD.
Project description:Epidemiological studies have implicated systemic inflammation in the development of Alzheimer's disease (AD). However, these observations have been subject to residual confounding and reverse causation. We applied Mendelian randomization approaches to address this. We did not identify any causal associations between serum interleukin (IL)-18, IL-1ra, IL-6, or erythrocyte sedimentation rate (ESR) concentrations and AD. Our findings are limited by the low number of available instruments, though some of those identified (e.g., IL-6) were of sufficient power to indicate true negative results. Taken together, it appears there is no evidence for a causal association between these serum inflammatory cytokines and AD.
Project description:The goal of the study was to identify genes whose aberrant expression can contribute to diabetic retinopathy. We determined differential response in gene expression to high glucose in lymphoblastoid cell lines derived from matched type 1 diabetic individuals with and without retinopathy. Those genes exhibiting the largest difference in glucose response between diabetic subjects with and without retinopathy were assessed for association to diabetic retinopathy utilizing genotype data from a meta-genome-wide association study. All genetic variants associated with gene expression (expression QTLs; eQTLs) of the glucose response genes were tested for association with diabetic retinopathy. We detected an enrichment of the glucose response gene eQTLs among small association p-values for diabetic retinopathy. Among these, we identified FLCN as a susceptibility gene for diabetic retinopathy. Expression of FLCN in response to glucose is greater in individuals with diabetic retinopathy compared to diabetic individuals without retinopathy. Three large, independent cohorts of diabetic individuals revealed an enhanced association of FLCN eQTL to diabetic retinopathy. Mendelian randomization confirmed a direct positive effect of increased FLCN expression on retinopathy in diabetic individuals. Together, our studies integrating genetic association and gene expression implicate FLCN as a disease gene in diabetic retinopathy.