Project description:Human genetic variants predicted to cause loss-of-function of protein-coding genes (pLoF variants) provide natural in vivo models of human gene inactivation and can be valuable indicators of gene function and the potential toxicity of therapeutic inhibitors targeting these genes1,2. Gain-of-kinase-function variants in LRRK2 are known to significantly increase the risk of Parkinson's disease3,4, suggesting that inhibition of LRRK2 kinase activity is a promising therapeutic strategy. While preclinical studies in model organisms have raised some on-target toxicity concerns5-8, the biological consequences of LRRK2 inhibition have not been well characterized in humans. Here, we systematically analyze pLoF variants in LRRK2 observed across 141,456 individuals sequenced in the Genome Aggregation Database (gnomAD)9, 49,960 exome-sequenced individuals from the UK Biobank and over 4 million participants in the 23andMe genotyped dataset. After stringent variant curation, we identify 1,455 individuals with high-confidence pLoF variants in LRRK2. Experimental validation of three variants, combined with previous work10, confirmed reduced protein levels in 82.5% of our cohort. We show that heterozygous pLoF variants in LRRK2 reduce LRRK2 protein levels but that these are not strongly associated with any specific phenotype or disease state. Our results demonstrate the value of large-scale genomic databases and phenotyping of human loss-of-function carriers for target validation in drug discovery.
Project description:Importance:Pathogenic variants in LRRK2 are a relatively common genetic cause of Parkinson disease (PD). Currently, the molecular mechanism underlying disease is unknown, and gain and loss of function (LOF) models of pathogenesis have been postulated. LRRK2 variants are reported to result in enhanced phosphorylation of substrates and increased cell death. However, the double knockout of Lrrk2 and its homologue Lrrk1 results in neurodegeneration in a mouse model, suggesting that disease may occur by LOF. Because LRRK2 inhibitors are currently in development as potential disease-modifying treatments in PD, it is critical to determine whether LOF variants in LRRK2 increase or decrease the risk of PD. Objective:To determine whether LRRK1 and LRRK2 LOF variants contribute to the risk of developing PD. Design, Setting, and Participants:To determine the prevailing mechanism of LRRK2-mediated disease in human populations, next-generation sequencing data from a large case-control cohort (>23?000 individuals) was analyzed for LOF variants in LRRK1 and LRRK2. Data were generated at 5 different sites and 5 different data sets, including cases with clinically diagnosed PD and neurologically normal control individuals. Data were collected from 2012 through 2017. Main Outcomes and Measures:Frequencies of LRRK1 and LRRK2 LOF variants present in the general population and compared between cases and controls. Results:Among 11?095 cases with PD and 12?615 controls, LRRK1 LOF variants were identified in 0.205% of cases and 0.139% of controls (odds ratio,?1.48; SE,?0.571; 95% CI, 0.45-4.44; P?=?.49) and LRRK2 LOF variants were found in 0.117% of cases and 0.087% of controls (odds ratio, 1.48; SE,?0.431; 95% CI, 0.63-3.50; P?=?.36). All association tests suggested lack of association between LRRK1 or LRRK2 variants and PD. Further analysis of lymphoblastoid cell lines from several heterozygous LOF variant carriers found that, as expected, LRRK2 protein levels are reduced by approximately half compared with wild-type alleles. Conclusions and Relevance:Together these findings indicate that haploinsufficiency of LRRK1 or LRRK2 is neither a cause of nor protective against PD. Furthermore, these results suggest that kinase inhibition or allele-specific targeting of mutant LRRK2 remain viable therapeutic strategies in PD.
Project description:BackgroundThe Autism Sequencing Consortium identified 102 high-confidence autism spectrum disorder (ASD) genes, showing that individuals with ASD and with potentially damaging single nucleotide variation (pdSNV) in these genes had lower cognitive levels and delayed age at walking, when compared to ASD participants without pdSNV. Here, we made use of a Swedish sample of individuals with ASD (called PAGES, for Population-Based Autism Genetics & Environment Study) to evaluate the frequency of pdSNV and their impact on medical and psychiatric phenotypes, using an epidemiological frame and universal health reporting. We then combine findings with those for potentially damaging copy number variation (pdCNV).MethodsSNV and CNV calls were generated from whole-exome sequencing and chromosome microarray data, respectively. Birth and medical register data were used to collect phenotypes.ResultsOf 808 individuals assessed by sequencing, 69 (9%) had pdSNV in the 102 ASC genes, and 144 (18%) had pdSNV in the 102 ASC genes or in a larger set of curated neurodevelopmental genes (from the Deciphering Developmental Disorders study, the gene2phenotype database, and the Radboud University gene lists). Three or more individuals had pdSNV in GRIN2B, POGZ, SATB1, DYNC1H1, SCN8A, or CREBBP. In comparison, out of the 996 individuals from whom CNV were called, 105 (11%) carried one or more pdCNV, including four or more individuals with CNV in the recurrent 15q11q13, 22q11.2, and 16p11.2 loci. Carriers of pdSNV were more likely to have intellectual disability (ID) and epilepsy, while carriers of pdCNV showed increased rates of congenital anomalies and scholastic skill disorders. Carriers of either pdSNV or pdCNV were more likely to have ID, scholastic skill disorders, and epilepsy.LimitationsThe cohort only included individuals with autistic disorder, the more severe form of ASD, and phenotypes are defined from medical registers. Not all genes studied are definitively ASD genes, and we did not have de novo information to aid in classification.ConclusionsIn this epidemiological sample, rare pdSNV were more common than pdCNV and the combined yield of potentially damaging variation was substantial at 27%. The results provide compelling rationale for the use of high-throughout sequencing as part of routine clinical workup for ASD and support the development of precision medicine in ASD.
Project description:Although ~?25% of colorectal cancer or polyp (CRC/P) cases show familial aggregation, current germline genetic testing identifies a causal genotype in the 16 major genes associated with high penetrance CRC/P in only 20% of these cases. As there are likely other genes underlying heritable CRC/P, we evaluated the association of variation at novel loci with CRC/P. We evaluated 158 a priori selected candidate genes by comparing the number of rare potentially disruptive variants (PDVs) found in 84 CRC/P cases without an identified CRC/P risk-associated variant and 2440 controls. We repeated this analysis using an additional 73 CRC/P cases. We also compared the frequency of PDVs in select genes among CRC/P cases with two publicly available data sets. We found a significant enrichment of PDVs in cases vs. controls: 20% of cases vs. 11.5% of controls with ??1 PDV (OR?=?1.9, p?=?0.01) in the original set of cases. Among the second cohort of CRC/P cases, 18% had a PDV, significantly different from 11.5% (p?=?0.02). Logistic regression, adjusting for ancestry and multiple testing, indicated association between CRC/P and PDVs in NTHL1 (p?=?0.0001), BRCA2 (p?=?0.01) and BRIP1 (p?=?0.04). However, there was no significant difference in the frequency of PDVs at each of these genes between all 157 CRC/P cases and two publicly available data sets. These results suggest an increased presence of PDVs in CRC/P cases and support further investigation of the association of NTHL1, BRCA2 and BRIP1 variation with CRC/P.
Project description:The discovery of disease-causing mutations typically requires confirmation of the variant or gene in multiple unrelated individuals, and a large number of rare genetic diseases remain unsolved due to difficulty identifying second families. To enable the secure sharing of case records by clinicians and rare disease scientists, we have developed the PhenomeCentral portal (https://phenomecentral.org). Each record includes a phenotypic description and relevant genetic information (exome or candidate genes). PhenomeCentral identifies similar patients in the database based on semantic similarity between clinical features, automatically prioritized genes from whole-exome data, and candidate genes entered by the users, enabling both hypothesis-free and hypothesis-driven matchmaking. Users can then contact other submitters to follow up on promising matches. PhenomeCentral incorporates data for over 1,000 patients with rare genetic diseases, contributed by the FORGE and Care4Rare Canada projects, the US NIH Undiagnosed Diseases Program, the EU Neuromics and ANDDIrare projects, as well as numerous independent clinicians and scientists. Though the majority of these records have associated exome data, most lack a molecular diagnosis. PhenomeCentral has already been used to identify causative mutations for several patients, and its ability to find matching patients and diagnose these diseases will grow with each additional patient that is entered.
Project description:Identifying genetic variants associated with lower waist-to-hip ratio can reveal new therapeutic targets for abdominal obesity. We use exome sequences from 362,679 individuals to identify genes associated with waist-to-hip ratio adjusted for BMI (WHRadjBMI), a surrogate for abdominal fat that is causally linked to type 2 diabetes and coronary heart disease. Predicted loss of function (pLOF) variants in INHBE associate with lower WHRadjBMI and this association replicates in data from AMP-T2D-GENES. INHBE encodes a secreted protein, the hepatokine activin E. In vitro characterization of the most common INHBE pLOF variant in our study, indicates an in-frame deletion resulting in a 90% reduction in secreted protein levels. We detect associations with lower WHRadjBMI for variants in ACVR1C, encoding an activin receptor, further highlighting the involvement of activins in regulating fat distribution. These findings highlight activin E as a potential therapeutic target for abdominal obesity, a phenotype linked to cardiometabolic disease.
Project description:By analyzing the whole-exome sequences of 4,264 schizophrenia cases, 9,343 controls and 1,077 trios, we identified a genome-wide significant association between rare loss-of-function (LoF) variants in SETD1A and risk for schizophrenia (P = 3.3 × 10(-9)). We found only two heterozygous LoF variants in 45,376 exomes from individuals without a neuropsychiatric diagnosis, indicating that SETD1A is substantially depleted of LoF variants in the general population. Seven of the ten individuals with schizophrenia carrying SETD1A LoF variants also had learning difficulties. We further identified four SETD1A LoF carriers among 4,281 children with severe developmental disorders and two more carriers in an independent sample of 5,720 Finnish exomes, both with notable neuropsychiatric phenotypes. Together, our observations indicate that LoF variants in SETD1A cause a range of neurodevelopmental disorders, including schizophrenia. Combining these data with previous common variant evidence, we suggest that epigenetic dysregulation, specifically in the histone H3K4 methylation pathway, is an important mechanism in the pathogenesis of schizophrenia.
Project description:Identifying drivers of complex traits from the noisy signals of genetic variation obtained from high-throughput genome sequencing technologies is a central challenge faced by human geneticists today. We hypothesize that the variants involved in complex diseases are likely to exhibit non-neutral evolutionary signatures. Uncovering the evolutionary history of all variants is therefore of intrinsic interest for complex disease research. However, doing so necessitates the simultaneous elucidation of the targets of natural selection and population-specific demographic history.Here we characterize the action of natural selection operating across complex disease categories, and use population genetic simulations to evaluate the expected patterns of genetic variation in large samples. We focus on populations that have experienced historical bottlenecks followed by explosive growth (consistent with many human populations), and describe the differences between evolutionarily deleterious mutations and those that are neutral.Genes associated with several complex disease categories exhibit stronger signatures of purifying selection than non-disease genes. In addition, loci identified through genome-wide association studies of complex traits also exhibit signatures consistent with being in regions recurrently targeted by purifying selection. Through simulations, we show that population bottlenecks and rapid growth enable deleterious rare variants to persist at low frequencies just as long as neutral variants, but low-frequency and common variants tend to be much younger than neutral variants. This has resulted in a large proportion of modern-day rare alleles that have a deleterious effect on function and that potentially contribute to disease susceptibility.The key question for sequencing-based association studies of complex traits is how to distinguish between deleterious and benign genetic variation. We used population genetic simulations to uncover patterns of genetic variation that distinguish these two categories, especially derived allele age, thereby providing inroads into novel methods for characterizing rare genetic variation driving complex diseases.
Project description:Technology advances have promoted gene-based sequencing studies with the aim of identifying rare mutations responsible for complex diseases. A complication in these types of association studies is that the vast majority of non-synonymous mutations are believed to be neutral to phenotypes. It is thus critical to distinguish potential causative variants from neutral variation before performing association tests. In this study, we used existing predicting algorithms to predict functional amino acid substitutions, and incorporated that information into association tests. Using simulations, we comprehensively studied the effects of several influential factors, including the sensitivity and specificity of functional variant predictions, number of variants, and proportion of causative variants, on the performance of association tests. Our results showed that incorporating information regarding functional variants obtained from existing prediction algorithms improves statistical power under certain conditions, particularly when the proportion of causative variants is moderate. The application of the proposed tests to a real sequencing study confirms our conclusions. Our work may help investigators who are planning to pursue gene-based sequencing studies.
Project description:Genetic epilepsies (GEs) account for approximately 50% of all seizure disorders, and familial forms include mutations in single GABAA receptor subunit genes (GABRs). In 144 sporadic GE cases (GECs), exome sequencing of 237 ion channel genes identified 520 GABR variants. Among these variants, 33 rare variants in 11 GABR genes were present in 24 GECs. To assess functional risk of variants in GECs, we selected 8 variants found in GABRA, 3 in GABRB, and 3 in GABRG and compared them to 18 variants found in the general population for GABRA1 (n = 9), GABRB3 (n = 7), and GABRG2 (n = 2). To identify deleterious variants and gain insight into structure-function relationships, we studied the gating properties, surface expression and structural perturbations of the 32 variants. Significant reduction of GABAA receptor function was strongly associated with variants scored as deleterious and mapped within the N-terminal and transmembrane domains. In addition, 12 out of 17 variants mapped along the ?+/?- GABA binding interface, were associated with reduction in channel gating and were predicted to cause structural rearrangements of the receptor by in silico simulations. Missense or nonsense mutations of GABRA1, GABRB3 and GABRG2 primarily impair subunit biogenesis. In contrast, GABR variants affected receptor function by impairing gating, suggesting that different mechanisms are operating in GABR epilepsy susceptibility variants and disease-causing mutations. The functional impact of single GABR variants found in individuals with sporadic GEs warrants the use of molecular diagnosis and will ultimately improve the treatment of genetic epilepsies by using a personalized approach.