Project description:An emerging theme from large-scale genetic screens that identify genes essential for cell fitness is that essentiality of a given gene is highly context-specific. Identification of such contexts could be the key to defining gene function and also to develop novel therapeutic interventions. Here, we present Context-specific Essentiality Network-tools (CEN-tools), a website and python package, in which users can interrogate the essentiality of a gene from large-scale genome-scale CRISPR screens in a number of biological contexts including tissue of origin, mutation profiles, expression levels and drug responses. We show that CEN-tools is suitable for the systematic identification of genetic dependencies and for more targeted queries. The associations between genes and a given context are represented as dependency networks (CENs), and we demonstrate the utility of these networks in elucidating novel gene functions. In addition, we integrate the dependency networks with existing protein-protein interaction networks to reveal context-dependent essential cellular pathways in cancer cells. Together, we demonstrate the applicability of CEN-tools in aiding the current efforts to define the human cellular dependency map.
Project description:BackgroundAlthough considerable progress has been made in understanding molecular alterations driving gliomagenesis, the diverse metabolic programs contributing to the aggressive phenotype of glioblastoma remain unclear. The aim of this study was to define and provide molecular context to metabolic reprogramming driving gliomagenesis.MethodsIntegrative cross-platform analyses coupling global metabolomic profiling with genomics in patient-derived glioma (low-grade astrocytoma [LGA; n = 28] and glioblastoma [n = 80]) were performed. Identified programs were then metabolomically, genomically, and functionally evaluated in preclinical models.ResultsClear metabolic programs were identified differentiating LGA from glioblastoma, with aberrant lipid, peptide, and amino acid metabolism representing dominant metabolic nodes associated with malignant transformation. Although the metabolomic profiles of glioblastoma and LGA appeared mutually exclusive, considerable metabolic heterogeneity was observed in glioblastoma. Surprisingly, integrative analyses demonstrated that O6-methylguanine-DNA methyltransferase methylation and isocitrate dehydrogenase mutation status were equally distributed among glioblastoma metabolic profiles. Transcriptional subtypes, on the other hand, tightly clustered by their metabolomic signature, with proneural and mesenchymal tumor profiles being mutually exclusive. Integrating these metabolic phenotypes with gene expression analyses uncovered tightly orchestrated and highly redundant transcriptional programs designed to support the observed metabolic programs by actively importing these biochemical substrates from the microenvironment, contributing to a state of enhanced metabolic heterotrophy. These findings were metabolomically, genomically, and functionally recapitulated in preclinical models.ConclusionDespite disparate molecular pathways driving the progression of glioblastoma, metabolic programs designed to maintain its aggressive phenotype remain conserved. This contributes to a state of enhanced metabolic heterotrophy supporting survival in diverse microenvironments implicit in this malignancy.
Project description:Despite rapid progress in characterizing the role of host genetics in SARS-Cov-2 infection, there is limited understanding of genes and pathways that contribute to COVID-19. Here, we integrate a genome-wide association study of COVID-19 hospitalization (7,885 cases and 961,804 controls from COVID-19 Host Genetics Initiative) with mRNA expression, splicing, and protein levels (n = 18,502). We identify 27 genes related to inflammation and coagulation pathways whose genetically predicted expression was associated with COVID-19 hospitalization. We functionally characterize the 27 genes using phenome- and laboratory-wide association scans in Vanderbilt Biobank (n = 85,460) and identified coagulation-related clinical symptoms, immunologic, and blood-cell-related biomarkers. We replicate these findings across trans-ethnic studies and observed consistent effects in individuals of diverse ancestral backgrounds in Vanderbilt Biobank, pan-UK Biobank, and Biobank Japan. Our study highlights and reconfirms putative causal genes impacting COVID-19 severity and symptomology through the host inflammatory response.
Project description:BackgroundThe prediction of essential genes from molecular networks is a way to test the understanding of essentiality in the context of what is known about the network. However, the current knowledge on molecular network structures is incomplete yet, and consequently the strategies aimed to predict essential genes are prone to uncertain predictions. We propose that simultaneously evaluating different network structures and different algorithms representing gene essentiality (centrality measures) may identify essential genes in networks in a reliable fashion.ResultsBy simultaneously analyzing 16 different centrality measures on 18 different reconstructed metabolic networks for Saccharomyces cerevisiae, we show that no single centrality measure identifies essential genes from these networks in a statistically significant way; however, the combination of at least 2 centrality measures achieves a reliable prediction of most but not all of the essential genes. No improvement is achieved in the prediction of essential genes when 3 or 4 centrality measures were combined.ConclusionThe method reported here describes a reliable procedure to predict essential genes from molecular networks. Our results show that essential genes may be predicted only by combining centrality measures, revealing the complex nature of the function of essential genes.
Project description:Recent genome-wide association studies (GWAS) have identified multiple risk loci that show strong associations with schizophrenia. However, pinpointing the potential causal genes at the reported loci remains a major challenge. Here we identify candidate causal genes for schizophrenia using an integrative genomic approach. Sherlock integrative analysis shows that ALMS1, GLT8D1, and CSNK2B are schizophrenia risk genes, which are validated using independent brain expression quantitative trait loci (eQTL) data and integrative analysis method (SMR). Consistently, gene expression analysis in schizophrenia cases and controls further supports the potential role of these three genes in the pathogenesis of schizophrenia. Finally, we show that GLT8D1 and CSNK2B knockdown promote the proliferation and inhibit the differentiation abilities of neural stem cells, and alter morphology and synaptic transmission of neurons. These convergent lines of evidence suggest that the ALMS1, CSNK2B, and GLT8D1 genes may be involved in pathophysiology of schizophrenia.
Project description:Effective discovery of causal disease genes must overcome the statistical challenges of quantitative genetics studies and the practical limitations of human biology experiments. Here we developed diseaseQUEST, an integrative approach that combines data from human genome-wide disease studies with in silico network models of tissue- and cell-type-specific function in model organisms to prioritize candidates within functionally conserved processes and pathways. We used diseaseQUEST to predict candidate genes for 25 different diseases and traits, including cancer, longevity, and neurodegenerative diseases. Focusing on Parkinson's disease (PD), a diseaseQUEST-directed Caenhorhabditis elegans behavioral screen identified several candidate genes, which we experimentally verified and found to be associated with age-dependent motility defects mirroring PD clinical symptoms. Furthermore, knockdown of the top candidate gene, bcat-1, encoding a branched chain amino acid transferase, caused spasm-like 'curling' and neurodegeneration in C. elegans, paralleling decreased BCAT1 expression in PD patient brains. diseaseQUEST is modular and generalizable to other model organisms and human diseases of interest.
Project description:BackgroundFew driver genes have been well established in esophageal squamous cell carcinoma (ESCC). Identification of the genomic aberrations that contribute to changes in gene expression profiles can be used to predict driver genes.MethodsWe searched for driver genes in ESCC by integrative analysis of gene expression microarray profiles and copy number data. To narrow down candidate genes, we performed survival analysis on expression data and tested the genetic vulnerability of each genes using public RNAi screening data. We confirmed the results by performing RNAi experiments and evaluating the clinical relevance of candidate genes in an independent ESCC cohort.ResultsWe found 10 significantly recurrent copy number alterations accompanying gene expression changes, including loci 11q13.2, 7p11.2, 3q26.33, and 17q12, which harbored CCND1, EGFR, SOX2, and ERBB2, respectively. Analysis of survival data and RNAi screening data suggested that GRB7, located on 17q12, was a driver gene in ESCC. In ESCC cell lines harboring 17q12 amplification, knockdown of GRB7 reduced the proliferation, migration, and invasion capacities of cells. Moreover, siRNA targeting GRB7 had a synergistic inhibitory effect when combined with trastuzumab, an anti-ERBB2 antibody. Survival analysis of the independent cohort also showed that high GRB7 expression was associated with poor prognosis in ESCC.ConclusionOur integrative analysis provided important insights into ESCC pathogenesis. We identified GRB7 as a novel ESCC driver gene and potential new therapeutic target.
Project description:Building models of a biological system that are consistent with the myriad data available is one of the key challenges in biology. Modeling the structure and dynamics of macromolecular assemblies, for example, can give insights into how biological systems work, evolved, might be controlled, and even designed. Integrative structure modeling casts the building of structural models as a computational optimization problem, for which information about the assembly is encoded into a scoring function that evaluates candidate models. Here, we describe our open source software suite for integrative structure modeling, Integrative Modeling Platform (https://integrativemodeling.org), and demonstrate its use.
Project description:To identify genetic loci influencing lipid levels, we performed quantitative trait loci (QTL) analysis between inbred mouse strains MRL/MpJ and SM/J, measuring triglyceride levels at 8 weeks of age in F2 mice fed a chow diet. We identified one significant QTL on chromosome (Chr) 15 and three suggestive QTL on Chrs 2, 7, and 17. We also carried out microarray analysis on the livers of parental strains of 282 F2 mice and used these data to find cis-regulated expression QTL. We then narrowed the list of candidate genes under significant QTL using a "toolbox" of bioinformatic resources, including haplotype analysis; parental strain comparison for gene expression differences and nonsynonymous coding single nucleotide polymorphisms (SNP); cis-regulated eQTL in livers of F2 mice; correlation between gene expression and phenotype; and conditioning of expression on the phenotype. We suggest Slc25a7 as a candidate gene for the Chr 7 QTL and, based on expression differences, five genes (Polr3 h, Cyp2d22, Cyp2d26, Tspo, and Ttll12) as candidate genes for Chr 15 QTL. This study shows how bioinformatics can be used effectively to reduce candidate gene lists for QTL related to complex traits.
Project description:ObjectiveTo explore the key regulatory genes associated with lung cancer in order to reduce its occurrence and progress through silencing these key genes.MethodsTo identify the key regulatory genes involved in lung cancer, we performed a combination of gene array and bioinformatics analyses to compare gene transcription profiles in 3 monoclonal cell strains with high, medium or low metastatic abilities, which were separated from the SPC-A-1sci and SPC-A-1 cell lines by limiting dilution monoclone assay. We then analyzed those genes' biological activities by knocking down their expression in SPC-A-1sci cells using siRNA and lenti-viral shRNA vectors, followed by determinations of the invasion and migration capabilities of the resulting cell lines in vitro as well as their potential for inducing occurrence and metastasis of lung cancer in vivo. To examine the clinical relevance of these findings, we analyzed the expression levels of the identified genes in human lung cancer tissues (n = 135) and matched adjacent normal tissues by immunohistochemical (IHC) staining.ResultsThree monoclonal cell strains characterized with high, medium or low metastatic abilities were successfully selected. Gene array and bioinformatics analyses implied that osteopontin, LAMB3 and ITGB1 were key genes involved in lung cancer. Knockdown of these genes suppressed human lung cancer cell invasion and metastasis in vitro and in vivo. Clinical sample analyses indicated that osteopontin, LAMB3 and ITGB1 protein expression levels were higher in lung cancer patients, compared to non-cancerous adjacent tissues, and correlated with lymphatic metastasis.ConclusionsWe confirmed that osteopontin, LAMB3 and ITGB1 played important roles in the occurrence and metastasis of lung cancer, thus provided important clues to understanding the molecular mechanism of metastasis and contributing to the therapeutic treatment of lung cancer.