
Dataset Information


Data integration workflow for search of disease driving genes and genetic variants.


ABSTRACT: Comprehensive characterization of a gene's impact on phenotypes requires knowledge of the context of the gene. To address this issue, we introduce a systematic data integration method, Candidate Genes and SNPs (CANGES), that links SNP and linkage disequilibrium data to pathway and protein-protein interaction information. It can be used as a knowledge discovery tool to search for disease-associated causative variants in genome-wide studies and to generate new hypotheses on synergistically functioning genes. We demonstrate the utility of CANGES by integrating pathway and protein-protein interaction data to identify putative functional variants for (i) the p53 gene and (ii) three glioblastoma multiforme (GBM) associated risk genes. For the GBM case, we further integrate the CANGES results with clinical and genome-wide data for 209 GBM patients and identify genes that affect GBM patient survival. Our results show that selecting a focused set of genes can yield information beyond what traditional genome-wide association approaches provide. Taken together, a holistic approach to identifying possibly interacting genes and SNPs with CANGES provides a means to rapidly identify networks for any set of genes and to generate novel hypotheses. CANGES is available at http://csbi.ltdk.helsinki.fi/CANGES/

SUBMITTER: Karinen S 

PROVIDER: S-EPMC3075259 | biostudies-literature | 2011 Apr

REPOSITORIES: biostudies-literature


Publications

Data integration workflow for search of disease driving genes and genetic variants.

Karinen Sirkku, Heikkinen Tuomas, Nevanlinna Heli, Hautaniemi Sampsa

PLoS One, 2011-04-12, issue 4



Similar Datasets

2022-07-05 | PXD022091 | Pride
| S-EPMC6355707 | biostudies-literature
| S-EPMC3961547 | biostudies-literature
| S-EPMC6330278 | biostudies-literature
2011-05-06 | E-GEOD-29051 | biostudies-arrayexpress
| S-EPMC4497619 | biostudies-literature
| S-EPMC8670043 | biostudies-literature
| S-EPMC5977601 | biostudies-literature
2011-05-06 | GSE29051 | GEO
| S-EPMC5001771 | biostudies-literature