Unknown

Dataset Information

0

GeneFishing to reconstruct context specific portraits of biological processes.


ABSTRACT: Rapid advances in genomic technologies have led to a wealth of diverse data, from which novel discoveries can be gleaned through the application of robust statistical and computational methods. Here, we describe GeneFishing, a semisupervised computational approach to reconstruct context-specific portraits of biological processes by leveraging gene-gene coexpression information. GeneFishing incorporates multiple high-dimensional statistical ideas, including dimensionality reduction, clustering, subsampling, and results aggregation, to produce robust results. To illustrate the power of our method, we applied it using 21 genes involved in cholesterol metabolism as "bait" to "fish out" (or identify) genes not previously identified as being connected to cholesterol metabolism. Using simulation and real datasets, we found that the results obtained through GeneFishing were more interesting for our study than those provided by related gene prioritization methods. In particular, application of GeneFishing to the GTEx liver RNA sequencing (RNAseq) data not only reidentified many known cholesterol-related genes, but also pointed to glyoxalase I (GLO1) as a gene implicated in cholesterol metabolism. In a follow-up experiment, we found that GLO1 knockdown in human hepatoma cell lines increased levels of cellular cholesterol ester, validating a role for GLO1 in cholesterol metabolism. In addition, we performed pantissue analysis by applying GeneFishing on various tissues and identified many potential tissue-specific cholesterol metabolism-related genes. GeneFishing appears to be a powerful tool for identifying related components of complex biological systems and may be used across a wide range of applications.

SUBMITTER: Liu K 

PROVIDER: S-EPMC6754596 | biostudies-literature | 2019 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

GeneFishing to reconstruct context specific portraits of biological processes.

Liu Ke K   Theusch Elizabeth E   Zhou Yun Y   Ashuach Tal T   Dose Andrea C AC   Bickel Peter J PJ   Medina Marisa W MW   Huang Haiyan H  

Proceedings of the National Academy of Sciences of the United States of America 20190904 38


Rapid advances in genomic technologies have led to a wealth of diverse data, from which novel discoveries can be gleaned through the application of robust statistical and computational methods. Here, we describe GeneFishing, a semisupervised computational approach to reconstruct context-specific portraits of biological processes by leveraging gene-gene coexpression information. GeneFishing incorporates multiple high-dimensional statistical ideas, including dimensionality reduction, clustering, s  ...[more]

Similar Datasets

| S-EPMC3339396 | biostudies-literature
| S-EPMC4443674 | biostudies-literature
| S-EPMC6283956 | biostudies-literature
| S-EPMC6538558 | biostudies-literature
| S-EPMC2613524 | biostudies-literature
| S-EPMC3748606 | biostudies-literature
| S-EPMC3187657 | biostudies-literature
| S-EPMC6080920 | biostudies-literature
| S-EPMC7653997 | biostudies-literature
| S-EPMC8533999 | biostudies-literature