Dataset Information

Recursive random forest algorithm for constructing multilayered hierarchical gene regulatory networks that govern biological pathways.


ABSTRACT:

Background

Present knowledge indicates that a multilayered hierarchical gene regulatory network (ML-hGRN) often operates above a biological pathway. Although the ML-hGRN is important for understanding how a pathway is regulated, almost no computational algorithms exist for directly constructing ML-hGRNs.

Results

A backward elimination random forest (BWERF) algorithm was developed for constructing the ML-hGRN operating above a biological pathway. For each pathway gene, BWERF used a random forest model to calculate the importance values of all transcription factors (TFs) to that gene. The modeling was recursive: in each round, a portion (e.g. 1/10) of the least important TFs was excluded, and the importance values of the TFs to the pathway gene were updated and re-ranked, until only one TF remained in the list. This procedure is termed BWERF. The importance values of each TF to all pathway genes were then aggregated and fitted to a Gaussian mixture model to determine which TFs to retain for the regulatory layer immediately above the pathway layer. The TFs acquired at this second layer were then set as the new bottom layer to infer the next upper layer, and the process was repeated until an ML-hGRN with the expected number of layers was obtained.
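The elimination loop and the aggregation step described above can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation. The function names, the data layout, and the synthetic data are all hypothetical. The importance score used here is the absolute Pearson correlation, a simple stand-in chosen so the example runs with the standard library only; the abstract uses random forest importance values instead. Summing importances across pathway genes and recording each TF's importance from the last round it survived are also assumptions, and the Gaussian mixture model step is not shown.

```python
import math
import random
from typing import Callable, Dict, Sequence

# Hypothetical interface: (TF expression profiles, target gene profile) -> importance per TF.
ImportanceFn = Callable[[Dict[str, Sequence[float]], Sequence[float]], Dict[str, float]]


def abs_correlation_importance(tf_expr, target):
    """Stand-in importance: |Pearson r| between each TF and the target gene.
    This is NOT a random forest; a random forest's feature importances
    would be plugged in here to follow the abstract."""
    n = len(target)
    mean_y = sum(target) / n
    var_y = sum((y - mean_y) ** 2 for y in target)
    scores = {}
    for tf, xs in tf_expr.items():
        mean_x = sum(xs) / n
        var_x = sum((x - mean_x) ** 2 for x in xs)
        cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, target))
        denom = math.sqrt(var_x * var_y)
        scores[tf] = abs(cov / denom) if denom > 0 else 0.0
    return scores


def backward_elimination(tf_expr, target, importance_fn: ImportanceFn, drop_fraction=0.1):
    """Re-score the surviving TFs each round, drop the least important
    fraction, and stop once a single TF remains. Returns, for every TF,
    the importance from the last round in which it was still present
    (an assumption about how the values are recorded)."""
    remaining = dict(tf_expr)
    final_importance = {}
    while True:
        scores = importance_fn(remaining, target)
        final_importance.update(scores)  # only surviving TFs are updated
        if len(remaining) <= 1:
            break
        n_drop = max(1, int(len(remaining) * drop_fraction))
        for tf in sorted(scores, key=scores.get)[:n_drop]:  # least important first
            del remaining[tf]
    return final_importance


def aggregate_importance(per_gene):
    """Sum each TF's importance over all pathway genes (assumed aggregation)."""
    totals = {}
    for scores in per_gene.values():
        for tf, value in scores.items():
            totals[tf] = totals.get(tf, 0.0) + value
    return totals


# Synthetic data: 20 candidate TFs, two pathway genes both driven by TF3.
rng = random.Random(0)
n_samples = 50
tf_expr = {f"TF{i}": [rng.gauss(0, 1) for _ in range(n_samples)] for i in range(20)}
pathway_genes = {
    "geneA": [2.0 * x + rng.gauss(0, 0.1) for x in tf_expr["TF3"]],
    "geneB": [-1.5 * x + rng.gauss(0, 0.1) for x in tf_expr["TF3"]],
}

per_gene = {
    gene: backward_elimination(tf_expr, profile, abs_correlation_importance)
    for gene, profile in pathway_genes.items()
}
totals = aggregate_importance(per_gene)
top_tf = max(totals, key=totals.get)
print(top_tf, round(totals[top_tf], 3))
```

Passing the scoring function as a parameter keeps the elimination schedule separate from the model, so the stand-in score can be replaced by random forest importances without changing the loop.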

Conclusions

BWERF improved the accuracy of ML-hGRN construction because it used backward elimination to exclude noise genes and aggregated the individual importance values to determine TF retention. We validated BWERF by using it to construct ML-hGRNs operating above the mouse pluripotency maintenance pathway and the Arabidopsis lignocellulosic pathway. Compared to GENIE3, BWERF showed an improvement in recognizing authentic TFs regulating a pathway. Compared to the bottom-up Gaussian graphical model algorithm we previously developed for constructing ML-hGRNs, BWERF constructs ML-hGRNs with significantly fewer edges, which enables biologists to choose the implicit edges for experimental validation.

SUBMITTER: Deng W 

PROVIDER: S-EPMC5291523 | biostudies-literature | 2017

REPOSITORIES: biostudies-literature

Publications

Recursive random forest algorithm for constructing multilayered hierarchical gene regulatory networks that govern biological pathways.

Wenping Deng, Kui Zhang, Victor Busov, Hairong Wei

PLoS One, 2017-02-03, issue 2


Similar Datasets

| S-EPMC4797117 | biostudies-literature
2012-05-10 | GSE37858 | GEO
2012-05-09 | E-GEOD-37858 | biostudies-arrayexpress
| S-EPMC6339008 | biostudies-literature
2022-05-16 | GSE189510 | GEO
| S-EPMC4266947 | biostudies-literature
| S-EPMC5278550 | biostudies-literature
| S-EPMC6311931 | biostudies-literature
| S-EPMC6157185 | biostudies-literature
| S-EPMC3985673 | biostudies-literature