Dataset Information

Prioritizing predictive biomarkers for gene essentiality in cancer cells with mRNA expression data and DNA copy number profile.


ABSTRACT:

Motivation

Finding driver genes that are responsible for the aberrant proliferation rate of cancer cells is informative for both cancer research and the development of targeted drugs. The established experimental and computational methods are labor-intensive. To make algorithms feasible in real clinical settings, methods that can predict driver genes using less experimental data are urgently needed.

Results

We designed an effective feature selection method and used Support Vector Machines (SVM) to predict the essentiality of the potential driver genes in cancer cell lines with only 10 genes as features. The accuracy of our predictions was the highest in the Broad-DREAM Gene Essentiality Prediction Challenge. We also found a set of genes whose essentiality could be predicted much more accurately than others, which we called Accurately Predicted (AP) genes. Our method can serve as a new way of assessing the essentiality of genes in cancer cells.
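The abstract does not say how the 10 features are chosen, so the following is only a minimal sketch of one common approach: rank candidate genes by the absolute Pearson correlation between their expression and the essentiality score of a target gene across cell lines, then keep the top 10. The ranking criterion, the function names (`pearson`, `select_top_k_features`), and the toy data are all assumptions made for illustration, not the authors' method. The selected genes would then serve as input features to an SVM; that step is omitted here to keep the sketch dependency-free.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def select_top_k_features(expression, target, k=10):
    """Return the k gene names whose profiles correlate most strongly with target.

    expression: dict mapping gene name -> list of values, one per cell line
    target:     list of essentiality scores for one gene, one per cell line
    NOTE: correlation ranking is an assumed criterion, used here for illustration.
    """
    ranked = sorted(
        expression,
        key=lambda gene: abs(pearson(expression[gene], target)),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical toy data: 40 cell lines, 50 noise genes, 2 informative genes.
rng = random.Random(0)
n_cell_lines = 40
target = [rng.gauss(0, 1) for _ in range(n_cell_lines)]
expression = {
    f"noise_{i}": [rng.gauss(0, 1) for _ in range(n_cell_lines)]
    for i in range(50)
}
expression["informative_pos"] = [t + rng.gauss(0, 0.1) for t in target]
expression["informative_neg"] = [-t + rng.gauss(0, 0.1) for t in target]

selected = select_top_k_features(expression, target, k=10)
print(selected)
```

Ranking on the absolute value of the correlation keeps genes that are negatively associated with essentiality as well as positively associated ones, which is why both informative genes in the toy data rank at the top.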

Availability and implementation

The raw data that support the findings of this study are available at Synapse: https://www.synapse.org/#!Synapse:syn2384331/wiki/62825. Source code is available at GitHub: https://github.com/GuanLab/DREAM-Gene-Essentiality-Challenge.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Guan Y 

PROVIDER: S-EPMC6247930 | biostudies-literature

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC4593641 | biostudies-literature
| S-EPMC3548603 | biostudies-literature
| S-EPMC4054098 | biostudies-literature
| S-EPMC3084615 | biostudies-literature
| S-EPMC2753655 | biostudies-literature
| S-EPMC3514678 | biostudies-literature
| S-EPMC4396974 | biostudies-literature
| S-EPMC2623285 | biostudies-literature
| S-EPMC10399873 | biostudies-literature
| S-ECPF-GEOD-25508 | biostudies-other