Dataset Information

A negative selection heuristic to predict new transcriptional targets.

ABSTRACT: BACKGROUND: Supervised machine learning approaches have recently been adopted to infer transcriptional targets from high-throughput transcriptomic and proteomic data, showing major improvements over the state of the art of reverse gene regulatory network methods. Unlike traditional unsupervised techniques, a supervised classifier learns from known examples a function that can recognize new relationships in new data. In gene regulatory inference, a supervised classifier is forced to learn from positive and unlabeled examples only, because negative counterexamples are unavailable or hard to collect. This condition can limit the performance of the classifier, especially when few training examples are available. RESULTS: In this paper we improve the supervised identification of transcriptional targets by selecting reliable negative examples from the unlabeled set. We introduce a heuristic, based on the known topology of transcriptional networks, that in effect restores the conventional positive/negative training condition and significantly improves classification performance. We empirically evaluate the proposed heuristic on experimental datasets of Escherichia coli and show an example application to the prediction of BCL6 direct core targets in normal germinal center human B cells, obtaining a precision of 60%. CONCLUSIONS: The availability of only positive examples when learning transcriptional relationships negatively affects the performance of supervised classifiers. We show that selecting reliable negative examples, a practice adopted in text mining approaches, improves the performance of such classifiers, opening new perspectives in the identification of new transcriptional targets.
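The general idea of learning from positive and unlabeled examples by first picking reliable negatives can be sketched as below. This is only an illustration under stated assumptions, not the method of the paper: the abstract does not spell out the topology-based heuristic, so the sketch substitutes a placeholder criterion (distance from the centroid of the known positives). The function names, the nearest-centroid classifier, and the toy feature vectors are all hypothetical.

```python
from math import dist
from statistics import mean


def centroid(points):
    """Component-wise mean of a list of equal-length feature vectors."""
    return [mean(column) for column in zip(*points)]


def select_reliable_negatives(positives, unlabeled, k):
    """Return the k unlabeled vectors farthest from the positive centroid.

    ASSUMPTION: distance from the positive centroid is a stand-in
    criterion, not the topology-based heuristic described in the abstract.
    """
    c = centroid(positives)
    return sorted(unlabeled, key=lambda x: dist(x, c), reverse=True)[:k]


def train_nearest_centroid(positives, negatives):
    """Fit a two-class nearest-centroid classifier; return a predict function."""
    cp, cn = centroid(positives), centroid(negatives)
    return lambda x: dist(x, cp) < dist(x, cn)


# Toy, made-up feature vectors (not data from the study).
positives = [[1.0, 1.0], [1.2, 0.8], [0.9, 1.1]]
unlabeled = [[1.1, 1.0], [5.0, 5.0], [4.8, 5.2], [0.8, 0.9], [5.1, 4.9]]

# Step 1: pick reliable negatives from the unlabeled set.
negatives = select_reliable_negatives(positives, unlabeled, k=3)

# Step 2: train a conventional positive/negative classifier.
is_target = train_nearest_centroid(positives, negatives)
```

Once negatives have been selected, any standard binary classifier could take the place of the nearest-centroid rule; it is used here only to keep the sketch dependency-free.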

SUBMITTER: Cerulo L

PROVIDER: S-EPMC3548675 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

Publications

A negative selection heuristic to predict new transcriptional targets.

Cerulo L, Paduano V, Zoppoli P, Ceccarelli M

BMC Bioinformatics, 2013-01-14

PMID: 23368951

Similar Datasets

Fine-tuning enhancer models to predict transcriptional targets across multiple genomes. | S-EPMC2047340 | biostudies-literature

Reference Evapotranspiration Modeling Using New Heuristic Methods. | S-EPMC7517042 | biostudies-literature

Selection of new immunotherapy targets for NK/T cell lymphoma. | S-EPMC7724344 | biostudies-literature

Crossreactive αβ T Cell Receptors Are the Predominant Targets of Thymocyte Negative Selection. | S-EPMC4654978 | biostudies-literature

Heuristic evaluation on mobile interfaces: a new checklist. | S-EPMC4177852 | biostudies-literature

A novel test for selection on cis-regulatory elements reveals positive and negative selection acting on mammalian transcriptional enhancers. | S-EPMC3808868 | biostudies-other

Heuristic algorithms for feature selection under Bayesian models with block-diagonal covariance structure. | S-EPMC5872553 | biostudies-literature

Conserved expression patterns predict microRNA targets. | S-EPMC2736581 | biostudies-other

New Promising Targets for Synthetic Omptin-Based Peptide Vaccine against Gram-Negative Pathogens. | S-EPMC6630670 | biostudies-literature