Unknown

Dataset Information

0

Geptop: a gene essentiality prediction tool for sequenced bacterial genomes based on orthology and phylogeny.


ABSTRACT: Integrative genomics predictors, which score highly in predicting bacterial essential genes, would be unfeasible in most species because the data sources are limited. We developed a universal approach and tool designated Geptop, based on orthology and phylogeny, to offer gene essentiality annotations. In a series of tests, our Geptop method yielded higher area under curve (AUC) scores in the receiver operating curves than the integrative approaches. In the ten-fold cross-validations among randomly upset samples, Geptop yielded an AUC of 0.918, and in the cross-organism predictions for 19 organisms Geptop yielded AUC scores between 0.569 and 0.959. A test applied to the very recently determined essential gene dataset from the Porphyromonas gingivalis, which belongs to a phylum different with all of the above 19 bacterial genomes, gave an AUC of 0.77. Therefore, Geptop can be applied to any bacterial species whose genome has been sequenced. Compared with the essential genes uniquely identified by the lethal screening, the essential genes predicted only by Gepop are associated with more protein-protein interactions, especially in the three bacteria with lower AUC scores (<0.7). This may further illustrate the reliability and feasibility of our method in some sense. The web server and standalone version of Geptop are available at http://cefg.uestc.edu.cn/geptop/ free of charge. The tool has been run on 968 bacterial genomes and the results are accessible at the website.

SUBMITTER: Wei W 

PROVIDER: S-EPMC3744497 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

altmetric image

Publications

Geptop: a gene essentiality prediction tool for sequenced bacterial genomes based on orthology and phylogeny.

Wei Wen W   Ning Lu-Wen LW   Ye Yuan-Nong YN   Guo Feng-Biao FB  

PloS one 20130815 8


Integrative genomics predictors, which score highly in predicting bacterial essential genes, would be unfeasible in most species because the data sources are limited. We developed a universal approach and tool designated Geptop, based on orthology and phylogeny, to offer gene essentiality annotations. In a series of tests, our Geptop method yielded higher area under curve (AUC) scores in the receiver operating curves than the integrative approaches. In the ten-fold cross-validations among random  ...[more]

Similar Datasets

| S-EPMC2867826 | biostudies-literature
| S-EPMC1800777 | biostudies-literature
| S-EPMC3908345 | biostudies-literature
| S-EPMC8449488 | biostudies-literature
| S-EPMC5679510 | biostudies-literature
| S-EPMC3251307 | biostudies-literature
| S-EPMC5585334 | biostudies-literature
| S-EPMC471550 | biostudies-literature
| S-EPMC137074 | biostudies-literature
| S-EPMC5479036 | biostudies-literature