Dataset Information

Genome majority vote improves gene predictions.


ABSTRACT: Recent studies have noted extensive inconsistencies in gene start sites among orthologous genes in related microbial genomes. Here we provide the first documented evidence that imposing gene start consistency improves the accuracy of gene start-site prediction. We applied an algorithm using a genome majority vote (GMV) scheme to increase the consistency of gene starts among orthologs. We used a set of validated Escherichia coli genes as a standard to quantify accuracy. Results showed that the GMV algorithm can correct hundreds of gene prediction errors in sets of five or ten genomes while introducing few errors. Using a conservative calculation, we project that GMV would resolve many inconsistencies and errors in publicly available microbial gene maps. Our simple and logical solution provides a notable advance toward accurate gene maps.
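The majority-vote idea in the abstract can be illustrated with a minimal sketch. This is not the authors' GMV implementation: the function name `majority_vote_starts`, the `min_fraction` threshold, and the assumption that ortholog start sites have already been mapped to a shared alignment coordinate are all hypothetical choices made for illustration.

```python
from collections import Counter


def majority_vote_starts(aligned_starts, min_fraction=0.5):
    """Illustrative majority vote over ortholog gene starts (hypothetical helper).

    aligned_starts maps a genome name to the predicted start position of one
    ortholog, expressed in a shared alignment coordinate (an assumption of
    this sketch). If one start is used by more than `min_fraction` of the
    genomes, every genome is reassigned to that start; otherwise the
    predictions are returned unchanged.
    """
    if not aligned_starts:
        return {}
    counts = Counter(aligned_starts.values())
    top_start, top_count = counts.most_common(1)[0]
    if top_count / len(aligned_starts) <= min_fraction:
        # No clear majority: leave the original predictions alone.
        return dict(aligned_starts)
    return {genome: top_start for genome in aligned_starts}


# Five hypothetical genomes; genome_D disagrees with the other four.
orthologs = {
    "genome_A": 12,
    "genome_B": 12,
    "genome_C": 12,
    "genome_D": 45,
    "genome_E": 12,
}
corrected = majority_vote_starts(orthologs)
```

A real correction step would also need to confirm that the majority start corresponds to a valid start codon in the dissenting genome before moving it; the sketch omits that check.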

SUBMITTER: Wall ME 

PROVIDER: S-EPMC3219611 | biostudies-literature | 2011 Nov

REPOSITORIES: biostudies-literature

Publications

Genome majority vote improves gene predictions.

Wall ME, Raghavan S, Cohn JD, Dunbar J

PLoS Computational Biology. 2011 Nov 17; (11)


Similar Datasets

S-EPMC5495993 | biostudies-literature
S-EPMC2449328 | biostudies-literature
S-EPMC2144342 | biostudies-other
S-EPMC9790176 | biostudies-literature
S-EPMC8247156 | biostudies-literature
S-EPMC7322007 | biostudies-literature
S-EPMC6779412 | biostudies-literature
S-EPMC534666 | biostudies-literature
S-EPMC10994007 | biostudies-literature
S-EPMC9364503 | biostudies-literature