Dataset Information

GeneValidator: identify problems with protein-coding gene predictions.


ABSTRACT:

Genomes of emerging model organisms are now being sequenced at very low cost. However, obtaining accurate gene predictions remains challenging: even the best gene prediction algorithms make substantial errors, which can jeopardize subsequent analyses. Many predicted genes must therefore be visually inspected and manually curated, which is time-consuming. We developed GeneValidator (GV) to automatically identify problematic gene predictions and to aid manual curation. For each gene, GV performs multiple analyses based on comparisons to gene sequences from large databases. The resulting report identifies problematic gene predictions and includes extensive statistics and graphs for each prediction to guide manual curation efforts. GV thus accelerates and enhances the work of biocurators and researchers who need accurate gene predictions from newly sequenced genomes.

Availability and implementation

GV can be used through a web interface or from the command line. GV is open-source (AGPL) and available at https://wurmlab.github.io/tools/genevalidator

Contact

y.wurm@qmul.ac.uk

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Dragan MA 

PROVIDER: S-EPMC4866521 | biostudies-literature | 2016 May

REPOSITORIES: biostudies-literature

Publications

GeneValidator: identify problems with protein-coding gene predictions.

Drăgan Monica-Andreea, Moghul Ismail, Priyam Anurag, Bustos Claudio, Wurm Yannick

Bioinformatics (Oxford, England), 2016-01-18, issue 10


Similar Datasets

2016-09-24 | GSE87233 | GEO
| S-EPMC9250521 | biostudies-literature
| S-EPMC1941744 | biostudies-literature
| S-EPMC3636199 | biostudies-literature
2023-01-08 | GSE211000 | GEO
| S-EPMC3517517 | biostudies-literature
| S-EPMC6128939 | biostudies-literature
| S-EPMC10448985 | biostudies-literature
| S-EPMC2699501 | biostudies-literature
2023-01-08 | GSE210999 | GEO