Unknown

Dataset Information

0

An integrative method for identifying the over-annotated protein-coding genes in microbial genomes.


ABSTRACT: The falsely annotated protein-coding genes have been deemed one of the major causes accounting for the annotating errors in public databases. Although many filtering approaches have been designed for the over-annotated protein-coding genes, some are questionable due to the resultant increase in false negative. Furthermore, there is no webserver or software specifically devised for the problem of over-annotation. In this study, we propose an integrative algorithm for detecting the over-annotated protein-coding genes in microorganisms. Overall, an average accuracy of 99.94% is achieved over 61 microbial genomes. The extremely high accuracy indicates that the presented algorithm is efficient to differentiate the protein-coding genes from the non-coding open reading frames. Abundant analyses show that the predicting results are reliable and the integrative algorithm is robust and convenient. Our analysis also indicates that the over-annotated protein-coding genes can cause the false positive of horizontal gene transfers detection. The webserver of the proposed algorithm can be freely accessible from www.cbi.seu.edu.cn/RPGM.

SUBMITTER: Yu JF 

PROVIDER: S-EPMC3223076 | biostudies-literature | 2011 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

An integrative method for identifying the over-annotated protein-coding genes in microbial genomes.

Yu Jia-Feng JF   Xiao Ke K   Jiang Dong-Ke DK   Guo Jing J   Wang Ji-Hua JH   Sun Xiao X  

DNA research : an international journal for rapid publication of reports on genes and genomes 20110908 6


The falsely annotated protein-coding genes have been deemed one of the major causes accounting for the annotating errors in public databases. Although many filtering approaches have been designed for the over-annotated protein-coding genes, some are questionable due to the resultant increase in false negative. Furthermore, there is no webserver or software specifically devised for the problem of over-annotation. In this study, we propose an integrative algorithm for detecting the over-annotated  ...[more]

Similar Datasets

| S-EPMC4811333 | biostudies-literature
| S-EPMC77393 | biostudies-literature
| S-EPMC4216110 | biostudies-literature
| S-EPMC2687780 | biostudies-literature
2014-08-21 | GSE60095 | GEO
| S-EPMC2794547 | biostudies-literature
2014-08-21 | E-GEOD-60095 | biostudies-arrayexpress
| S-EPMC5181535 | biostudies-literature
| S-EPMC5741054 | biostudies-literature
| S-EPMC6847864 | biostudies-literature