Unknown

Dataset Information

0

Annotation of the M. tuberculosis hypothetical orfeome: adding functional information to more than half of the uncharacterized proteins.


ABSTRACT: The genome of Mycobacterium tuberculosis (H37Rv) contains 4,019 protein coding genes, of which more than thousand have been categorized as 'hypothetical' implying that for these not even weak functional associations could be identified so far. We here predict reliable functional indications for half of this large hypothetical orfeome: 497 genes can be annotated based on orthology, and another 125 can be linked to interacting proteins via integrated genomic context analysis and literature mining. The assignments include newly identified clusters of interacting proteins, hypothetical genes that are associated to well known pathways and putative disease-relevant targets. All together, we have raised the fraction of the proteome with at least some functional annotation to 88% which should considerably enhance the interpretation of large-scale experiments targeting this medically important organism.

SUBMITTER: Doerks T 

PROVIDER: S-EPMC3317503 | biostudies-literature | 2012

REPOSITORIES: biostudies-literature

altmetric image

Publications

Annotation of the M. tuberculosis hypothetical orfeome: adding functional information to more than half of the uncharacterized proteins.

Doerks Tobias T   van Noort Vera V   Minguez Pablo P   Bork Peer P  

PloS one 20120402 4


The genome of Mycobacterium tuberculosis (H37Rv) contains 4,019 protein coding genes, of which more than thousand have been categorized as 'hypothetical' implying that for these not even weak functional associations could be identified so far. We here predict reliable functional indications for half of this large hypothetical orfeome: 497 genes can be annotated based on orthology, and another 125 can be linked to interacting proteins via integrated genomic context analysis and literature mining.  ...[more]

Similar Datasets

| S-EPMC3877243 | biostudies-literature
| S-EPMC3397526 | biostudies-literature
| S-EPMC7394047 | biostudies-literature
| S-EPMC545604 | biostudies-literature
| S-EPMC6528289 | biostudies-literature