Unknown

Dataset Information

0

Computer-based annotation of putative AraC/XylS-family transcription factors of known structure but unknown function.


ABSTRACT: Currently, about 20 crystal structures per day are released and deposited in the Protein Data Bank. A significant fraction of these structures is produced by research groups associated with the structural genomics consortium. The biological function of many of these proteins is generally unknown or not validated by experiment. Therefore, a growing need for functional prediction of protein structures has emerged. Here we present an integrated bioinformatics method that combines sequence-based relationships and three-dimensional (3D) structural similarity of transcriptional regulators with computer prediction of their cognate DNA binding sequences. We applied this method to the AraC/XylS family of transcription factors, which is a large family of transcriptional regulators found in many bacteria controlling the expression of genes involved in diverse biological functions. Three putative new members of this family with known 3D structure but unknown function were identified for which a probable functional classification is provided. Our bioinformatics analyses suggest that they could be involved in plant cell wall degradation (Lin2118 protein from Listeria innocua, PDB code 3oou), symbiotic nitrogen fixation (protein from Chromobacterium violaceum, PDB code 3oio), and either metabolism of plant-derived biomass or nitrogen fixation (protein from Rhodopseudomonas palustris, PDB code 3mn2).

SUBMITTER: Schuller A 

PROVIDER: S-EPMC3312330 | biostudies-literature | 2012

REPOSITORIES: biostudies-literature

altmetric image

Publications

Computer-based annotation of putative AraC/XylS-family transcription factors of known structure but unknown function.

Schüller Andreas A   Slater Alex W AW   Norambuena Tomás T   Cifuentes Juan J JJ   Almonacid Leonardo I LI   Melo Francisco F  

Journal of biomedicine & biotechnology 20120313


Currently, about 20 crystal structures per day are released and deposited in the Protein Data Bank. A significant fraction of these structures is produced by research groups associated with the structural genomics consortium. The biological function of many of these proteins is generally unknown or not validated by experiment. Therefore, a growing need for functional prediction of protein structures has emerged. Here we present an integrated bioinformatics method that combines sequence-based rel  ...[more]

Similar Datasets

| S-EPMC9178005 | biostudies-literature
| S-EPMC4983702 | biostudies-literature
| S-EPMC6445133 | biostudies-literature
2021-10-13 | GSE183001 | GEO
| S-EPMC7478709 | biostudies-literature
| S-EPMC4135657 | biostudies-literature
| S-EPMC2832520 | biostudies-literature
| S-EPMC135190 | biostudies-literature
| S-EPMC7455731 | biostudies-literature
| S-EPMC93790 | biostudies-literature