Unknown

Dataset Information

0

Systematic search for putative new domain families in Mycoplasma gallisepticum genome.


ABSTRACT:

Background

Protein domains are the fundamental units of protein structure, function and evolution. The delineation of different domains in proteins is important for classification, understanding of structure, function and evolution. The delineation of protein domains within a polypeptide chain, namely at the genome scale, can be achieved in several ways but may remain problematic in many instances. Difficulties in identifying the domain content of a given sequence arise when the query sequence has no homologues with experimentally determined structure and searching against sequence domain databases also results in insignificant matches. Identification of domains under low sequence identity conditions and lack of structural homologues acquire a crucial importance especially at the genomic scale.

Findings

We have developed a new method for the identification of domains in unassigned regions through indirect connections and scaled up its application to the analysis of 434 unassigned regions in 726 protein sequences of Mycoplasma gallisepticum genome. We could establish 71 new domain relationships and probable 63 putative new domain families through intermediate sequences in the unassigned regions, which importantly represent an overall 10% increase in PfamA domain annotation over the direct assignment in this genome.

Conclusions

The systematic analysis of the unassigned regions in the Mycoplasma gallisepticum genome has provided some insight into the possible new domain relationships and putative new domain families. Further investigation of these predicted new domains may prove beneficial in improving the existing domain prediction algorithms.

SUBMITTER: Reddy CC 

PROVIDER: S-EPMC2865477 | biostudies-literature | 2010 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Systematic search for putative new domain families in Mycoplasma gallisepticum genome.

Reddy Chilamakuri Cs CC   Rani Sane Sudha SS   Offmann Bernard B   Sowdhamini R R  

BMC research notes 20100412


<h4>Background</h4>Protein domains are the fundamental units of protein structure, function and evolution. The delineation of different domains in proteins is important for classification, understanding of structure, function and evolution. The delineation of protein domains within a polypeptide chain, namely at the genome scale, can be achieved in several ways but may remain problematic in many instances. Difficulties in identifying the domain content of a given sequence arise when the query se  ...[more]

Similar Datasets

| S-EPMC3155973 | biostudies-literature
| S-EPMC173959 | biostudies-other
2010-01-07 | GSE19755 | GEO
| S-EPMC6696634 | biostudies-literature
| S-EPMC5442611 | biostudies-literature
| S-EPMC101673 | biostudies-literature
| S-EPMC4007778 | biostudies-literature
| S-EPMC176583 | biostudies-other
| S-EPMC7162529 | biostudies-literature
2010-04-22 | E-GEOD-19755 | biostudies-arrayexpress