Unknown

Dataset Information

0

Exploring the adenylation domain repertoire of nonribosomal peptide synthetases using an ensemble of sequence-search methods.


ABSTRACT: The introduction of two-dimension (2D) graphs and their numerical characterization for comparative analyses of DNA/RNA and protein sequences without the need of sequence alignments is an active yet recent research topic in bioinformatics. Here, we used a 2D artificial representation (four-color maps) with a simple numerical characterization through topological indices (TIs) to aid the discovering of remote homologous of Adenylation domains (A-domains) from the Nonribosomal Peptide Synthetases (NRPS) class in the proteome of the cyanobacteria Microcystis aeruginosa. Cyanobacteria are a rich source of structurally diverse oligopeptides that are predominantly synthesized by NPRS. Several A-domains share amino acid identities lower than 20 % being a possible source of remote homologous. Therefore, A-domains cannot be easily retrieved by BLASTp searches using a single template. To cope with the sequence diversity of the A-domains we have combined homology-search methods with an alignment-free tool that uses protein four-color-maps. TI2BioP (Topological Indices to BioPolymers) version 2.0, available at http://ti2biop.sourceforge.net/ allowed the calculation of simple TIs from the protein sequences (four-color maps). Such TIs were used as input predictors for the statistical estimations required to build the alignment-free models. We concluded that the use of graphical/numerical approaches in cooperation with other sequence search methods, like multi-templates BLASTp and profile HMM, can give the most complete exploration of the repertoire of highly diverse protein families.

SUBMITTER: Aguero-Chapin G 

PROVIDER: S-EPMC3712989 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

altmetric image

Publications

Exploring the adenylation domain repertoire of nonribosomal peptide synthetases using an ensemble of sequence-search methods.

Agüero-Chapin Guillermin G   Molina-Ruiz Reinaldo R   Maldonado Emanuel E   de la Riva Gustavo G   Sánchez-Rodríguez Aminael A   Vasconcelos Vitor V   Antunes Agostinho A  

PloS one 20130716 7


The introduction of two-dimension (2D) graphs and their numerical characterization for comparative analyses of DNA/RNA and protein sequences without the need of sequence alignments is an active yet recent research topic in bioinformatics. Here, we used a 2D artificial representation (four-color maps) with a simple numerical characterization through topological indices (TIs) to aid the discovering of remote homologous of Adenylation domains (A-domains) from the Nonribosomal Peptide Synthetases (N  ...[more]

Similar Datasets

| S-EPMC10041650 | biostudies-literature
| S-EPMC4795001 | biostudies-literature
| S-EPMC1253831 | biostudies-literature
| S-EPMC6632073 | biostudies-literature
| S-EPMC4177296 | biostudies-literature
| S-EPMC4760355 | biostudies-literature
| S-EPMC4502403 | biostudies-literature
| S-EPMC10443035 | biostudies-literature
| S-EPMC5457498 | biostudies-literature
| S-EPMC3238681 | biostudies-literature