Dataset Information

An alignment-free domain architecture similarity search (ADASS) algorithm for inferring homology between multi-domain proteins.

ABSTRACT: Annotations of the genes and their products are largely guided by inferring homology. Sequence similarity is the primary measure used for annotation purpose however, the domain content and order were given less importance albeit the fact that domain insertion, deletion, positional changes can bring in functional varieties. Of late, several methods developed quantify domain architecture similarity depending on alignments of their sequences and are focused on only homologous proteins. We present an alignment-free domain architecture-similarity search (ADASS) algorithm that identifies proteins that share very poor sequence similarity yet having similar domain architectures. We introduce a "singlet matching-triplet comparison" method in ADASS, wherein triplet of domains is compared with other triplets in a pair-wise comparison of two domain architectures. Different events in the triplet comparison are scored as per a scoring scheme and an average pairwise distance score (Domain Architecture Distance score - DAD Score) is calculated between protein domains architectures. We use domain architectures of a selected domain termed as centric domain and cluster them based on DAD score. The algorithm has high Positive Prediction Value (PPV) with respect to the clustering of the sequences of selected domain architectures. A comparison of domain architecture based dendrograms using ADASS method and an existing method revealed that ADASS can classify proteins depending on the extent of domain architecture level similarity. ADASS is more relevant in cases of proteins with tiny domains having little contribution to the overall sequence similarity but contributing significantly to the overall function.

SUBMITTER: Syamaladevi DP

PROVIDER: S-EPMC3705623 | biostudies-literature | 2013

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

An alignment-free domain architecture similarity search (ADASS) algorithm for inferring homology between multi-domain proteins.

Syamaladevi Divya P DP Joshi Adwait A Sowdhamini Ramanathan R

Bioinformation 20130608 10

Annotations of the genes and their products are largely guided by inferring homology. Sequence similarity is the primary measure used for annotation purpose however, the domain content and order were given less importance albeit the fact that domain insertion, deletion, positional changes can bring in functional varieties. Of late, several methods developed quantify domain architecture similarity depending on alignments of their sequences and are focused on only homologous proteins. We present a ...[more]

PMID: 23861564

Dataset Information

An alignment-free domain architecture similarity search (ADASS) algorithm for inferring homology between multi-domain proteins.

Publications

An alignment-free domain architecture similarity search (ADASS) algorithm for inferring homology between multi-domain proteins.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Multi-membrane search algorithm.
| S-EPMC8648127 | biostudies-literature

Alignment algorithm for homology modeling and threading.
| S-EPMC2143918 | biostudies-other

SigAlign: an alignment algorithm guided by explicit similarity criteria.
| S-EPMC11347165 | biostudies-literature

PSimScan: algorithm and utility for fast protein similarity search.
| S-EPMC3591303 | biostudies-literature

Protein intrinsically disordered region prediction by combining neural architecture search and multi-objective genetic algorithm.
| S-EPMC10483879 | biostudies-literature

Using homology relations within a database markedly boosts protein sequence similarity search.
| S-EPMC4460465 | biostudies-literature

Alignment-free local structural search by writhe decomposition.
| S-EPMC2859133 | biostudies-literature

Integrating alignment-based and alignment-free sequence similarity measures for biological sequence classification.
| S-EPMC4410667 | biostudies-literature

An Alignment-Free Algorithm in Comparing the Similarity of Protein Sequences Based on Pseudo-Markov Transition Probabilities among Amino Acids.
| S-EPMC5137889 | biostudies-literature

Greedy 3-Point Search (G3PS)-A Novel Algorithm for Pharmacophore Alignment.
| S-EPMC8658842 | biostudies-literature