Unknown

Dataset Information

0

Amino acid substitution scoring matrices specific to intrinsically disordered regions in proteins.


ABSTRACT: An amino acid substitution scoring matrix encapsulates the rates at which various amino acid residues in proteins are substituted by other amino acid residues, over time. Database search methods make use of substitution scoring matrices to identify sequences with homologous relationships. However, widely used substitution scoring matrices, such as BLOSUM series, have been developed using aligned blocks that are mostly devoid of disordered regions in proteins. Hence, these substitution-scoring matrices are mostly inappropriate for homology searches involving proteins enriched with disordered regions as the disordered regions have distinct amino acid compositional bias, and therefore expected to have undergone amino acid substitutions that are distinct from those in the ordered regions. We, therefore, developed a novel series of substitution scoring matrices referred to as EDSSMat by exclusively considering the substitution frequencies of amino acids in the disordered regions of the eukaryotic proteins. The newly developed matrices were tested for their ability to detect homologs of proteins enriched with disordered regions by means of SSEARCH tool. The results unequivocally demonstrate that EDSSMat matrices detect more number of homologs than the widely used BLOSUM, PAM and other standard matrices, indicating their utility value for homology searches of intrinsically disordered proteins.

SUBMITTER: Trivedi R 

PROVIDER: S-EPMC6841959 | biostudies-literature | 2019 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Amino acid substitution scoring matrices specific to intrinsically disordered regions in proteins.

Trivedi Rakesh R   Nagarajaram Hampapathalu Adimurthy HA  

Scientific reports 20191108 1


An amino acid substitution scoring matrix encapsulates the rates at which various amino acid residues in proteins are substituted by other amino acid residues, over time. Database search methods make use of substitution scoring matrices to identify sequences with homologous relationships. However, widely used substitution scoring matrices, such as BLOSUM series, have been developed using aligned blocks that are mostly devoid of disordered regions in proteins. Hence, these substitution-scoring ma  ...[more]

Similar Datasets

| S-EPMC3103576 | biostudies-literature
| S-EPMC7586916 | biostudies-literature
| S-EPMC9250585 | biostudies-literature
| S-EPMC4490338 | biostudies-literature
| S-EPMC3949125 | biostudies-literature
| S-EPMC4824835 | biostudies-literature
| S-EPMC2630510 | biostudies-literature
| S-EPMC8445205 | biostudies-literature
| S-EPMC6954741 | biostudies-literature
| S-EPMC3904525 | biostudies-other