Dataset Information

RBLOSUM performs better than CorBLOSUM with lesser error per query.


ABSTRACT:

Objective

BLOSUM matrices serve as standard matrices for many protein sequence alignment programs. BLOSUM matrices have been constructed using BLOCKS version 5.0 with 27,102 BLOCKS, whereas the latest updated version, 14.3, has 6,739,916 BLOCKS. We read with interest the research article by Hess et al. (BMC Bioinform 17:189, 2016) on CorBLOSUM, wherein it is argued that an inaccuracy in the BLOSUM code affects the cluster memberships of sequences. They show that replacing the integer-based clustering threshold with a floating-point one arguably improves the performance of CorBLOSUM over BLOSUM and RBLOSUM matrices. They compare BLOSUM62 (version 14.3) against RBLOSUM69, with relative entropies of 0.2685 and 0.2662 respectively. The present work attempts to repeat the computation to verify the respective analog matrices.
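The effect of an integer-based clustering threshold can be illustrated with a minimal sketch. It assumes, for illustration only, that the integer variant converts the percentage threshold into a truncated whole number of matching positions. The function names and this exact formulation are hypothetical and are not taken from the BLOSUM or CorBLOSUM source code.

```python
def same_cluster_integer(matches: int, width: int, threshold_pct: float) -> bool:
    """Integer-style test (assumed form): truncate the required match count."""
    # e.g. 62% of a 10-column block is 6.2 matches, truncated to 6
    min_matches = int(threshold_pct * width / 100)
    return matches >= min_matches


def same_cluster_float(matches: int, width: int, threshold_pct: float) -> bool:
    """Floating-point test: compare the actual percent identity to the threshold."""
    percent_identity = 100.0 * matches / width
    return percent_identity >= threshold_pct


# Two segments of width 10 that agree at 6 positions (60% identity),
# tested against a 62% threshold.
print(same_cluster_integer(6, 10, 62))  # True: truncation lowers the bar to 6 matches
print(same_cluster_float(6, 10, 62))    # False: 60% is below 62%
```

Under this assumed form, the two tests disagree only when the threshold does not correspond to a whole number of matching positions, which is how truncation could change cluster membership.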

Results

In our attempt to repeat the computation, we observed that the relative entropy of BLOSUM62 (version 14.3) is 0.2360 and that of BLOSUM50 (version 14.3) is 0.1198. As only matrices of similar entropies can be compared, BLOSUM62 can be compared only with RBLOSUM66, and BLOSUM50 only with RBLOSUM56. We conducted experiments with Astral data sets and demonstrated improved accuracy in coverage. Our results imply that RBLOSUM performs statistically better than CorBLOSUM and BLOSUM matrices.
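As a reminder of what is being matched, the relative entropy of a BLOSUM-style matrix is commonly defined as H = sum over pairs i <= j of q_ij * log2(q_ij / e_ij), where q_ij is the observed pair frequency and e_ij is the frequency expected from the background frequencies (p_i^2 for i = j, 2 * p_i * p_j otherwise). The sketch below follows that definition. The function name and the two-letter toy frequencies are illustrative assumptions, not data from the study.

```python
import math


def relative_entropy(pair_freqs: dict) -> float:
    """Relative entropy (bits) from unordered pair frequencies that sum to 1.

    pair_freqs maps (a, b) with a <= b to the observed frequency q_ab.
    """
    # Background frequency of each residue: p_a = q_aa + sum over b != a of q_ab / 2
    background = {}
    for (a, b), q in pair_freqs.items():
        if a == b:
            background[a] = background.get(a, 0.0) + q
        else:
            background[a] = background.get(a, 0.0) + q / 2
            background[b] = background.get(b, 0.0) + q / 2

    entropy = 0.0
    for (a, b), q in pair_freqs.items():
        if q == 0:
            continue  # a zero-frequency pair contributes nothing to the sum
        expected = background[a] * background[b] * (1 if a == b else 2)
        entropy += q * math.log2(q / expected)
    return entropy


# Toy two-letter alphabet (made-up numbers for illustration only).
conserved = {("A", "A"): 0.4, ("A", "B"): 0.2, ("B", "B"): 0.4}
independent = {("A", "A"): 0.25, ("A", "B"): 0.5, ("B", "B"): 0.25}

print(round(relative_entropy(conserved), 4))  # 0.2781
print(relative_entropy(independent))          # 0.0
```

A matrix built from more strongly conserved pairs has higher relative entropy, which is why matrices are paired by similar entropy before their search performance is compared.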

SUBMITTER: Govindarajan R 

PROVIDER: S-EPMC5963171 | biostudies-literature | 2018 May

REPOSITORIES: biostudies-literature

Publications

RBLOSUM performs better than CorBLOSUM with lesser error per query.

Govindarajan R, Leela BC, Nair AS

BMC Research Notes, 2018-05-21, issue 1


Similar Datasets

| S-EPMC10849680 | biostudies-literature
| S-EPMC7125218 | biostudies-literature
| S-EPMC6760955 | biostudies-literature
| S-EPMC5912751 | biostudies-literature
| S-EPMC2519017 | biostudies-literature
| S-EPMC3408576 | biostudies-literature
| S-EPMC3673918 | biostudies-literature
| S-EPMC7990232 | biostudies-literature
| S-EPMC4614478 | biostudies-literature
| S-EPMC7148641 | biostudies-literature