Unknown

Dataset Information

0

Structural homology guided alignment of cysteine rich proteins.


ABSTRACT: BACKGROUND:Cysteine rich protein families are notoriously difficult to align due to low sequence identity and frequent insertions and deletions. RESULTS:Here we present an alignment method that ensures homologous cysteines align by assigning a unique 10 amino acid barcode to those identified as structurally homologous by the DALI webserver. The free inter-cysteine regions of the barcoded sequences can then be aligned using any standard algorithm. Finally the barcodes are replaced with the original columns to yield an alignment which requires the minimum of manual refinement. CONCLUSIONS:Using structural homology information to constrain sequence alignments allows the alignment of highly divergent, repetitive sequences that are poorly dealt with by existing algorithms. Tools are provided to perform this method online using the CysBar web-tool (http://CysBar.science.latrobe.edu.au) and offline (python script available from http://github.com/ts404/CysBar).

SUBMITTER: Shafee TM 

PROVIDER: S-EPMC4709342 | biostudies-literature | 2016

REPOSITORIES: biostudies-literature

altmetric image

Publications

Structural homology guided alignment of cysteine rich proteins.

Shafee Thomas M A TM   Robinson Andrew J AJ   van der Weerden Nicole N   Anderson Marilyn A MA  

SpringerPlus 20160112


<h4>Background</h4>Cysteine rich protein families are notoriously difficult to align due to low sequence identity and frequent insertions and deletions.<h4>Results</h4>Here we present an alignment method that ensures homologous cysteines align by assigning a unique 10 amino acid barcode to those identified as structurally homologous by the DALI webserver. The free inter-cysteine regions of the barcoded sequences can then be aligned using any standard algorithm. Finally the barcodes are replaced  ...[more]

Similar Datasets

| S-EPMC20287 | biostudies-literature
| S-EPMC6638209 | biostudies-literature
| S-EPMC4521993 | biostudies-literature
| S-EPMC7786378 | biostudies-literature
| S-EPMC2268138 | biostudies-literature
| S-EPMC4423139 | biostudies-literature
| S-EPMC6262176 | biostudies-literature
| S-EPMC3537238 | biostudies-literature
| S-EPMC1483079 | biostudies-literature
| S-EPMC2785551 | biostudies-literature