Unknown

Dataset Information

0

RE-MuSiC: a tool for multiple sequence alignment with regular expression constraints.


ABSTRACT: RE-MuSiC is a web-based multiple sequence alignment tool that can incorporate biological knowledge about structure, function, or conserved patterns regarding the sequences of interest. It accepts amino acid or nucleic acid sequences and a set of constraints as inputs. The constraints are pattern descriptions, instead of exact positions of fragments to be aligned together. The output is an alignment where for each pattern (constraint), an occurrence on each sequence can be found aligned together with those on the other sequences, in a manner that the overall alignment is optimized. Its predecessor, MuSiC, has been found useful by researchers since its release in 2004. However, it is noticed in applications that the pattern formulation adopted in MuSiC, namely, plain strings allowing mismatches, is not expressive and flexible enough. The constraint formulation adopted in RE-MuSiC is therefore enhanced to be regular expressions, which is convenient in expressing many biologically significant patterns like those collected in the PROSITE database, or structural consensuses that often involve variable ranges between conserved parts. Experiments demonstrate that RE-MuSiC can be used to help predict important residues and locate phylogenetically conserved structural elements. RE-MuSiC is available on-line at http://140.113.239.131/RE-MUSIC.

SUBMITTER: Chung YS 

PROVIDER: S-EPMC1933182 | biostudies-literature | 2007 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

RE-MuSiC: a tool for multiple sequence alignment with regular expression constraints.

Chung Yun-Sheng YS   Lee Wei-Hsun WH   Tang Chuan Yi CY   Lu Chin Lung CL  

Nucleic acids research 20070508 Web Server issue


RE-MuSiC is a web-based multiple sequence alignment tool that can incorporate biological knowledge about structure, function, or conserved patterns regarding the sequences of interest. It accepts amino acid or nucleic acid sequences and a set of constraints as inputs. The constraints are pattern descriptions, instead of exact positions of fragments to be aligned together. The output is an alignment where for each pattern (constraint), an occurrence on each sequence can be found aligned together  ...[more]

Similar Datasets

| S-EPMC1579236 | biostudies-literature
| S-EPMC3799466 | biostudies-literature
| S-EPMC2951093 | biostudies-literature
| S-EPMC5624947 | biostudies-literature
| S-EPMC11008887 | biostudies-literature
| S-EPMC6330207 | biostudies-literature
| S-EPMC6657586 | biostudies-literature
| S-EPMC4599319 | biostudies-literature
| S-EPMC8289385 | biostudies-literature
| S-EPMC6151001 | biostudies-literature