Unknown

Dataset Information

0

Enhancing prediction of short linear protein motifs with Wregex 3.0.


ABSTRACT: Short linear motifs (SLiMs) play an important role in protein-protein interactions. However, SLiM patterns are intrinsically permissive and result into many matches that occur just by chance, specially when targeting large datasets. To prioritize these matches as candidates for functional testing, we developed Wregex (Weighted regular expression), which uses a position-specific scoring matrix (PSSM) to order a list of regular expression matches according to a PSSM-derived score. Here we present Wregex 3.0, an improved version with new functionalities such as the support for a second auxiliary motif to help refining prediction of a primary SLiM, and post-translational modifications (PTMs) enrichment taking into account that many regulatory SLiM-mediated interactions are modulated by one or more PTMs. This version also incorporates a number of new features such as a convenient use of subproteomes, showing UniProt annotations such as disordered regions, searching for all known motifs and generating decoy databases for enrichment analysis. We provide case studies to illustrate how these new Wregex functionalities enhance prediction of short linear protein motifs. The Wregex 3.0 server is freely accessible at https://ehubio.ehu.eus/wregex3/.

SUBMITTER: Prieto G 

PROVIDER: S-EPMC11318550 | biostudies-literature | 2024 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Enhancing prediction of short linear protein motifs with Wregex 3.0.

Prieto Gorka G   Rodríguez Jose A JA   Fullaondo Asier A  

Computational and structural biotechnology journal 20240717


Short linear motifs (SLiMs) play an important role in protein-protein interactions. However, SLiM patterns are intrinsically permissive and result into many matches that occur just by chance, specially when targeting large datasets. To prioritize these matches as candidates for functional testing, we developed Wregex (Weighted regular expression), which uses a position-specific scoring matrix (PSSM) to order a list of regular expression matches according to a PSSM-derived score. Here we present  ...[more]

Similar Datasets

| S-EPMC5345135 | biostudies-literature
| S-EPMC4654906 | biostudies-literature
| S-EPMC10542142 | biostudies-literature
| S-EPMC10746173 | biostudies-literature
| S-EPMC1665647 | biostudies-literature
| S-EPMC2702822 | biostudies-literature
| S-EPMC9737320 | biostudies-literature
| S-EPMC4495300 | biostudies-literature
| S-EPMC3125787 | biostudies-literature
| S-EPMC7416808 | biostudies-literature