Dataset Information

A structure filter for the Eukaryotic Linear Motif Resource.

ABSTRACT:

Background

Many proteins are highly modular, being assembled from globular domains and segments of natively disordered polypeptides. Linear motifs, short sequence modules functioning independently of protein tertiary structure, are most abundant in natively disordered polypeptides but are also found in accessible parts of globular domains, such as exposed loops. The prediction of novel occurrences of known linear motifs attempts the difficult task of distinguishing functional matches from stochastically occurring non-functional matches. Although functionality can only be confirmed experimentally, confidence in a putative motif is increased if a motif exhibits attributes associated with functional instances such as occurrence in the correct taxonomic range, cellular compartment, conservation in homologues and accessibility to interacting partners. Several tools now use these attributes to classify putative motifs based on confidence of functionality.

Results

Current methods assessing motif accessibility do not consider much of the information available, either predicting accessibility from primary sequence or regarding any motif occurring in a globular region as low confidence. We present a method considering accessibility and secondary structural context derived from experimentally solved protein structures to rectify this situation. Putatively functional motif occurrences are mapped onto a representative domain, given that a high quality reference SCOP domain structure is available for the protein itself or a close relative. Candidate motifs can then be scored for solvent-accessibility and secondary structure context. The scores are calibrated on a benchmark set of experimentally verified motif instances compared with a set of random matches. A combined score yields 3-fold enrichment for functional motifs assigned to high confidence classifications and 2.5-fold enrichment for random motifs assigned to low confidence classifications. The structure filter is implemented as a pipeline with both a graphical interface via the ELM resource http://elm.eu.org/ and through a Web Service protocol.

Conclusion

New occurrences of known linear motifs require experimental validation as the bioinformatics tools currently have limited reliability. The ELM structure filter will aid users assessing candidate motifs presenting in globular structural regions. Most importantly, it will help users to decide whether to expend their valuable time and resources on experimental testing of interesting motif candidates.

SUBMITTER: Via A

PROVIDER: S-EPMC2774702 | biostudies-literature | 2009 Oct

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

A structure filter for the Eukaryotic Linear Motif Resource.

Via Allegra A Gould Cathryn M CM Gemünd Christine C Gibson Toby J TJ Helmer-Citterich Manuela M

BMC bioinformatics 20091024

<h4>Background</h4>Many proteins are highly modular, being assembled from globular domains and segments of natively disordered polypeptides. Linear motifs, short sequence modules functioning independently of protein tertiary structure, are most abundant in natively disordered polypeptides but are also found in accessible parts of globular domains, such as exposed loops. The prediction of novel occurrences of known linear motifs attempts the difficult task of distinguishing functional matches fro ...[more]

PMID: 19852836

Dataset Information

A structure filter for the Eukaryotic Linear Motif Resource.

Background

Results

Conclusion

Publications

A structure filter for the Eukaryotic Linear Motif Resource.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

The eukaryotic linear motif resource - 2018 update.
| S-EPMC5753338 | biostudies-literature

The Eukaryotic Linear Motif resource: 2022 release.
| S-EPMC8728146 | biostudies-literature

ELM-the eukaryotic linear motif resource in 2020.
| S-EPMC7145657 | biostudies-literature

ELM-the Eukaryotic Linear Motif resource-2024 update.
| S-EPMC10767929 | biostudies-literature

The eukaryotic linear motif resource ELM: 10 years and counting.
| S-EPMC3964949 | biostudies-literature

The articles.ELM resource: simplifying access to protein linear motif literature by annotation, text-mining and classification.
| S-EPMC7276420 | biostudies-literature

ELM--the database of eukaryotic linear motifs.
| S-EPMC3245074 | biostudies-literature

Holographic photopolymer linear variable filter with enhanced blue reflection.
| S-EPMC3985781 | biostudies-literature

diArk--a resource for eukaryotic genome research.
| S-EPMC1868023 | biostudies-literature

EuPathDB: The Eukaryotic Pathogen Genomics Database Resource.
| S-EPMC7124890 | biostudies-literature