Unknown

Dataset Information

0

NoLogo: a new statistical model highlights the diversity and suggests new classes of Crm1-dependent nuclear export signals.


ABSTRACT: BACKGROUND:Crm1-dependent Nuclear Export Signals (NESs) are clusters of alternating hydrophobic and non-hydrophobic amino acid residues between 10 to 15 amino acids in length. NESs were largely thought to follow simple consensus patterns, based on which they were categorized into 6-10 classes. However, newly discovered NESs often deviate from the established consensus patterns. Thus, identifying NESs within protein sequences remains a bioinformatics challenge. RESULTS:We describe a probabilistic representation of NESs using a new generative model we call NoLogo that can account for a large diversity of NESs. Using this model to predict NESs, we demonstrate improved performance over PSSM and GLAM2 models, but do not achieve the performance of the state-of-the-art NES predictor LocNES. Our findings illustrate that over 30% of NESs are best described by novel NES classes rather than the 6-10 classes proposed by current/existing models. Finally, many NESs have additional hydrophobic residues either upstream or downstream of the canonical four residues, suggesting possible functionality. CONCLUSION:Applying the NoLogo model highlights the observation that NESs are more diverse than previously appreciated. Our work questions the practice of assigning each NES to one of several predefined NES classes. Finally, our analysis suggests a novel and testable biophysical perspective on interaction between Crm1 receptor and Crm1-dependent NESs.

SUBMITTER: Liku ME 

PROVIDER: S-EPMC5828312 | biostudies-literature | 2018 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

NoLogo: a new statistical model highlights the diversity and suggests new classes of Crm1-dependent nuclear export signals.

Liku Muluye E ME   Legere Elizabeth-Ann EA   Moses Alan M AM  

BMC bioinformatics 20180227 1


<h4>Background</h4>Crm1-dependent Nuclear Export Signals (NESs) are clusters of alternating hydrophobic and non-hydrophobic amino acid residues between 10 to 15 amino acids in length. NESs were largely thought to follow simple consensus patterns, based on which they were categorized into 6-10 classes. However, newly discovered NESs often deviate from the established consensus patterns. Thus, identifying NESs within protein sequences remains a bioinformatics challenge.<h4>Results</h4>We describe  ...[more]

Similar Datasets

| S-EPMC5358978 | biostudies-literature
| S-EPMC7525811 | biostudies-literature
| S-EPMC517610 | biostudies-other
| S-EPMC6232958 | biostudies-literature
| S-EPMC3756925 | biostudies-literature
| S-EPMC3388738 | biostudies-literature
| S-EPMC4360530 | biostudies-literature
| S-EPMC3437623 | biostudies-literature
| S-EPMC2171279 | biostudies-literature
| S-EPMC3591659 | biostudies-literature