Dataset Information

Characterization and prediction of protein nucleolar localization sequences.


ABSTRACT: Although the nucleolar localization of proteins is often believed to be mediated primarily by non-specific retention to core nucleolar components, many examples of short nucleolar targeting sequences have been reported in recent years. In this article, 46 human nucleolar localization sequences (NoLSs) were collated from the literature and subjected to statistical analysis. Of the residues in these NoLSs, 48% are basic, whereas 99% of the residues are predicted to be solvent-accessible, with 42% in α-helix and 57% in coil. The sequence and predicted protein secondary structure of the 46 NoLSs were used to train an artificial neural network to identify NoLSs. At a true positive rate of 54%, the predictor's overall false positive rate (FPR) is estimated to be 1.52%, which can be broken down to FPRs of 0.26% for randomly chosen cytoplasmic sequences, 0.80% for randomly chosen nucleoplasmic sequences and 12% for nuclear localization signals. The predictor was used to predict NoLSs in the complete human proteome, and 10 of the highest scoring previously unknown NoLSs were experimentally confirmed. NoLSs are a prevalent type of targeting motif that is distinct from nuclear localization signals and that can be computationally predicted.
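The abstract reports that 48% of NoLS residues are basic. As a rough illustration of how such a composition statistic might be computed, the sketch below measures the fraction of basic residues in a sequence and in sliding windows along it. It is not the authors' method or their neural-network predictor: the function names, the window width, the toy sequence, and the choice to count K, R and H as basic are all assumptions made for this example, and the article's own definition of "basic" may differ.

```python
from typing import Iterator, Tuple

# Assumption: lysine (K), arginine (R) and histidine (H) count as basic.
DEFAULT_BASIC = "KRH"


def basic_fraction(sequence: str, basic: str = DEFAULT_BASIC) -> float:
    """Return the fraction of residues in `sequence` that appear in `basic`."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence must be non-empty")
    return sum(residue in basic for residue in seq) / len(seq)


def window_scores(sequence: str, width: int) -> Iterator[Tuple[int, float]]:
    """Yield (start_index, basic_fraction) for each window of `width` residues."""
    if width <= 0:
        raise ValueError("width must be positive")
    for start in range(len(sequence) - width + 1):
        yield start, basic_fraction(sequence[start:start + width])


if __name__ == "__main__":
    toy = "MAKRKRSGAAHLE"  # hypothetical sequence, not taken from the dataset
    print(f"overall basic fraction: {basic_fraction(toy):.2f}")
    for start, score in window_scores(toy, width=5):
        print(start, toy[start:start + 5], f"{score:.2f}")
```

A composition scan like this only highlights basic-rich stretches. The predictor described in the abstract also uses predicted secondary structure and a trained network, which this sketch does not attempt to reproduce.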

SUBMITTER: Scott MS 

PROVIDER: S-EPMC2995072 | biostudies-literature | 2010 Nov

REPOSITORIES: biostudies-literature

Publications

Characterization and prediction of protein nucleolar localization sequences.

Michelle S. Scott, François-Michel Boisvert, Mark D. McDowall, Angus I. Lamond, Geoffrey J. Barton

Nucleic Acids Research, 2010-07-26, issue 21


Similar Datasets

| S-EPMC3264635 | biostudies-literature
| S-EPMC533942 | biostudies-literature
| S-EPMC146614 | biostudies-other
| S-EPMC7815533 | biostudies-literature
| S-EPMC532049 | biostudies-literature
| S-EPMC5042399 | biostudies-literature
| S-EPMC168944 | biostudies-literature
| S-EPMC7114095 | biostudies-literature
| S-EPMC2289268 | biostudies-other
| S-EPMC6997756 | biostudies-literature