Unknown

Dataset Information

0

Structure prediction of partial-length protein sequences.


ABSTRACT: Protein structure information is essential to understand protein function. Computational methods to accurately predict protein structure from the sequence have primarily been evaluated on protein sequences representing full-length native proteins. Here, we demonstrate that top-performing structure prediction methods can accurately predict the partial structures of proteins encoded by sequences that contain approximately 50% or more of the full-length protein sequence. We hypothesize that structure prediction may be useful for predicting functions of proteins whose corresponding genes are mapped expressed sequence tags (ESTs) that encode partial-length amino acid sequences. Additionally, we identify a confidence score representing the quality of a predicted structure as a useful means of predicting the likelihood that an arbitrary polypeptide sequence represents a portion of a foldable protein sequence ("foldability"). This work has ramifications for the prediction of protein structure with limited or noisy sequence information, as well as genome annotation.

SUBMITTER: Laurenzi A 

PROVIDER: S-EPMC3742278 | biostudies-literature | 2013 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Structure prediction of partial-length protein sequences.

Laurenzi Adrian A   Hung Ling-Hong LH   Samudrala Ram R  

International journal of molecular sciences 20130717 7


Protein structure information is essential to understand protein function. Computational methods to accurately predict protein structure from the sequence have primarily been evaluated on protein sequences representing full-length native proteins. Here, we demonstrate that top-performing structure prediction methods can accurately predict the partial structures of proteins encoded by sequences that contain approximately 50% or more of the full-length protein sequence. We hypothesize that structu  ...[more]

Similar Datasets

| S-EPMC4425021 | biostudies-literature
| S-EPMC2143245 | biostudies-other
| S-EPMC6755676 | biostudies-literature
| S-EPMC3009544 | biostudies-literature
| S-EPMC8670487 | biostudies-literature
| S-EPMC1479845 | biostudies-literature
| S-EPMC2995072 | biostudies-literature
| S-EPMC535295 | biostudies-literature
| S-EPMC2612599 | biostudies-literature
| S-EPMC8769711 | biostudies-literature