Unknown

Dataset Information

0

Three reasons protein disorder analysis makes more sense in the light of collagen.


ABSTRACT: We have identified that the collagen helix has the potential to be disruptive to analyses of intrinsically disordered proteins. The collagen helix is an extended fibrous structure that is both promiscuous and repetitive. Whilst its sequence is predicted to be disordered, this type of protein structure is not typically considered as intrinsic disorder. Here, we show that collagen-encoding proteins skew the distribution of exon lengths in genes. We find that previous results, demonstrating that exons encoding disordered regions are more likely to be symmetric, are due to the abundance of the collagen helix. Other related results, showing increased levels of alternative splicing in disorder-encoding exons, still hold after considering collagen-containing proteins. Aside from analyses of exons, we find that the set of proteins that contain collagen significantly alters the amino acid composition of regions predicted as disordered. We conclude that research in this area should be conducted in the light of the collagen helix.

SUBMITTER: Smithers B 

PROVIDER: S-EPMC4838654 | biostudies-literature | 2016 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

Three reasons protein disorder analysis makes more sense in the light of collagen.

Smithers Ben B   Oates Matt E ME   Tompa Peter P   Gough Julian J  

Protein science : a publication of the Protein Society 20160419 5


We have identified that the collagen helix has the potential to be disruptive to analyses of intrinsically disordered proteins. The collagen helix is an extended fibrous structure that is both promiscuous and repetitive. Whilst its sequence is predicted to be disordered, this type of protein structure is not typically considered as intrinsic disorder. Here, we show that collagen-encoding proteins skew the distribution of exon lengths in genes. We find that previous results, demonstrating that ex  ...[more]

Similar Datasets

| S-EPMC2696554 | biostudies-literature
| S-EPMC1181915 | biostudies-literature
| S-EPMC7247512 | biostudies-literature
| S-EPMC8334846 | biostudies-literature
| S-EPMC4061766 | biostudies-literature
| S-EPMC2808345 | biostudies-literature
| S-EPMC5045741 | biostudies-other
| S-EPMC6934763 | biostudies-literature
| S-EPMC11320653 | biostudies-literature
| S-EPMC7408373 | biostudies-literature