Unknown

Dataset Information

0

Combinatorial analysis for sequence and spatial motif discovery in short sequence fragments.


ABSTRACT: Motifs are overrepresented sequence or spatial patterns appearing in proteins. They often play important roles in maintaining protein stability and in facilitating protein function. When motifs are located in short sequence fragments, as in transmembrane domains that are only 6-20 residues in length, and when there is only very limited data, it is difficult to identify motifs. In this study, we introduce combinatorial models based on permutation for assessing statistically significant sequence and spatial patterns in short sequences. We show that our method can uncover previously unknown sequence and spatial motifs in beta-barrel membrane proteins and that our method outperforms existing methods in detecting statistically significant motifs in this data set. Last, we discuss implications of motif analysis for problems involving short sequences in other families of proteins.

SUBMITTER: Jackups R 

PROVIDER: S-EPMC3417775 | biostudies-literature | 2010 Jul-Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Combinatorial analysis for sequence and spatial motif discovery in short sequence fragments.

Jackups Ronald R   Liang Jie J  

IEEE/ACM transactions on computational biology and bioinformatics 20100701 3


Motifs are overrepresented sequence or spatial patterns appearing in proteins. They often play important roles in maintaining protein stability and in facilitating protein function. When motifs are located in short sequence fragments, as in transmembrane domains that are only 6-20 residues in length, and when there is only very limited data, it is difficult to identify motifs. In this study, we introduce combinatorial models based on permutation for assessing statistically significant sequence a  ...[more]

Similar Datasets

| S-EPMC2703457 | biostudies-literature
| S-EPMC3534220 | biostudies-literature
| S-EPMC1933137 | biostudies-literature
| S-EPMC1524906 | biostudies-other
| S-EPMC6030882 | biostudies-literature
| S-EPMC3556059 | biostudies-literature
| S-EPMC3112114 | biostudies-literature
| S-EPMC3287487 | biostudies-literature
| S-EPMC6042083 | biostudies-other
| S-EPMC6238301 | biostudies-literature