Ontology highlight
ABSTRACT:
SUBMITTER: Turjanski P
PROVIDER: S-EPMC7184844 | biostudies-literature | 2018 Dec
REPOSITORIES: biostudies-literature
Turjanski Pablo P Ferreiro Diego U DU
The journal of physical chemistry. B 20181008 49
All known terrestrial proteins are coded as continuous strings of ≈20 amino acids. The patterns formed by the repetitions of elements in groups of finite sequences describes the natural architectures of protein families. We present a method to search for patterns and groupings of patterns in protein sequences using a mathematically precise definition for "repetition", an efficient algorithmic implementation and a robust scoring system with no adjustable parameters. We show that the sequence patt ...[more]