Ontology highlight
ABSTRACT:
SUBMITTER: Atchley WR
PROVIDER: S-EPMC1088356 | biostudies-other | 2005 May
REPOSITORIES: biostudies-other
Atchley William R WR Zhao Jieping J Fernandes Andrew D AD Drüke Tanja T
Proceedings of the National Academy of Sciences of the United States of America 20050425 18
Biological sequences are composed of long strings of alphabetic letters rather than arrays of numerical values. Lack of a natural underlying metric for comparing such alphabetic data significantly inhibits sophisticated statistical analyses of sequences, modeling structural and functional aspects of proteins, and related problems. Herein, we use multivariate statistical analyses on almost 500 amino acid attributes to produce a small set of highly interpretable numeric patterns of amino acid vari ...[more]