
Dataset Information


Visualizing bacterial tRNA identity determinants and antideterminants using function logos and inverse function logos.


ABSTRACT: Sequence logos are stacked bar graphs that generalize the notion of consensus sequence. They employ entropy statistics very effectively to display variation in a structural alignment of sequences of a common function, while emphasizing its over-represented features. Yet sequence logos cannot display features that distinguish functional subclasses within a structurally related superfamily, nor do they display under-represented features. We introduce two extensions to address these needs: function logos and inverse logos. Function logos display subfunctions that are over-represented among sequences carrying a specific feature. Inverse logos generalize both sequence logos and function logos by displaying under-represented, rather than over-represented, features or functions in structural alignments. To make inverse logos, a compositional inverse is applied to the feature or function frequency distributions before logo construction, where a compositional inverse is a mathematical transform that makes common features or functions rare and vice versa. We applied these methods to a database of structurally aligned bacterial tDNAs to create highly condensed, bird's-eye views of potentially all so-called identity determinants and antideterminants that confer specific amino acid charging or initiator function on tRNAs in bacteria. We recovered both known and a few potentially novel identity elements. Function logos and inverse logos are useful tools for exploratory bioinformatic analysis of structure-function relationships in sequence families and superfamilies.
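
The inverse-logo idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the formula, so this assumes the compositional inverse is the normalized reciprocal of each frequency (the inverse used in compositional data analysis), and it computes letter heights as frequency times column information with no small-sample correction. The function names and the example frequencies are hypothetical and are not taken from the authors' software.

```python
import math


def closure(values):
    """Rescale positive values so that they sum to 1."""
    total = sum(values)
    return [v / total for v in values]


def compositional_inverse(freqs):
    """Normalized reciprocals: common entries become rare and vice versa.

    Assumed definition (not stated in the abstract). Zero frequencies
    have no reciprocal, so they must be replaced by pseudocounts first.
    """
    if any(f <= 0 for f in freqs):
        raise ValueError("frequencies must be strictly positive")
    return closure([1.0 / f for f in freqs])


def logo_heights(freqs):
    """Letter heights in bits for one logo column.

    Column information is log2(alphabet size) minus Shannon entropy;
    each letter receives its frequency's share of that information.
    """
    entropy = -sum(f * math.log2(f) for f in freqs if f > 0)
    information = math.log2(len(freqs)) - entropy
    return [f * information for f in freqs]


# Hypothetical frequencies of A, C, G, U in one alignment column.
column = [0.70, 0.10, 0.10, 0.10]
inverse = compositional_inverse(column)

print("logo heights:        ", logo_heights(column))
print("inverse logo heights:", logo_heights(inverse))
```

In this sketch the over-represented letter dominates the ordinary column, while the inverse column gives the most weight to the rarer letters. The same two functions would apply to a function logo if the input were the frequencies of functional classes among sequences carrying a given feature, again under the assumptions stated above.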

SUBMITTER: Freyhult E 

PROVIDER: S-EPMC1363773 | biostudies-literature | 2006

REPOSITORIES: biostudies-literature

Publications

Visualizing bacterial tRNA identity determinants and antideterminants using function logos and inverse function logos.

Eva Freyhult, Vincent Moulton, David H. Ardell

Nucleic Acids Research, 2006-02-09, issue 3


Similar Datasets

| S-EPMC6103679 | biostudies-literature
| S-EPMC5738739 | biostudies-literature
| S-EPMC3242769 | biostudies-literature
| S-EPMC3401359 | biostudies-literature
| S-EPMC7884595 | biostudies-literature
| S-EPMC4117946 | biostudies-literature
| S-EPMC2253709 | biostudies-literature
| S-EPMC3064791 | biostudies-literature
| S-EPMC10796397 | biostudies-literature
| S-EPMC4318935 | biostudies-literature