Unknown

Dataset Information

0

SNAD: Sequence Name Annotation-based Designer.


ABSTRACT: BACKGROUND: A growing diversity of biological data is tagged with unique identifiers (UIDs) associated with polynucleotides and proteins to ensure efficient computer-mediated data storage, maintenance, and processing. These identifiers, which are not informative for most people, are often substituted by biologically meaningful names in various presentations to facilitate utilization and dissemination of sequence-based knowledge. This substitution is commonly done manually that may be a tedious exercise prone to mistakes and omissions. RESULTS: Here we introduce SNAD (Sequence Name Annotation-based Designer) that mediates automatic conversion of sequence UIDs (associated with multiple alignment or phylogenetic tree, or supplied as plain text list) into biologically meaningful names and acronyms. This conversion is directed by precompiled or user-defined templates that exploit wealth of annotation available in cognate entries of external databases. Using examples, we demonstrate how this tool can be used to generate names for practical purposes, particularly in virology. CONCLUSION: A tool for controllable annotation-based conversion of sequence UIDs into biologically meaningful names and acronyms has been developed and placed into service, fostering links between quality of sequence annotation, and efficiency of communication and knowledge dissemination among researchers.

SUBMITTER: Sidorov IA 

PROVIDER: S-EPMC2739203 | biostudies-literature | 2009

REPOSITORIES: biostudies-literature

altmetric image

Publications

SNAD: Sequence Name Annotation-based Designer.

Sidorov Igor A IA   Reshetov Denis A DA   Gorbalenya Alexander E AE  

BMC bioinformatics 20090814


<h4>Background</h4>A growing diversity of biological data is tagged with unique identifiers (UIDs) associated with polynucleotides and proteins to ensure efficient computer-mediated data storage, maintenance, and processing. These identifiers, which are not informative for most people, are often substituted by biologically meaningful names in various presentations to facilitate utilization and dissemination of sequence-based knowledge. This substitution is commonly done manually that may be a te  ...[more]

Similar Datasets

| S-EPMC3065684 | biostudies-literature
| S-EPMC2688272 | biostudies-literature
| S-EPMC6513161 | biostudies-literature
| S-EPMC5267466 | biostudies-literature
| S-EPMC8359369 | biostudies-literature
| S-EPMC5415185 | biostudies-literature
| S-EPMC4653902 | biostudies-literature
| PRJEB44938 | ENA
| PRJEB44939 | ENA
| PRJEB44941 | ENA