Unknown

Dataset Information

0

Discriminatory power of RNA family models.


ABSTRACT: MOTIVATION: RNA family models group nucleotide sequences that share a common biological function. These models can be used to find new sequences belonging to the same family. To succeed in this task, a model needs to exhibit high sensitivity as well as high specificity. As model construction is guided by a manual process, a number of problems can occur, such as the introduction of more than one model for the same family or poorly constructed models. We explore the Rfam database to discover such problems. RESULTS: Our main contribution is in the definition of the discriminatory power of RNA family models, together with a first algorithm for its computation. In addition, we present calculations across the whole Rfam database that show several families lacking high specificity when compared to other families. We give a list of these clusters of families and provide a tentative explanation. Our program can be used to: (i) make sure that new models are not equivalent to any model already present in the database; and (ii) new models are not simply submodels of existing families. AVAILABILITY: www.tbi.univie.ac.at/software/cmcompare/. The code is licensed under the GPLv3. Results for the whole Rfam database and supporting scripts are available together with the software.

SUBMITTER: Honer zu Siederdissen C 

PROVIDER: S-EPMC2935435 | biostudies-literature | 2010 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Discriminatory power of RNA family models.

Höner zu Siederdissen Christian C   Hofacker Ivo L IL  

Bioinformatics (Oxford, England) 20100901 18


<h4>Motivation</h4>RNA family models group nucleotide sequences that share a common biological function. These models can be used to find new sequences belonging to the same family. To succeed in this task, a model needs to exhibit high sensitivity as well as high specificity. As model construction is guided by a manual process, a number of problems can occur, such as the introduction of more than one model for the same family or poorly constructed models. We explore the Rfam database to discove  ...[more]

Similar Datasets

| S-EPMC2920015 | biostudies-other
| S-EPMC8219380 | biostudies-literature
| S-EPMC85114 | biostudies-literature
| S-EPMC4894298 | biostudies-literature
| S-EPMC7814417 | biostudies-literature
| S-EPMC6075574 | biostudies-literature
| S-EPMC5983439 | biostudies-literature
| S-EPMC8328194 | biostudies-literature
| S-EPMC4011653 | biostudies-other
| S-EPMC7236949 | biostudies-literature