Unknown

Dataset Information

0

A novel method for finding tRNA genes.


ABSTRACT: We describe a novel procedure for generating and optimizing pattern descriptors that can be used to find structural motifs in DNA or RNA sequences. This combines a pattern-description language (based primarily on secondary structure alignment and conservation of some key nucleotides) with a scoring function that relies heavily on estimated folding free energies for the secondary structure of interest. For the cloverleaf secondary structure characteristic of tRNA, we show that a fairly simple pattern descriptor can find almost all known tRNA genes in both bacterial and eukaryotic genomes, and that false positives (sequences that match the pattern but that are probably not tRNAs) can be recognized by their high estimated folding free energies. A general procedure for optimizing descriptors (and hence for finding new structural motifs) is also described. For six bacterial, four eukaryotic, and four archaea genome sequences, our results compare favorably with those of the more complex and specialized tRNAscan-SE algorithm. Prospects for using this general approach to find other RNA structural motifs are discussed.

SUBMITTER: Tsui V 

PROVIDER: S-EPMC1370417 | biostudies-literature | 2003 May

REPOSITORIES: biostudies-literature

altmetric image

Publications

A novel method for finding tRNA genes.

Tsui Vickie V   Macke Tom T   Case David A DA  

RNA (New York, N.Y.) 20030501 5


We describe a novel procedure for generating and optimizing pattern descriptors that can be used to find structural motifs in DNA or RNA sequences. This combines a pattern-description language (based primarily on secondary structure alignment and conservation of some key nucleotides) with a scoring function that relies heavily on estimated folding free energies for the secondary structure of interest. For the cloverleaf secondary structure characteristic of tRNA, we show that a fairly simple pat  ...[more]

Similar Datasets

| PRJEB27580 | ENA
| S-EPMC5124285 | biostudies-literature
| S-EPMC7455704 | biostudies-literature
| S-EPMC1137005 | biostudies-literature
2011-04-01 | GSE24169 | GEO
| S-EPMC4191379 | biostudies-literature
| S-EPMC3868979 | biostudies-literature
2011-03-31 | E-GEOD-24169 | biostudies-arrayexpress
| S-EPMC2673418 | biostudies-literature
| S-EPMC6778316 | biostudies-literature