Unknown

Dataset Information

0

Noncoding RNA gene detection using comparative sequence analysis.


ABSTRACT:

Background

Noncoding RNA genes produce transcripts that exert their function without ever producing proteins. Noncoding RNA gene sequences do not have strong statistical signals, unlike protein coding genes. A reliable general purpose computational genefinder for noncoding RNA genes has been elusive.

Results

We describe a comparative sequence analysis algorithm for detecting novel structural RNA genes. The key idea is to test the pattern of substitutions observed in a pairwise alignment of two homologous sequences. A conserved coding region tends to show a pattern of synonymous substitutions, whereas a conserved structural RNA tends to show a pattern of compensatory mutations consistent with some base-paired secondary structure. We formalize this intuition using three probabilistic "pair-grammars": a pair stochastic context free grammar modeling alignments constrained by structural RNA evolution, a pair hidden Markov model modeling alignments constrained by coding sequence evolution, and a pair hidden Markov model modeling a null hypothesis of position-independent evolution. Given an input pairwise sequence alignment (e.g. from a BLASTN comparison of two related genomes) we classify the alignment into the coding, RNA, or null class according to the posterior probability of each class.

Conclusions

We have implemented this approach as a program, QRNA, which we consider to be a prototype structural noncoding RNA genefinder. Tests suggest that this approach detects noncoding RNA genes with a fair degree of reliability.

SUBMITTER: Rivas E 

PROVIDER: S-EPMC64605 | biostudies-literature | 2001

REPOSITORIES: biostudies-literature

altmetric image

Publications

Noncoding RNA gene detection using comparative sequence analysis.

Rivas E E   Eddy S R SR  

BMC bioinformatics 20011010


<h4>Background</h4>Noncoding RNA genes produce transcripts that exert their function without ever producing proteins. Noncoding RNA gene sequences do not have strong statistical signals, unlike protein coding genes. A reliable general purpose computational genefinder for noncoding RNA genes has been elusive.<h4>Results</h4>We describe a comparative sequence analysis algorithm for detecting novel structural RNA genes. The key idea is to test the pattern of substitutions observed in a pairwise ali  ...[more]

Similar Datasets

| S-EPMC10336498 | biostudies-literature
| S-EPMC84932 | biostudies-literature
| S-EPMC4232354 | biostudies-literature
| S-EPMC4294734 | biostudies-literature
| S-EPMC3205578 | biostudies-literature
| S-EPMC140344 | biostudies-literature
| S-EPMC310816 | biostudies-literature
| S-EPMC3891352 | biostudies-literature
| S-EPMC4800405 | biostudies-literature
| S-EPMC5429720 | biostudies-literature