Unknown

Dataset Information

0

RNA secondary structure prediction from sequence alignments using a network of k-nearest neighbor classifiers.


ABSTRACT: We present a machine learning method (a hierarchical network of k-nearest neighbor classifiers) that uses an RNA sequence alignment in order to predict a consensus RNA secondary structure. The input to the network is the mutual information, the fraction of complementary nucleotides, and a novel consensus RNAfold secondary structure prediction of a pair of alignment columns and its nearest neighbors. Given this input, the network computes a prediction as to whether a particular pair of alignment columns corresponds to a base pair. By using a comprehensive test set of 49 RFAM alignments, the program KNetFold achieves an average Matthews correlation coefficient of 0.81. This is a significant improvement compared with the secondary structure prediction methods PFOLD and RNAalifold. By using the example of archaeal RNase P, we show that the program can also predict pseudoknot interactions.

SUBMITTER: Bindewald E 

PROVIDER: S-EPMC1383574 | biostudies-literature | 2006 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

RNA secondary structure prediction from sequence alignments using a network of k-nearest neighbor classifiers.

Bindewald Eckart E   Shapiro Bruce A BA  

RNA (New York, N.Y.) 20060301 3


We present a machine learning method (a hierarchical network of k-nearest neighbor classifiers) that uses an RNA sequence alignment in order to predict a consensus RNA secondary structure. The input to the network is the mutual information, the fraction of complementary nucleotides, and a novel consensus RNAfold secondary structure prediction of a pair of alignment columns and its nearest neighbors. Given this input, the network computes a prediction as to whether a particular pair of alignment  ...[more]

Similar Datasets

| S-EPMC6191722 | biostudies-literature
| S-EPMC514602 | biostudies-literature
| S-EPMC4968729 | biostudies-literature
| S-EPMC5449625 | biostudies-literature
| S-EPMC9344895 | biostudies-literature
| S-EPMC10639048 | biostudies-literature
| S-EPMC4501258 | biostudies-literature
| S-EPMC2621365 | biostudies-literature
| S-EPMC9035839 | biostudies-literature
| S-EPMC10044938 | biostudies-literature