Dataset Information

Mammalian microRNA prediction through a support vector machine model of sequence and structure.


ABSTRACT:

Background

MicroRNAs (miRNAs) are endogenous small noncoding RNA gene products, on average 22 nt long, found in a wide variety of organisms. They play important regulatory roles by targeting mRNAs for degradation or translational repression. The May 2007 release of the miRBase database lists 377 known mouse miRNAs and 475 known human miRNAs, the majority of which are conserved between the two species. Several recent reports suggest that many mammalian miRNAs remain to be discovered. Because additional miRNAs may be expressed at lower levels or in more specialized contexts, genome sequence information should be exploited to accelerate their discovery.

Methodology/principal findings

In this article, we describe a computational method, mirCoS, that uses three support vector machine models sequentially to discover new miRNA candidates in mammalian genomes based on sequence, secondary structure, and conservation. mirCoS can efficiently detect the majority of known miRNAs and predicts an extensive set of hairpin structures based on human-mouse comparisons. In total, 3476 mouse candidates and 3441 human candidates were found. These hairpins are more similar to known miRNAs than to negative controls in several aspects not considered by the prediction algorithm. A significant fraction of predictions is supported by existing expression evidence.
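The idea of applying several classifiers one after another can be illustrated with a minimal sketch. It is not the mirCoS implementation: the features (GC content, fraction of paired bases, human-mouse identity), the assignment of one feature type per stage, the weights, the thresholds, and all names such as `Candidate`, `LinearStage`, and `cascade` are assumptions made for illustration. Each stage stands in for a trained support vector machine by using a linear decision function, and only candidates accepted by one stage are passed to the next.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass
class Candidate:
    name: str
    sequence: str   # RNA sequence of the candidate hairpin
    structure: str  # dot-bracket secondary structure, same length as sequence
    ortholog: str   # aligned sequence from the other species, same length


def sequence_features(c: Candidate) -> List[float]:
    # Hypothetical sequence feature: GC content.
    return [sum(base in "GC" for base in c.sequence) / len(c.sequence)]


def structure_features(c: Candidate) -> List[float]:
    # Hypothetical structure feature: fraction of paired positions.
    return [sum(ch in "()" for ch in c.structure) / len(c.structure)]


def conservation_features(c: Candidate) -> List[float]:
    # Hypothetical conservation feature: per-position identity to the ortholog.
    matches = sum(a == b for a, b in zip(c.sequence, c.ortholog))
    return [matches / len(c.sequence)]


@dataclass
class LinearStage:
    """Stand-in for one trained SVM: accept when w . x + b > 0."""
    features: Callable[[Candidate], Sequence[float]]
    weights: Sequence[float]
    bias: float

    def accepts(self, c: Candidate) -> bool:
        x = self.features(c)
        score = sum(w * v for w, v in zip(self.weights, x)) + self.bias
        return score > 0


def cascade(candidates: Sequence[Candidate],
            stages: Sequence[LinearStage]) -> List[Candidate]:
    """Apply the stages in order; each stage filters the survivors of the last."""
    survivors = list(candidates)
    for stage in stages:
        survivors = [c for c in survivors if stage.accepts(c)]
    return survivors


# Made-up weights and biases; a real model would learn these from training data.
STAGES = [
    LinearStage(sequence_features, [1.0], -0.3),
    LinearStage(structure_features, [1.0], -0.5),
    LinearStage(conservation_features, [1.0], -0.8),
]

# Toy candidates, not real miRNA hairpins.
CANDIDATES = [
    Candidate("hairpin_a", "GGCAUUUUUGCC", "((((....))))", "GGCAUUUUAGCC"),
    Candidate("unstructured", "GGCAUUUUUGCC", "............", "GGCAUUUUUGCC"),
    Candidate("diverged", "GGCAUUUUUGCC", "((((....))))", "AAUAUCCCUGAA"),
]

if __name__ == "__main__":
    print([c.name for c in cascade(CANDIDATES, STAGES)])
```

In this toy run only `hairpin_a` passes all three stages: `unstructured` is rejected by the structure stage and `diverged` by the conservation stage. One reason to filter sequentially is that later stages only see candidates the earlier ones accepted, so they have fewer candidates to score.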

Conclusions/significance

Using a novel approach, mirCoS performs comparably to or better than existing miRNA prediction methods, and contributes a significant number of new candidate miRNAs for experimental verification.

SUBMITTER: Sheng Y 

PROVIDER: S-EPMC1978525 | biostudies-literature | 2007 Sep

REPOSITORIES: biostudies-literature

Publications

Mammalian microRNA prediction through a support vector machine model of sequence and structure.

Sheng Y, Engström PG, Lenhard B

PLoS ONE, 2007-09-26, 9



Similar Datasets

| S-EPMC1594580 | biostudies-literature
| S-EPMC4106887 | biostudies-literature
| S-EPMC1159118 | biostudies-literature
| S-EPMC4394448 | biostudies-literature
| S-EPMC4352747 | biostudies-other
| S-EPMC1534064 | biostudies-literature
| S-EPMC6854775 | biostudies-literature
| S-EPMC2220009 | biostudies-literature
| S-EPMC10030784 | biostudies-literature
| S-EPMC3264588 | biostudies-literature