Dataset Information

MiSTAR: miRNA target prediction through modeling quantitative and qualitative miRNA binding site information in a stacked model structure.

ABSTRACT: In microRNA (miRNA) target prediction, typically two levels of information need to be modeled: the number of potential miRNA binding sites present in a target mRNA and the genomic context of each individual site. Single model structures insufficiently cope with this complex training data structure, consisting of feature vectors of unequal length as a consequence of the varying number of miRNA binding sites in different mRNAs. To circumvent this problem, we developed a two-layered, stacked model, in which the influence of binding site context is separately modeled. Using logistic regression and random forests, we applied the stacked model approach to a unique data set of 7990 probed miRNA-mRNA interactions, hereby including the largest number of miRNAs in model training to date. Compared to lower-complexity models, a particular stacked model, named miSTAR (miRNA stacked model target prediction; www.mi-star.org), displays a higher general performance and precision on top scoring predictions. More importantly, our model outperforms published and widely used miRNA target prediction algorithms. Finally, we highlight flaws in cross-validation schemes for evaluation of miRNA target prediction models and adopt a more fair and stringent approach.

SUBMITTER: Van Peer G

PROVIDER: S-EPMC5397177 | biostudies-literature | 2017 Apr

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

miSTAR: miRNA target prediction through modeling quantitative and qualitative miRNA binding site information in a stacked model structure.

Van Peer Gert G De Paepe Ayla A Stock Michiel M Anckaert Jasper J Volders Pieter-Jan PJ Vandesompele Jo J De Baets Bernard B Waegeman Willem W

Nucleic acids research 20170401 7

In microRNA (miRNA) target prediction, typically two levels of information need to be modeled: the number of potential miRNA binding sites present in a target mRNA and the genomic context of each individual site. Single model structures insufficiently cope with this complex training data structure, consisting of feature vectors of unequal length as a consequence of the varying number of miRNA binding sites in different mRNAs. To circumvent this problem, we developed a two-layered, stacked model, ...[more]

PMID: 27986855

Dataset Information

MiSTAR: miRNA target prediction through modeling quantitative and qualitative miRNA binding site information in a stacked model structure.

Publications

miSTAR: miRNA target prediction through modeling quantitative and qualitative miRNA binding site information in a stacked model structure.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

Position-wise binding preference is important for miRNA target site prediction.
| S-EPMC8453239 | biostudies-literature

MicroRNA Target Site Identification by Integrating Sequence and Binding Information
2013-05-25 | E-GEOD-46611 | biostudies-arrayexpress

MicroRNA Target Site Identification by Integrating Sequence and Binding Information
2013-05-25 | GSE46611 | GEO

Dual modality feature fused neural network integrating binding site information for drug target affinity prediction.
| S-EPMC11775287 | biostudies-literature

MicroRNA target site identification by integrating sequence and binding information.
| S-EPMC3818907 | biostudies-literature

mirMark: a site-level and UTR-level classifier for miRNA target prediction.
| S-EPMC4243195 | biostudies-literature

Cas9-chromatin binding information enables more accurate CRISPR off-target prediction.
| S-EPMC4605288 | biostudies-literature

BindingSiteDTI: differential-scale binding site modelling for drug-target interaction prediction.
| S-EPMC11256917 | biostudies-literature

Prediction of miRNA-Disease Associations by Cascade Forest Model Based on Stacked Autoencoder.
| S-EPMC10343850 | biostudies-literature

Using Attribution Sequence Alignment to Interpret Deep Learning Models for miRNA Binding Site Prediction.
| S-EPMC10045089 | biostudies-literature