Unknown

Dataset Information

0

NmSEER V2.0: a prediction tool for 2'-O-methylation sites based on random forest and multi-encoding combination.


ABSTRACT: BACKGROUND:2'-O-methylation (2'-O-me or Nm) is a post-transcriptional RNA methylation modified at 2'-hydroxy, which is common in mRNAs and various non-coding RNAs. Previous studies revealed the significance of Nm in multiple biological processes. With Nm getting more and more attention, a revolutionary technique termed Nm-seq, was developed to profile Nm sites mainly in mRNA with single nucleotide resolution and high sensitivity. In a recent work, supported by the Nm-seq data, we have reported a method in silico for predicting Nm sites, which relies on nucleotide sequence information, and established an online server named NmSEER. More recently, a more confident dataset produced by refined Nm-seq was available. Therefore, in this work, we redesigned the prediction model to achieve a more robust performance on the new data. RESULTS:We redesigned the prediction model from two perspectives, including machine learning algorithm and multi-encoding scheme combination. With optimization by 5-fold cross-validation tests and evaluation by independent test respectively, random forest was selected as the most robust algorithm. Meanwhile, one-hot encoding, together with position-specific dinucleotide sequence profile and K-nucleotide frequency encoding were collectively applied to build the final predictor. CONCLUSIONS:The predictor of updated version, named NmSEER V2.0, achieves an accurate prediction performance (AUROC?=?0.862) and has been settled into a brand-new server, which is available at http://www.rnanut.net/nmseer-v2/ for free.

SUBMITTER: Zhou Y 

PROVIDER: S-EPMC6929462 | biostudies-literature | 2019 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

NmSEER V2.0: a prediction tool for 2'-O-methylation sites based on random forest and multi-encoding combination.

Zhou Yiran Y   Cui Qinghua Q   Zhou Yuan Y  

BMC bioinformatics 20191224 Suppl 25


<h4>Background</h4>2'-O-methylation (2'-O-me or Nm) is a post-transcriptional RNA methylation modified at 2'-hydroxy, which is common in mRNAs and various non-coding RNAs. Previous studies revealed the significance of Nm in multiple biological processes. With Nm getting more and more attention, a revolutionary technique termed Nm-seq, was developed to profile Nm sites mainly in mRNA with single nucleotide resolution and high sensitivity. In a recent work, supported by the Nm-seq data, we have re  ...[more]

Similar Datasets

| S-EPMC4724119 | biostudies-literature
| S-EPMC3530872 | biostudies-other
| S-EPMC4494626 | biostudies-literature
2012-05-09 | E-GEOD-37858 | biostudies-arrayexpress
| S-EPMC4811047 | biostudies-literature
2012-05-10 | GSE37858 | GEO
| S-EPMC3376144 | biostudies-literature
| S-EPMC3429425 | biostudies-literature
2022-05-16 | GSE189510 | GEO