Dataset Information

Distinguishing mirtrons from canonical miRNAs with data exploration and machine learning methods.

ABSTRACT: Mirtrons are non-canonical microRNAs encoded in introns; their biogenesis starts with splicing. They are not processed by Drosha and enter the canonical pathway at the Exportin-5 level. Mirtrons are much less evolutionarily conserved than canonical miRNAs. Because of these differences, canonical miRNA predictors are not applicable to mirtron prediction. Identifying the differences is important for designing mirtron prediction algorithms and may help to improve the understanding of mirtron function. So far, only simple, single-feature comparisons have been reported, and these are insensitive to complex feature relations. We quantified miRNAs with 25 features and showed that it is impossible to distinguish the two miRNA species using simple thresholds on any single feature. However, under Principal Component Analysis, mirtrons and canonical miRNAs group separately. Moreover, several methodologically diverse machine learning classifiers delivered high classification performance. Using feature selection algorithms, we found features (e.g. bulges in the stem region) that were previously reported as divergent between the two classes but did not contribute to improving classification accuracy, which suggests that they are not biologically meaningful. Finally, we proposed a combination of the most important features (including guanine content, hairpin free energy and hairpin length) that conveys a specific pattern crucial for identifying mirtrons.
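
To make "quantifying miRNAs with features" concrete, here is a minimal sketch of computing a few per-hairpin features from a sequence and its dot-bracket secondary structure. It is not the authors' pipeline. The function name `extract_features`, the `HairpinFeatures` container and the toy hairpin are hypothetical. The paired-base fraction is an illustrative stand-in, not one of the 25 features named in the record. Hairpin free energy is omitted because it requires an external folding tool.

```python
from dataclasses import dataclass


@dataclass
class HairpinFeatures:
    length: int              # hairpin length in nucleotides
    guanine_content: float   # fraction of G in the sequence
    paired_fraction: float   # fraction of bases in a pair (illustrative only)


def extract_features(sequence: str, structure: str) -> HairpinFeatures:
    """Compute three simple per-hairpin features.

    sequence  -- RNA sequence (A, C, G, U); a DNA 'T' is treated as 'U'
    structure -- dot-bracket string of the same length as the sequence
    """
    seq = sequence.upper().replace("T", "U")
    if not seq:
        raise ValueError("empty sequence")
    if len(seq) != len(structure):
        raise ValueError("sequence and structure must have equal length")

    length = len(seq)
    guanine = seq.count("G") / length
    paired = sum(ch in "()" for ch in structure) / length
    return HairpinFeatures(length, guanine, paired)


# Made-up toy hairpin for illustration; not a real miRNA or mirtron.
toy = extract_features("GGGAAAUCCC", "(((....)))")
print(toy)
```

Feature vectors of this kind, one per hairpin, are what a Principal Component Analysis or a classifier would then take as input.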

SUBMITTER: Rorbach G

PROVIDER: S-EPMC5953923 | biostudies-literature | 2018 May

REPOSITORIES: biostudies-literature

Publications

Distinguishing mirtrons from canonical miRNAs with data exploration and machine learning methods.

Grzegorz Rorbach, Olgierd Unold, Bogumil M Konopka

Scientific Reports, 2018-05-15, issue 1

PMID: 29765080

Similar Datasets

Distinguishing reproductive phasiRNAs in grasses from other identically-sized small RNAs using machine learning methods
2018-07-23 | GSE108105 | GEO

Distinguishing Discoid and Centripetal Levallois methods through machine learning.
S-EPMC7757815 | biostudies-literature

Distinguishing butchery cut marks from crocodile bite marks through machine learning methods.
S-EPMC5893542 | biostudies-literature

Exploration of machine learning methods to predict systemic lupus erythematosus hospitalizations.
S-EPMC9547899 | biostudies-literature

Machine Learning Methods for Histopathological Image Analysis.
S-EPMC6158771 | biostudies-other

Distinguishing Learning Rules with Brain Machine Interfaces.
S-EPMC10129057 | biostudies-literature

Quantifying performance of machine learning methods for neuroimaging data.
S-EPMC6688909 | biostudies-literature

Circulating miRNAs and Machine Learning for Lateralizing Primary Aldosteronism
2024-10-31 | GSE264578 | GEO

Exploration of geochemical data with compositional canonical biplots.
S-EPMC7839972 | biostudies-literature

Machine learning and deep learning methods that use omics data for metastasis prediction.
S-EPMC8450182 | biostudies-literature