Dataset Information

DMfold: A Novel Method to Predict RNA Secondary Structure With Pseudoknots Based on Deep Learning and Improved Base Pair Maximization Principle.

ABSTRACT: While predicting the secondary structure of RNA is vital for researching its function, determining RNA secondary structure is challenging, especially for that with pseudoknots. Typically, several excellent computational methods can be utilized to predict the secondary structure (with or without pseudoknots), but they have their own merits and demerits. These methods can be classified into two categories: the multi-sequence method and the single-sequence method. The main advantage of the multi-sequence method lies in its use of the auxiliary sequences to assist in predicting the secondary structure, but it can only successfully predict in the presence of multiple highly homologous sequences. The single-sequence method is associated with the major merit of easy operation (only need the target sequence to predict secondary structure), but its folding parameters are the common features of diversity RNA, which cannot describe the unique characteristics of RNA, thus potentially resulting in the low prediction accuracy in some RNA. In this paper, "DMfold," a method based on the Deep Learning and Improved Base Pair Maximization Principle, is proposed to predict the secondary structure with pseudoknots, which fully absorbs the advantages and avoids some disadvantages of those two methods. Notably, DMfold could predict the secondary structure of RNA by learning similar RNA in the known structures, which uses the similar RNA sequences instead of the highly homogeneous sequences in the multi-sequence method, thereby reducing the requirement for auxiliary sequences. In DMfold, it only needs to input the target sequence to predict the secondary structure. Its folding parameters are fully extracted automatically by deep learning, which could avoid the lack of folding parameters in the single-sequence method. Experiments show that our method is not only simple to operate, but also improves the prediction accuracy compared to multiple excellent prediction methods. A repository containing our code can be found at https://github.com/linyuwangPHD/RNA-Secondary-Structure-Database.

SUBMITTER: Wang L

PROVIDER: S-EPMC6409321 | biostudies-literature | 2019

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

DMfold: A Novel Method to Predict RNA Secondary Structure With Pseudoknots Based on Deep Learning and Improved Base Pair Maximization Principle.

Wang Linyu L Liu Yuanning Y Zhong Xiaodan X Liu Haiming H Lu Chao C Li Cong C Zhang Hao H

Frontiers in genetics 20190304

While predicting the secondary structure of RNA is vital for researching its function, determining RNA secondary structure is challenging, especially for that with pseudoknots. Typically, several excellent computational methods can be utilized to predict the secondary structure (with or without pseudoknots), but they have their own merits and demerits. These methods can be classified into two categories: the multi-sequence method and the single-sequence method. The main advantage of the multi-se ...[more]

PMID: 30886627

Dataset Information

DMfold: A Novel Method to Predict RNA Secondary Structure With Pseudoknots Based on Deep Learning and Improved Base Pair Maximization Principle.

Publications

DMfold: A Novel Method to Predict RNA Secondary Structure With Pseudoknots Based on Deep Learning and Improved Base Pair Maximization Principle.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets

Similar Datasets

TT2NE: a novel algorithm to predict RNA secondary structures with pseudoknots.
| S-EPMC3152363 | biostudies-literature

McGenus: a Monte Carlo algorithm to predict RNA secondary structures with pseudoknots.
| S-EPMC3561945 | biostudies-literature

CyloFold: secondary structure prediction including pseudoknots.
| S-EPMC2896150 | biostudies-literature

Reversed-phase ion-pair liquid chromatography method for purification of duplex DNA with single base pair resolution.
| S-EPMC3814375 | biostudies-literature

RNA Secondary Structures with Limited Base Pair Span: Exact Backtracking and an Application.
| S-EPMC7823788 | biostudies-literature

Base-pair resolution detection of transcription factor binding site by deep deconvolutional network.
| S-EPMC6184544 | biostudies-literature

Accurate SHAPE-directed RNA secondary structure modeling, including pseudoknots.
| S-EPMC3619282 | biostudies-literature

Highly specific unnatural base pair systems as a third base pair for PCR amplification.
| S-EPMC3315302 | biostudies-literature

Improved Chou-Fasman method for protein secondary structure prediction.
| S-EPMC1780123 | biostudies-literature

Deep RNA sequencing at single base-pair resolution reveals high complexity of the rice transcriptome.
| S-EPMC2860166 | biostudies-literature