Unknown

Dataset Information

0

MU-PseUDeep: A deep learning method for prediction of pseudouridine sites.


ABSTRACT: Pseudouridine synthase binds to uridine sites and catalyzes the conversion of uridine to pseudouridine (?). This binding takes place in a specific context and in the conformation of nucleotides. Most machine-learning methods for ? site classification use nucleotide frequency as a feature, which may not fully depict the relevant conformation around a ? site. Using the power of deep learning and raw sequence, as well as secondary structure features, our tool MU-PseUDeep is designed to capture both the sequence and secondary structure context, which inputs the raw RNA sequence and the predicted secondary structure to two sets of convolutional neural networks. It has shown considerable improvement in ? site prediction over existing tools, XG-PseU, PseUI, and iRNA-PseU for both balanced and imbalanced datasets. To the best of our knowledge, this is the most accurate tool for ? site prediction. We also used MU-PseUDeep to scan the human transcriptome, which shows that the genes with predicted ? sites are enriched in nucleotide and protein binding, as well as in neurodegeneration pathways. The tool is open source, available at https://github.com/smk5g5/MU-PseUDeep.

SUBMITTER: Khan SM 

PROVIDER: S-EPMC7387732 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

altmetric image

Publications

MU-PseUDeep: A deep learning method for prediction of pseudouridine sites.

Khan Saad M SM   He Fei F   Wang Duolin D   Chen Yongbing Y   Xu Dong D  

Computational and structural biotechnology journal 20200715


Pseudouridine synthase binds to uridine sites and catalyzes the conversion of uridine to pseudouridine (Ψ). This binding takes place in a specific context and in the conformation of nucleotides. Most machine-learning methods for Ψ site classification use nucleotide frequency as a feature, which may not fully depict the relevant conformation around a Ψ site. Using the power of deep learning and raw sequence, as well as secondary structure features, our tool MU-PseUDeep is designed to capture both  ...[more]

Similar Datasets

| S-EPMC7901771 | biostudies-literature
| S-EPMC6691328 | biostudies-literature
| S-EPMC8637112 | biostudies-literature
| S-EPMC6205083 | biostudies-literature
| S-EPMC7509169 | biostudies-literature
| S-EPMC8575008 | biostudies-literature
| S-EPMC8240957 | biostudies-literature
| S-EPMC8197219 | biostudies-literature
| S-EPMC7316719 | biostudies-literature
| S-EPMC6838336 | biostudies-literature