
Dataset Information


Prediction of LncRNA Subcellular Localization with Deep Learning from Sequence Features.


ABSTRACT: Long non-coding RNAs (lncRNAs) are involved in biological processes throughout the cell, including the nucleus, chromatin and cytosol. However, most lncRNAs remain unannotated, and their functional annotation is difficult because of their low conservation and their tissue- and development-specific expression. The subcellular localization of a lncRNA is highly informative about its biological function, but it is difficult to determine because few prediction methods currently exist. While protein subcellular localization prediction is a well-established research field, lncRNA localization prediction is a novel research problem. We developed DeepLncRNA, a deep learning algorithm that predicts lncRNA subcellular localization directly from lncRNA transcript sequences. We analyzed 93 strand-specific RNA-seq samples of nuclear and cytosolic fractions from multiple cell types to identify differentially localized lncRNAs. We then extracted sequence-based features from the lncRNAs to construct our DeepLncRNA model, which achieved an accuracy of 72.4%, a sensitivity of 83%, a specificity of 62.4% and an area under the receiver operating characteristic curve of 0.787. Our results suggest that primary sequence motifs are a major driving force in the subcellular localization of lncRNAs.
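
The abstract does not say which sequence-based features were extracted. As a minimal sketch only, the Python below computes normalized k-mer frequencies, one common way to turn a transcript sequence into a fixed-length feature vector. The function name `kmer_frequencies`, the choice of k, and the example sequence are all hypothetical and are not taken from the DeepLncRNA work.

```python
from collections import Counter
from itertools import product


def kmer_frequencies(sequence, k=3):
    """Return the relative frequency of every RNA k-mer in `sequence`.

    Assumption: k-mer frequencies stand in for the unspecified
    "sequence-based features"; this is an illustration, not the
    authors' feature set.
    """
    # Normalize case and treat DNA-style input (T) as RNA (U).
    seq = sequence.upper().replace("T", "U")
    # Enumerate all 4**k possible k-mers so the output length is fixed.
    kmers = ["".join(p) for p in product("ACGU", repeat=k)]
    # Count every overlapping window of length k.
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    # Windows containing ambiguous bases (e.g. N) are ignored in the total.
    total = sum(counts[m] for m in kmers)
    return {m: (counts[m] / total if total else 0.0) for m in kmers}


# Hypothetical toy sequence, not a real lncRNA.
features = kmer_frequencies("AUGGCUAGCUAGGCUA", k=2)
```

A vector built this way has one entry per possible k-mer (16 entries for k=2), so transcripts of different lengths map to inputs of the same size, which is what a fixed-input classifier needs.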

SUBMITTER: Gudenas BL 

PROVIDER: S-EPMC6219567 | biostudies-literature | 2018 Nov

REPOSITORIES: biostudies-literature


Publications

Prediction of LncRNA Subcellular Localization with Deep Learning from Sequence Features.

Brian L. Gudenas, Liangjiang Wang

Scientific Reports, 2018-11-06, issue 1



Similar Datasets

| S-EPMC7604748 | biostudies-literature
| S-EPMC6612824 | biostudies-other
| S-EPMC8621699 | biostudies-literature
| S-EPMC2040162 | biostudies-literature
| S-BSST732 | biostudies-other
2022-02-15 | PXD019987 | Pride
| S-EPMC6030869 | biostudies-literature