Dataset Information

PredLnc-GFStack: A Global Sequence Feature Based on a Stacked Ensemble Learning Method for Predicting lncRNAs from Transcripts.


ABSTRACT: Long non-coding RNAs (lncRNAs) are a class of RNAs longer than 200 base pairs (bp) that do not encode proteins but nevertheless have many vital biological functions. The development of high-throughput sequencing technology has led to the discovery of a large number of novel transcripts, so computational methods for lncRNA prediction are in great demand. In this paper, we consider global sequence features and propose a stacked ensemble learning-based method, abbreviated as PredLnc-GFStack, to predict lncRNAs from transcripts. We extract the critical features from the candidate feature list using a genetic algorithm (GA) and then employ stacked ensemble learning to construct the PredLnc-GFStack model. Computational experiments show that PredLnc-GFStack outperforms several state-of-the-art methods for lncRNA prediction. Furthermore, PredLnc-GFStack demonstrates an outstanding ability for cross-species ncRNA prediction.
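
The GA-based feature selection step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `ga_select`, the operators (truncation selection, single-point crossover, bit-flip mutation, two-individual elitism), all parameter values, and the toy fitness function are assumptions chosen for illustration. In practice the fitness would more plausibly be a cross-validated score of a classifier trained on the selected features.

```python
import random

def ga_select(n_features, fitness, pop_size=20, generations=30,
              mutation_rate=0.05, seed=0):
    """Return the best 0/1 feature mask found by a simple genetic algorithm.

    Hypothetical helper for illustration; requires n_features >= 2.
    """
    rng = random.Random(seed)
    # Random initial population of binary masks (1 = keep the feature).
    pop = [[rng.randint(0, 1) for _ in range(n_features)]
           for _ in range(pop_size)]
    for _ in range(generations):
        ranked = sorted(pop, key=fitness, reverse=True)
        # Elitism: carry the two best masks over unchanged.
        next_pop = [ranked[0][:], ranked[1][:]]
        # Truncation selection: only the top half may become parents.
        parents = ranked[: pop_size // 2]
        while len(next_pop) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n_features)  # single-point crossover
            child = a[:cut] + b[cut:]
            # Bit-flip mutation, applied independently to each gene.
            child = [1 - g if rng.random() < mutation_rate else g
                     for g in child]
            next_pop.append(child)
        pop = next_pop
    return max(pop, key=fitness)

# Toy fitness (an assumption): reward three "informative" feature
# indices and penalise every selected feature slightly.
INFORMATIVE = {0, 3, 5}

def toy_fitness(mask):
    hits = sum(mask[i] for i in INFORMATIVE)
    return hits - 0.1 * sum(mask)

best = ga_select(n_features=10, fitness=toy_fitness)
print(best, toy_fitness(best))
```

The resulting mask would then pick the feature columns passed to the base learners of a stacked ensemble, whose predictions are combined by a meta-learner.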

SUBMITTER: Liu S 

PROVIDER: S-EPMC6770532 | biostudies-literature | 2019 Sep

REPOSITORIES: biostudies-literature

Publications

PredLnc-GFStack: A Global Sequence Feature Based on a Stacked Ensemble Learning Method for Predicting lncRNAs from Transcripts.

Liu Shuai, Zhao Xiaohan, Zhang Guangyan, Li Weiyang, Liu Feng, Liu Shichao, Zhang Wen

Genes, 2019-09-03, issue 9



Similar Datasets

| S-EPMC6331124 | biostudies-literature
| S-EPMC9377622 | biostudies-literature
| S-EPMC7294416 | biostudies-literature
| S-EPMC7911732 | biostudies-literature
| S-EPMC10080841 | biostudies-literature
| S-EPMC8110930 | biostudies-literature
| S-EPMC7814738 | biostudies-literature
| S-EPMC7984626 | biostudies-literature
| S-EPMC4909287 | biostudies-literature
| S-EPMC8119477 | biostudies-literature