Unknown

Dataset Information

0

PiRNN: deep learning algorithm for piRNA prediction.


ABSTRACT: Piwi-interacting RNAs (piRNAs) are the largest class of small non-coding RNAs discovered in germ cells. Identifying piRNAs from small RNA data is a challenging task due to the lack of conserved sequences and structural features of piRNAs. Many programs have been developed to identify piRNA from small RNA data. However, these programs have limitations. They either rely on extracting complicated features, or only demonstrate strong performance on transposon related piRNAs. Here we proposed a new program called piRNN for piRNA identification. For our software, we applied a convolutional neural network classifier that was trained on the datasets from four different species (Caenorhabditis elegans, Drosophila melanogaster, rat and human). A matrix of k-mer frequency values was used to represent each sequence. piRNN has great usability and shows better performance in comparison with other programs. It is freely available at https://github.com/bioinfolabmu/piRNN.

SUBMITTER: Wang K 

PROVIDER: S-EPMC6078063 | biostudies-literature | 2018

REPOSITORIES: biostudies-literature

altmetric image

Publications

piRNN: deep learning algorithm for piRNA prediction.

Wang Kai K   Hoeksema Joshua J   Liang Chun C  

PeerJ 20180803


Piwi-interacting RNAs (piRNAs) are the largest class of small non-coding RNAs discovered in germ cells. Identifying piRNAs from small RNA data is a challenging task due to the lack of conserved sequences and structural features of piRNAs. Many programs have been developed to identify piRNA from small RNA data. However, these programs have limitations. They either rely on extracting complicated features, or only demonstrate strong performance on transposon related piRNAs. Here we proposed a new p  ...[more]

Similar Datasets

| S-EPMC8497544 | biostudies-literature
| S-EPMC5445391 | biostudies-literature
| S-EPMC9279906 | biostudies-literature
2023-03-31 | GSE165175 | GEO
2021-06-22 | GSE175456 | GEO
2023-03-31 | GSE165174 | GEO
2023-03-31 | GSE165173 | GEO
| S-EPMC8637112 | biostudies-literature