Unknown

Dataset Information

0

ORFLine: a bioinformatic pipeline to prioritise small open reading frames identifies candidate secreted small proteins from lymphocytes.


ABSTRACT:

Motivation

The annotation of small open reading frames (smORFs) of less than 100 codons (<300 nucleotides) is challenging due to the large number of such sequences in the genome.

Results

In this study, we developed a computational pipeline, which we have named ORFLine, that stringently identifies smORFs and classifies them according to their position within transcripts. We identified a total of 5744 unique smORFs in datasets from mouse B and T lymphocytes and systematically characterized them using ORFLine. We further searched smORFs for the presence of a signal peptide, which predicted known secreted chemokines as well as novel micropeptides. Four novel micropeptides show evidence of secretion and are therefore candidate mediators of immunoregulatory functions.

Availability

Freely available on the web at https://github.com/boboppie/ORFLine.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Hu F 

PROVIDER: S-EPMC8504629 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

2021-04-28 | GSE154491 | GEO
| PRJNA646476 | ENA
2019-07-03 | GSE125218 | GEO
2014-09-11 | E-GEOD-60384 | biostudies-arrayexpress
| S-EPMC3334604 | biostudies-literature
| S-EPMC7085969 | biostudies-literature
| S-EPMC10152738 | biostudies-literature
2014-09-11 | GSE60384 | GEO
2020-03-14 | GSE131650 | GEO
| S-EPMC4359375 | biostudies-literature