Unknown

Dataset Information

0

SmORFunction: a tool for predicting functions of small open reading frames and microproteins.


ABSTRACT:

Background

Small open reading frame (smORF) is open reading frame with a length of less than 100 codons. Microproteins, translated from smORFs, have been found to participate in a variety of biological processes such as muscle formation and contraction, cell proliferation, and immune activation. Although previous studies have collected and annotated a large abundance of smORFs, functions of the vast majority of smORFs are still unknown. It is thus increasingly important to develop computational methods to annotate the functions of these smORFs.

Results

In this study, we collected 617,462 unique smORFs from three studies. The expression of smORF RNAs was estimated by reannotated microarray probes. Using a speed-optimized correlation algorism, the functions of smORFs were predicted by their correlated genes with known functional annotations. After applying our method to 5 known microproteins from literatures, our method successfully predicted their functions. Further validation from the UniProt database showed that at least one function of 202 out of 270 microproteins was predicted.

Conclusions

We developed a method, smORFunction, to provide function predictions of smORFs/microproteins in at most 265 models generated from 173 datasets, including 48 tissues/cells, 82 diseases (and normal). The tool can be available at https://www.cuilab.cn/smorfunction .

SUBMITTER: Ji X 

PROVIDER: S-EPMC7559452 | biostudies-literature | 2020 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

smORFunction: a tool for predicting functions of small open reading frames and microproteins.

Ji Xiangwen X   Cui Chunmei C   Cui Qinghua Q  

BMC bioinformatics 20201014 1


<h4>Background</h4>Small open reading frame (smORF) is open reading frame with a length of less than 100 codons. Microproteins, translated from smORFs, have been found to participate in a variety of biological processes such as muscle formation and contraction, cell proliferation, and immune activation. Although previous studies have collected and annotated a large abundance of smORFs, functions of the vast majority of smORFs are still unknown. It is thus increasingly important to develop comput  ...[more]

Similar Datasets

| S-EPMC10032668 | biostudies-literature
2021-04-28 | GSE154491 | GEO
| S-EPMC2813248 | biostudies-literature
2019-07-03 | GSE125218 | GEO
| S-EPMC3454372 | biostudies-literature
2014-09-11 | E-GEOD-60384 | biostudies-arrayexpress
| S-EPMC3334604 | biostudies-literature
| S-EPMC7085969 | biostudies-literature
| S-EPMC10152738 | biostudies-literature
2014-09-11 | GSE60384 | GEO