Dataset Information

Discovery of lincRNA-encoded Peptides: An Integrated Transcriptomics, Proteomics and Bioinformatics Approach

ABSTRACT: Long noncoding RNA (lncRNA) refers to the family of RNA transcripts that are longer than 200 nucleotides but are thought not to encode proteins. Long intergenic noncoding RNA (lincRNA) is a subset of lncRNA that does not overlap with known genes. Increasing evidence has shown that some of these transcripts do in fact contain open reading frames (ORFs) that code for short peptides with significant functional roles within cells. However, many of these peptides remain unannotated and uncharacterized. This study proposes a workflow integrating proteomics, transcriptomics and bioinformatics specifically for lincRNA-encoded peptide discovery. The workflow was tested on the mouse kidney inner medulla (IM), a region that contains the collecting duct system responsible for regulated water transport. In brief, short peptides (from 2 to 20 kDa) were enriched on a tricine protein gel, trypsinized in-gel, and then analyzed using high-resolution mass spectrometry. However, matching fragment ion mass spectra to peptide sequences requires a reference peptide sequence database, which is not available for noncoding transcripts and must be generated de novo for the sample of interest. We modified the RNA-Seq mapping workflow by filtering out coding reads first to better quantify noncoding transcript expression. In addition, a rule-based ORF prediction was implemented to select the single best predicted ORF per noncoding transcript for constructing the peptide library. Candidates were further evaluated using several quality control criteria and bioinformatics tools. Three candidates, conserved in rat and human, passed all criteria and may be truly novel coding genes. In summary, we present a workflow based on modern transcriptomics and proteomics technologies for lincRNA-encoded peptide discovery. A computational challenge is generating a hypothetical lincRNA-encoded peptide database for matching peptides to mass spectra.
With this workflow, we discovered three previously unannotated peptides in the mouse kidney inner medulla. The same workflow can be applied to any cell or tissue type of interest to quickly advance this research field.
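
The abstract describes selecting one best predicted ORF per noncoding transcript to build the peptide library, but this record does not state the rules used. As a rough illustration only, the sketch below assumes a simple stand-in rule (the longest forward-strand ATG-to-stop ORF, translated with the standard genetic code). The function names `longest_orf` and `translate` and the `min_codons` threshold are hypothetical and are not taken from the study.

```python
from itertools import product

# Standard genetic code, built in TCAG order (stop codons are "*").
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def longest_orf(seq, min_codons=10):
    """Return the longest ATG-to-stop ORF in the three forward frames.

    Assumption: "best ORF" means "longest ORF"; the study's actual rules
    are not given in this record. Returns "" if no ORF qualifies.
    """
    seq = seq.upper().replace("U", "T")
    best = ""
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first in-frame start codon opens the ORF
            elif start is not None and codon in STOP_CODONS:
                orf = seq[start:i + 3]
                coding_codons = len(orf) // 3 - 1  # exclude the stop codon
                if coding_codons >= min_codons and len(orf) > len(best):
                    best = orf
                start = None  # look for the next ORF in this frame
    return best


def translate(orf):
    """Translate a nucleotide ORF; unknown codons (e.g. with N) become X."""
    return "".join(CODON_TABLE.get(orf[i:i + 3], "X")
                   for i in range(0, len(orf) - 2, 3))


if __name__ == "__main__":
    transcript = "CCATGAAATTTGGGTAACC"  # toy sequence, not real data
    orf = longest_orf(transcript, min_codons=3)
    print(orf, translate(orf).rstrip("*"))
```

Running one such function per transcript and collecting the translated peptides into a FASTA file would give a hypothetical search database of the kind the abstract refers to. A real rule set would likely also weigh features this sketch ignores.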

INSTRUMENT(S): Orbitrap Fusion

ORGANISM(S): Mus musculus (mouse)

TISSUE(S): Epithelial Cell, Kidney

SUBMITTER: CHIN-RANG YANG

LAB HEAD: CHIN-RANG YANG

PROVIDER: PXD013892 | Pride | 2020-05-12

REPOSITORIES: Pride

ACCESS DATA

Dataset's files

Showing files 1 - 5 of 22:

	File	Type
	2_01.raw	Raw
	2_02.raw	Raw
	2_03.raw	Raw
	2_04.raw	Raw
	2_05.raw	Raw

Similar Datasets

Integration of genome-wide approaches identifies lncRNAs of adult neural stem cells and their progeny in vivo [RNA-seq]
2013-04-11 | E-GEOD-45278 | biostudies-arrayexpress

Diversity in pervasive translation. A new translational landscape of yeast.
2024-09-04 | PXD040766 | Pride

Expression Profiling of Arabidopsis Long Noncoding RNAs Under Continuous Light Condition
2017-08-11 | GSE80094 | GEO

Evidence for Existence of Multiple Functional Human Small RNAs Derived from Transcripts of Protein-Coding Genes
2023-02-20 | GSE221958 | GEO

The transcriptional and translational landscape of HCoV-OC43 infection
2024-01-17 | GSE252692 | GEO

Identification of endogenous small peptides involved in rice immunity through transcriptomics- and proteomics-based screening
2024-01-08 | GSE252769 | GEO

Identification of endogenous small peptides involved in rice immunity through transcriptomics- and proteomics-based screening
2024-01-08 | GSE252768 | GEO

Identification of endogenous small peptides involved in rice immunity through transcriptomics- and proteomics-based screening
2024-01-08 | GSE252767 | GEO

Identifying interaction partners of noncanonical peptides.
2024-05-23 | PXD037658 | Pride

The IDA-LIKE peptides IDL6 and IDL7 are negative modulators of stress responses in Arabidopsis thaliana
2016-03-31 | E-GEOD-77467 | biostudies-arrayexpress