
Dataset Information

FuSe: a tool to move RNA-Seq analyses from chromosomal/gene loci to functional grouping of mRNA transcripts.

ABSTRACT:

Summary

Typical RNA sequencing (RNA-Seq) analyses are performed either at the gene level, by summing all reads from the same locus and assuming that all transcripts from a gene make a protein, or at the transcript level, assuming that each transcript has a unique function. However, these assumptions are flawed: a gene can code for different types of transcripts, and different transcripts can give rise to similar proteins, different proteins or no protein at all. As a consequence, functional changes are not well captured by either gene-level or transcript-level analyses. We propose to improve RNA-Seq analyses by grouping transcripts based on their similar functions. We developed FuSe to predict functional similarities using the primary and secondary structure of proteins. To estimate the likelihood that two proteins have similar functions, FuSe computes two confidence scores for each protein pair: knowledge (KS) and discovery (DS). Overlapping protein pairs exhibiting high confidence are grouped to form 'similar function protein groups', and expression is calculated for each functional group. The impact of using FuSe is demonstrated on in vitro cells exposed to paracetamol (APAP): the analysis highlights genes responsible for cell adhesion and glycogen regulation that traditional analysis methods had earlier shown not to be differentially expressed.
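
The grouping step described in the summary can be illustrated with a minimal Python sketch. This is not FuSe's code: the function names, the score thresholds (`ks_min`, `ds_min`), the rule that a pair counts as high confidence when either score meets its threshold, and the use of a simple sum as the group expression are all assumptions made for illustration. Pairs that share a member are merged into one group with a union-find structure.

```python
from collections import defaultdict


def group_proteins(pairs, ks_min=0.5, ds_min=0.5):
    """Merge overlapping high-confidence pairs into groups.

    pairs: iterable of (id_a, id_b, ks, ds) tuples.
    ks_min / ds_min: hypothetical thresholds, not taken from FuSe.
    """
    parent = {}

    def find(x):
        # Union-find lookup with path halving.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b, ks, ds in pairs:
        root_a, root_b = find(a), find(b)
        # Assumption: either score reaching its threshold is "high confidence".
        if ks >= ks_min or ds >= ds_min:
            parent[root_a] = root_b

    groups = defaultdict(set)
    for member in list(parent):
        groups[find(member)].add(member)
    return sorted(sorted(g) for g in groups.values())


def group_expression(groups, expression):
    """Sum per-member expression values within each group.

    expression: dict mapping member id to an expression value;
    members without a value contribute 0.0.
    """
    return {
        tuple(g): sum(expression.get(member, 0.0) for member in g)
        for g in groups
    }


pairs = [
    ("P1", "P2", 0.9, 0.1),  # high KS: merged
    ("P2", "P3", 0.2, 0.8),  # high DS: merged, overlaps with the pair above
    ("P4", "P5", 0.1, 0.2),  # low confidence: left as singletons
]
groups = group_proteins(pairs)
totals = group_expression(groups, {"P1": 10.0, "P2": 5.0, "P3": 1.0, "P4": 7.0})
```

Here `P1`, `P2` and `P3` end up in one group because the two high-confidence pairs overlap at `P2`, while `P4` and `P5` stay separate. How FuSe actually combines KS and DS, and how it aggregates expression, should be checked against the source code linked under Availability and implementation.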

Availability and implementation

The source code is available at https://github.com/rajinder4489/FuSe. Data for APAP exposure are available in the BioStudies database (http://www.ebi.ac.uk/biostudies) under accession numbers S-HECA143, S-HECA(158) and S-HECA139.

Supplementary information

Supplementary data are available at Bioinformatics online.

SUBMITTER: Gupta R

PROVIDER: S-EPMC8058771 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

Bacterial chromosomal loci move subdiffusively through a viscoelastic cytoplasm.

Project description:Tracking of fluorescently labeled chromosomal loci in live bacterial cells reveals a robust scaling of the mean square displacement (MSD) as τ^0.39. We propose that the observed motion arises from relaxation of the Rouse modes of the DNA polymer within the viscoelastic environment of the cytoplasm. The time-averaged and ensemble-averaged MSD of chromosomal loci exhibit ergodicity, and the velocity autocorrelation function is negative at short time lags. These observations are most consistent with fractional Langevin motion and rule out a continuous time random walk model as an explanation for anomalous motion in vivo.

| S-EPMC4929007 | biostudies-literature

miARma-Seq: a comprehensive tool for miRNA, mRNA and circRNA analysis.

Project description:Large-scale RNAseq has substantially changed the transcriptomics field, as it enables an unprecedented amount of high resolution data to be acquired. However, the analysis of these data still poses a challenge to the research community. Many tools have been developed to overcome this problem, and to facilitate the study of miRNA expression profiles and those of their target genes. While a few of these enable both kinds of analysis to be performed, they also present certain limitations in terms of their requirements and/or the restrictions on data uploading. To avoid these restraints, we have developed a suite that offers the identification of miRNA, mRNA and circRNAs that can be applied to any sequenced organism. Additionally, it enables differential expression, miRNA-mRNA target prediction and/or functional analysis. The miARma-Seq pipeline is presented as a stand-alone tool that is both easy to install and flexible in terms of its use, and that brings together well-established software in a single bundle. Our suite can analyze a large number of samples due to its multithread design. By testing miARma-Seq in validated datasets, we demonstrate here the benefits that can be gained from this tool by making it readily accessible to the research community.

| S-EPMC4863143 | biostudies-literature

Protocol to measure ribosome density along mRNA transcripts of Arabidopsis thaliana tissues using Ribo-seq.

Project description:Ribosome profiling (Ribo-seq) measures ribosome density along messenger RNA (mRNA) transcripts and is used to estimate the "translational fitness" of a given mRNA in response to environmental or developmental cues with high resolution. Here, we describe a protocol for Ribo-seq in plants adapted for the model plant Arabidopsis thaliana. We describe steps for lysis and nucleolytic digestion and ribosome footprinting. We then detail library construction, sequencing, and data analysis.

| S-EPMC10469065 | biostudies-literature

Integrative Analyses of scRNA-seq, Bulk mRNA-seq, and DNA Methylation Profiling in Depressed Suicide Brain Tissues.

Project description:Background: Suicidal behaviors have become a serious public health concern globally due to the economic and human cost of suicidal behavior to individuals, families, communities, and society. However, the underlying etiology and biological mechanism of suicidal behavior remains poorly understood. Methods: We collected different single omic data, including single-cell RNA sequencing (scRNA-seq), bulk mRNA-seq, DNA methylation microarrays from the cortex of Major Depressive Disorder (MDD) in suicide subjects' studies, as well as fluoxetine-treated rats brains. We matched subject IDs that overlapped between the transcriptome dataset and the methylation dataset. The differential expression genes and differentially methylated regions were calculated with a 2-group comparison analysis. Cross-omics analysis was performed to calculate the correlation between the methylated and transcript levels of differentially methylated CpG sites and mapped transcripts. Additionally, we performed a deconvolution analysis for bulk mRNA-seq and DNA methylation profiling with scRNA-seq as the reference profiles. Results: Difference in cell type proportions among 7 cell types. Meanwhile, our analysis of single-cell sequence from the antidepressant-treated rats found that drug-specific differential expression genes were enriched into biological pathways, including ion channels and glutamatergic receptors. Conclusions: This study identified some important dysregulated genes influenced by DNA methylation in 2 brain regions of depression and suicide patients. Interestingly, we found that oligodendrocyte precursor cells (OPCs) have the most contributors for cell-type proportions related to differential expression genes and methylated sites in suicidal behavior.

| S-EPMC10726413 | biostudies-literature

Integrated CNV-seq, karyotyping and SNP-array analyses for effective prenatal diagnosis of chromosomal mosaicism.

Project description:Background: Emerging studies suggest that low-coverage massively parallel copy number variation sequencing (CNV-seq) is more sensitive than chromosomal microarray analysis (CMA) for detecting low-level mosaicism. However, a retrospective back-to-back comparison evaluating accuracy, efficacy, and incremental yield of CNV-seq compared with CMA is warranted. Methods: A total of 72 mosaicism cases identified by karyotyping or CMA were recruited to the study. There were 67 mosaic samples co-analysed by CMA and CNV-seq, comprising 40 with sex chromosome aneuploidy, 22 with autosomal aneuploidy and 5 with large cryptic genomic rearrangements. Results: Of the 67 positive mosaic cases, the levels of mosaicism defined by CNV-seq ranged from 6 to 92% compared to the ratio from 3 to 90% by karyotyping and 20% to 72% by CMA. CNV-seq not only identified all 43 chromosomal aneuploidies or large cryptic genomic rearrangements detected by CMA, but also provided a 34.88% (15/43) increased yield compared with CMA. The improved yield of mosaicism detection by CNV-seq was largely due to the ability to detect low level mosaicism below 20%. Conclusion: In the context of prenatal diagnosis, CNV-seq identified additional and clinically significant mosaicism with enhanced resolution and increased sensitivity. This study provides strong evidence for applying CNV-seq as an alternative to CMA for detection of aneuploidy and mosaic variants.

| S-EPMC7905897 | biostudies-literature

RNA-seq analyses of multiple meristems of soybean: novel and alternative transcripts, evolutionary and functional implications.

Project description:BACKGROUND: Soybean is one of the most important crops, providing large amounts of dietary proteins and edible oil, and is also an excellent model for studying evolution of duplicated genes. However, relative to the model plants Arabidopsis and rice, the present knowledge about soybean transcriptome is quite limited. RESULTS: In this study, we employed RNA-seq to investigate transcriptomes of 11 soybean tissues, for genome-wide discovery of truly expressed genes, and novel and alternative transcripts, as well as analyses of conservation and divergence of duplicated genes and their functional implications. We detected a total of 54,132 high-confidence expressed genes, and identified 6,718 novel transcriptional regions with a mean length of 372 bp. We also provided strong evidence for alternative splicing (AS) events for ~15.9% of the genes with two or more exons. Among them, 1,834 genes exhibited stage-dependent AS, and 202 genes had tissue-biased exon-skipping events. We further defined the conservation and divergence in expression patterns between duplicated gene pairs from recent whole genome duplications (WGDs); differentially expressed genes, tissue preferentially expressed genes, transcription factors and specific gene family members were identified for shoot apical meristem and flower development. CONCLUSIONS: Our results significantly improved soybean gene annotation, and also provide valuable resources for functional genomics and studies of the evolution of duplicated genes from WGDs in soybean.

| S-EPMC4070088 | biostudies-literature

mRNA-seq and miRNA-seq profiling analyses reveal molecular mechanisms regulating induction of fruiting body in Ophiocordyceps sinensis.

Project description:Ophiocordyceps sinensis has been a source of valuable materials in traditional Asian medicine for over two thousand years. With recent global warming and overharvest, however, the availability of these wild fungi has decreased dramatically. While the fruiting body of O. sinensis has been artificially cultivated, the molecular mechanisms that govern the induction of the fruiting body at the transcriptional and post-transcriptional levels are unclear. In this study, we carried out both mRNA and small RNA sequencing to identify crucial genes and miRNA-like RNAs (milRNAs) involved in the development of the fruiting body. A total of 2875 differentially expressed genes (DEGs) and 71 differentially expressed milRNAs (DEMs) were identified among the mycoparasite complex, the sclerotium (ST) and the fruiting body stage. Functional enrichment and Gene Set Enrichment Analysis indicated that the ST had increased oxidative stress and energy metabolism and that mitogen-activated protein kinase signaling might induce the formation of the fruiting body. Integrated analysis of DEGs and DEMs revealed that n_os_milR16, n_os_milR21, n_os_milR34, and n_os_milR90 could be candidate milRNAs that regulate the induction of the fruiting body. This study provides transcriptome-wide insight into the molecular basis of fruiting body formation in O. sinensis and identifies potential candidate genes for improving induction rate.

| S-EPMC8217512 | biostudies-literature

Functional study and epigenetic targets analyses of SIRT1 in intramuscular preadipocytes via ChIP-seq and mRNA-seq.

Project description:The SIRT1 epigenetic regulator is involved in hepatic lipid homoeostasis. However, the role of SIRT1 in regulating intramuscular fat deposition as well as the pathways and potential epigenetic targets involved remain unknown. Herein, we investigate SIRT1 function, its genome-wide epigenetic target profile, and transcriptomic changes under SIRT1 overexpression during yak intramuscular preadipocytes differentiation. To this end, we analysed the relationship between SIRT1 and intramuscular fat content as well as lipid metabolism-related genes in longissimus dorsi tissue. We found that SIRT1 expression negatively correlates with intramuscular fat content as well as with the expression of genes related to lipid synthesis, while positively correlating with that of fatty acid oxidation-involved genes. SIRT1 overexpression in intramuscular preadipocytes significantly reduced adipose differentiation marker expression, intracellular triacylglycerol content, and lipid deposition. Chromatin immunoprecipitation coupled with high-throughput sequencing of H3K4ac (a known direct target of SIRT1) and high-throughput mRNA sequencing results revealed that SIRT1 may regulate intramuscular fat deposition via three potential new transcription factors (NRF1, NKX3.1, and EGR1) and four genes (MAPK1, RXRA, AGPAT1, and HADH) implicated in protein processing within the endoplasmic reticulum pathway and the MAPK signalling pathway in yaks. Our study provides novel insights into the role of SIRT1 in regulating yak intramuscular fat deposition and may help clarify the mechanistic determinants of yak meat characteristics.

| S-EPMC9980681 | biostudies-literature

MGcount: a total RNA-seq quantification tool to address multi-mapping and multi-overlapping alignments ambiguity in non-coding transcripts.

Project description:Background: Total-RNA sequencing (total-RNA-seq) allows the simultaneous study of both the coding and the non-coding transcriptome. Yet, computational pipelines have traditionally focused on particular biotypes, making assumptions that are not fulfilled by total-RNA-seq datasets. Transcripts from distinct RNA biotypes vary in length, biogenesis, and function, can overlap in a genomic region, and may be present in the genome with a high copy number. Consequently, reads from total-RNA-seq libraries may cause ambiguous genomic alignments, demanding flexible quantification approaches. Results: Here we present Multi-Graph count (MGcount), a total-RNA-seq quantification tool combining two strategies for handling ambiguous alignments. First, MGcount assigns reads hierarchically to small-RNA and long-RNA features to account for length disparity when transcripts overlap in the same genomic position. Next, MGcount aggregates RNA products with similar sequences where reads systematically multi-map using a graph-based approach. MGcount outputs a transcriptomic count matrix compatible with RNA-sequencing downstream analysis pipelines, with both bulk and single-cell resolution, and the graphs that model repeated transcript structures for different biotypes. The software can be used as a python module or as a single-file executable program. Conclusions: MGcount is a flexible total-RNA-seq quantification tool that successfully integrates reads that align to multiple genomic locations or that overlap with multiple gene features. Its approach is suitable for the simultaneous estimation of protein-coding, long non-coding and small non-coding transcript concentration, in both precursor and processed forms. Both source code and compiled software are available at https://github.com/hitaandrea/MGcount.

| S-EPMC8760670 | biostudies-literature

mRNA on the move: the road to its biological destiny.

Project description:Cells have evolved to regulate the asymmetric distribution of specific mRNA targets to institute spatial and temporal control over gene expression. Over the last few decades, evidence has mounted as to the importance of localization elements in the mRNA sequence and their respective RNA-binding proteins. Live imaging methodologies have shown mechanistic details of this phenomenon. In this minireview, we focus on the advanced biochemical and cell imaging techniques used to tweeze out the finer aspects of mechanisms of mRNA movement.

| S-EPMC3711302 | biostudies-literature
