Dataset Information

Optimization of miRNA-seq Data Pre-Processing

ABSTRACT: Next-generation sequencing is currently the platform of choice for the discovery and quantification of miRNAs. Despite this, there is no clear consensus on how the data should be pre-processed before downstream analyses. Data pre-processing is often overlooked, yet it is an essential step in data analysis: unreliable features and noise can affect the conclusions drawn downstream. Using a spike-in dilution study, we evaluated the effects of several general-purpose aligners (BWA, Bowtie, Bowtie 2 and Novoalign) and normalization methods (counts-per-million, total count scaling, upper quartile scaling, Trimmed Mean of M-values (TMM), DESeq, linear regression, cyclic loess and quantile) on the final miRNA count data distribution, variance, bias and accuracy of differential expression analysis.

ORGANISM(S): Homo sapiens

PROVIDER: GSE67074 | GEO | 2015/03/21

SECONDARY ACCESSION(S): PRJNA278977

REPOSITORIES: GEO
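
Two of the normalization methods named in the abstract, counts-per-million and upper quartile scaling, can be illustrated with a minimal sketch. This is not the study's code: the function names are hypothetical, and the upper quartile variant shown here (75th percentile of non-zero counts, rescaled by the mean quartile across samples) is one common formulation that is assumed for illustration.

```python
from typing import Dict, List


def counts_per_million(counts: List[int]) -> List[float]:
    """Scale one sample's raw counts so that they sum to one million."""
    library_size = sum(counts)
    if library_size == 0:
        raise ValueError("sample has no mapped reads")
    return [c * 1_000_000 / library_size for c in counts]


def upper_quartile(counts: List[int]) -> float:
    """75th percentile of the non-zero counts, using linear interpolation."""
    nonzero = sorted(c for c in counts if c > 0)
    if not nonzero:
        raise ValueError("sample has no non-zero counts")
    position = 0.75 * (len(nonzero) - 1)
    lower = int(position)
    upper = min(lower + 1, len(nonzero) - 1)
    fraction = position - lower
    return nonzero[lower] + fraction * (nonzero[upper] - nonzero[lower])


def upper_quartile_scale(samples: Dict[str, List[int]]) -> Dict[str, List[float]]:
    """Divide each sample by its upper quartile, then multiply by the mean
    upper quartile so the values stay on a count-like scale (assumed variant)."""
    quartiles = {name: upper_quartile(c) for name, c in samples.items()}
    mean_quartile = sum(quartiles.values()) / len(quartiles)
    return {
        name: [c * mean_quartile / quartiles[name] for c in counts]
        for name, counts in samples.items()
    }


# Toy data (made up): sample "b" is sample "a" sequenced at twice the depth.
toy = {"a": [10, 0, 30, 60], "b": [20, 0, 60, 120]}
cpm = {name: counts_per_million(c) for name, c in toy.items()}
uq = upper_quartile_scale(toy)
```

In this toy case both methods remove the depth difference, so the two samples end up with identical normalized profiles. Real miRNA data is rarely this well behaved, and a few highly abundant miRNAs can dominate the library size that counts-per-million divides by.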

Similar Datasets

E-MTAB-6783

Project description:Microarray data from YTHDF2-deficient pre-leukemic cells and control pre-leukemic cells

2018-09-17 | E-MTAB-6783 | ExpressionAtlas

Temporal gene expression across osteoblastogenesis.

Project description:Purpose: Osteoblast cells mature from a mesenchymal stem cell pool to become cells capable of forming bone matrix and mineralizing this matrix. The goal of this study was to characterize temporal changes in the transcriptome across osteoblast maturation, starting with the committed mesenchymal stem cell/early pre-osteoblast stage through to mature osteoblasts capable of matrix mineralization. Methods: Enriched populations of pre-osteoblast-like cells were obtained from neonatal calvaria from C57BL/6J mice expressing CFP under the control of the Col3.6 promoter. These cells were placed into culture for 4 days, removed from culture and subjected to FACS sorting based on the presence/absence of CFP expression. Cells expressing CFP were returned to culture and subjected to an osteoblast differentiation cocktail, and RNA was collected at 2, 4, 6, 8, 10, 12, 14, 16 and 18 days post differentiation. Methods II: mRNA profiles for each time point were generated by next-generation RNA sequencing, using an Illumina HiSeq 2000. Three technical replicates per sample were sequenced. Alignment for transcript abundance estimation was conducted using Bowtie version 0.12.9 with the NCBIm37 reference genome. Expression level per gene was calculated using RSEM version 1.2.0 with the parameters --fragment-length-mean 280 and --fragment-length-sd 50, and the expression level for each sample was normalized relative to the per-sample upper quartile. Gene expression in calvarial osteoblasts from neonatal C57BL/6J-Col3.6 CFP mice at 9 time points post differentiation

2014-06-23 | E-GEOD-54461 | biostudies-arrayexpress

Differentially regulated genes in TOR knockdown Arabidopsis lines

Project description:Rapamycin-sensitive transgenic Arabidopsis lines (BP12) expressing yeast FK506 Binding Protein12 (FKBP12) were developed. Inhibition of TOR in BP12 plants by rapamycin resulted in slower overall root, leaf and shoot growth and development, leading to poor nutrient uptake and light energy utilization. Genetic and physiological studies together with RNA-Seq and metabolite analysis of TOR-suppressed lines revealed that TOR regulates development and lifespan in Arabidopsis by restructuring cell growth, carbon and nitrogen metabolism, gene expression, ribosomal RNA and protein synthesis. Arabidopsis WT (Col) and BP12-2 (TOR knockdown line) seedlings at 15 DAG were treated with rapamycin for 3 days by transferring from 0.5 MS medium to 0.5 MS+10 ug/ml rapamycin. Triplicate samples of rapamycin-treated WT and BP12-2 seedlings were used for RNA-Seq analysis (Illumina HiSeq 2000). Paired-end alignments were obtained by aligning short reads onto the reference Arabidopsis genome (TAIR9) using Bowtie. More than 80% of the reads mapped onto the genome. Htseq-count was used to count the reads from the Bowtie-derived output files. Differentially expressed genes were identified using edgeR. The FDR-corrected P value threshold for differential expression was set to <=0.05.

2012-12-18 | E-GEOD-42968 | biostudies-arrayexpress

Neuronal proteome dynamics during homeostatic scaling.

Project description:Homeostatic scaling adjusts synaptic strength in response to persistent changes in neuronal network activity. This compensatory mechanism requires proteome remodeling accomplished via regulation of protein synthesis as well as degradation, but the global patterns of proteome remodeling and the underlying dynamics of individual proteins remain elusive. Here we used dynamic SILAC labeling in cultured hippocampal cells to identify proteins involved in homeostatic up- or down-scaling and to quantify their changes in synthesis and degradation as well as resulting changes in protein abundance or turnover. Our data demonstrate that a large fraction of the neuronal proteome is remodeled during homeostatic scaling. Most proteins were down-regulated by decreased synthesis or up-regulated by decreased degradation. Comparably fewer proteins showed increased synthesis or degradation rates. More than half of the quantified synaptic proteins were regulated, including pre- as well as postsynaptic proteins with diverse molecular functions.

2020-03-31 | PXD016004 | Pride

A practical evaluation of alignment algorithms for RNA variant calling analysis

Project description:We performed RNA-seq with ten pieces of breast cancer (invasive ductal carcinoma; luminal B type) tissue and three pieces of adjacent normal tissue from a single patient. These RNA-seq data were used to evaluate the performance of splice-aware aligners.

2018-02-06 | GSE110114 | GEO

Temporal gene expression across osteoblastogenesis.

2014-06-23 | GSE54461 | GEO

Transcriptome analysis of Bacillus subtilis NBRC 16449 grown on surface of boiled soybeans under the similar condition to production of Japanese traditional soybean-fermented food "natto"

Project description:Purpose: The goal of this study is to clarify the B. subtilis NBRC 16449 response to soybeans. Methods: B. subtilis NBRC 16449 cells were aerobically cultured in liquid LB, on LB solidified with agar, or on the surface of boiled soybeans to logarithmic growth phase. Total RNA was extracted from bacterial cells by the Hot-Phenol method. Samples for RNA-seq were prepared according to the Illumina protocol available from the manufacturer. The sequence reads that passed quality filters were analyzed at the transcript isoform level with bowtie v0.11.2. Results: Using an optimized data analysis workflow, we mapped around 15 million sequence reads per sample to the whole genome of B. subtilis BEST195 and identified 4271 transcripts in B. subtilis NBRC 16449 with the Bowtie aligner. Read count per genome was extracted from known gene annotations with the HTSeq program. Comparing the transcriptome of B. subtilis NBRC 16449 grown on LB solidified with agar with that of cells grown on the surface of boiled soybeans, about 5% of genes showed different expression levels.

2020-11-02 | GSE109523 | GEO

Quantitative Proteomics of miR-148a in gastric cancer cells

Project description:Quantitative proteomics combined with stable isotope labeling was applied to identify the global profile of miR-148a-regulated downstream proteins in AGS cancer cells. For proteomic analysis, cells were treated with miR-148a mimic (Pre-miR-148a) or miR-148a negative control (miR-CTL), and the downstream protein expression levels (Pre-miR-148a/miR-CTL) were quantified using the iTRAQ approach. Bioinformatics pipeline: The peak lists from the resulting MS/MS spectra were generated by Mascot Distiller v2.1.1.0 and searched using Mascot v2.2 against the International Protein Index (IPI) human database (v. 3.64, 84032 sequences). The Mascot search parameters were ±0.1 Da for MS tolerance, ±0.1 Da for MS/MS mass tolerance, allowance for two missed cleavages, and variable modifications of deamidation (NQ), oxidation (M), iTRAQ (N terminal), iTRAQ (K), and MMTS (C). Protein quantitation was calculated using the Multi-Q software v1.6.5.4 with a dynamic range filter of ion count > 30.

2013-07-24 | PXD000190 | Pride

Quantitative Analysis of Wild Type and Neat1 -/- Cerebral Frontal Cortex Transcriptomes

Project description:Purpose: The goal of this study is to elucidate the downstream effects of deleting the lncRNA Neat1 in the cerebral frontal cortex of adult mice by comparing next-generation sequencing-derived cortical transcriptome profiles (RNA-seq) between wild-type and Neat1 knockout mice. Methods: Brain mRNA profiles of 2-4 month-old wild-type (WT) and lncRNA Neat1 knockout (Neat1−/−) mice were generated by deep sequencing, using Illumina. Reads were mapped to the mm10 reference genome using TopHat (version 2.0.9) and Bowtie (version 2.1.0), with the default parameters. Known iGenomes Ensembl mm10 were quantified by HTSeq (version 0.6.0) in intersection-strict mode. A sample-by-gene read count matrix was generated for all samples by the Ensembl genes. Scaling normalization to remove composition biases in sequencing data was applied to log(CPM) (read Counts Per Million total reads) using the trimmed mean of M-values (TMM) method. Results: RNA-seq showed near-complete depletion of Neat1 RNA levels. 1359 genes were differentially expressed in the frontal cortex of Neat1-/- mice. 25 of these differentially expressed genes withstood multiple testing correction. Examination of RNA-seq data by principal component analysis showed two principal components that were mutually uncorrelated and orthogonal. Hierarchical cluster tree analysis showed that joined nodes from Neat1-/- samples were distanced from the control subset cluster, confirming the results of the PCA. Conclusions: Analyses of the differentially expressed gene signature from NEAT1-/- mice revealed a significant impact on processes related to oligodendrocyte differentiation and RNA post-transcriptional modification, with the underlying mechanisms involving Wnt signaling, cell contact interactions, and regulation of cholesterol/lipid metabolism.

2019-02-21 | GSE126814 | GEO

Human fetal yolk sac scRNA-seq data (sample ID: F158 for Haniffa Lab; 16099 for HDBR)

Project description:Investigating the blood, immune and stromal cells present in a human fetal embryo in a world-first single cell transcriptomic atlas. The embryo was dissected into 12 coronal sections, yolk sac, and yolk sac stalk. Live single cells were sorted, and the cell suspension then underwent 10x chromium 5 prime scRNA-seq. This accession contains the yolk sac and yolk sac stalk data from this embryo. A matched accession contains the coronal section data. Lane "WS_wEMB12142156" (from yolk sac) was excluded from downstream analysis due to low fraction reads in cells post-CellRanger QC. Termination procedure for this embryo was medical. The F158_[features...barcodes...matrix].[tsv...mtx].gz files attached to this accession represent raw count data from all the 10x lanes in this accession combined, and as output from CellRanger filtered matrices (CellRanger version 6.0.1 using human reference genome GRCh38-2020-A). One set of count matrices relates to the yolk sac data, and one set of count matrices relates to the yolk sac stalk data.

2022-12-07 | E-MTAB-11673 | biostudies-arrayexpress
