Dataset Information

A computational framework for identifying promoter sequences in non-model organisms using RNA-seq datasets

ABSTRACT: We developed a computational framework to discover short DNA sequences that confer strong expression in non-model organisms. The framework relies solely on whole-genome and RNA sequencing data, which are easily accessible to a variety of research groups. The framework proceeds in three main stages: 1) identification of a group of highly expressed loci that maintain high transcript counts across a broad range of experimental conditions, 2) extraction of the corresponding upstream candidate promoter regions of these highly expressed loci while minding nearby annotations and avoiding those that may reside in operons, and 3) application of the motif-finding algorithm in BioProspector to these upstream regions to predict the location and sequence of the -35 and -10 hexamers that drive the strong expression of these loci. Ultimately, we report sequences of 27-30 bases in length as candidate -35 and -10 signals for each of the top loci and create a consensus motif from these predictions. We apply our framework to 80 RNA-seq datasets collected for the methanotroph Methylotuvimicrobium buryatense 5GB1 and validate our predictions computationally and experimentally. The data deposited here represent all RNA-seq data that had not been published prior to this study.
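
As a rough illustration of stages 1 and 2 described in the abstract, the following minimal Python sketch selects loci that rank near the top in every condition and extracts a strand-aware upstream window. It is not the authors' implementation: the function names, the `top_fraction` cutoff, the "top in every condition" rule, and the 300-base window are all assumptions made for illustration, and the operon and annotation checks as well as the BioProspector step are omitted.

```python
# Hypothetical sketch of stages 1 and 2; names and thresholds are assumptions.

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def consistently_high_loci(tpm, top_fraction=0.03):
    """Return loci ranked in the top `top_fraction` in every condition.

    `tpm` maps locus ID -> list of expression values, one per condition.
    """
    loci = list(tpm)
    n_conditions = len(next(iter(tpm.values())))
    cutoff = max(1, int(len(loci) * top_fraction))
    keep = set(loci)
    for j in range(n_conditions):
        # Rank loci by expression in condition j, highest first.
        ranked = sorted(loci, key=lambda g: tpm[g][j], reverse=True)
        keep &= set(ranked[:cutoff])
    return sorted(keep)


def upstream_region(genome, start, end, strand, length=300):
    """Extract up to `length` bases upstream of a locus.

    Coordinates are 0-based and end-exclusive on the forward strand.
    For "-" strand loci the window lies after `end` and is returned
    reverse-complemented so it reads 5' to 3' toward the gene.
    """
    if strand == "+":
        return genome[max(0, start - length):start]
    return reverse_complement(genome[end:end + length])


if __name__ == "__main__":
    toy_tpm = {"a": [100, 90], "b": [5, 200], "c": [50, 60], "d": [1, 2]}
    print(consistently_high_loci(toy_tpm, top_fraction=0.5))
    print(upstream_region("AAAACCCCGGGGTTTT", 8, 12, "+", length=4))
```

The extracted windows would then be passed to a motif finder to search for the -35 and -10 hexamers; how that tool is invoked is deliberately left out here.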

ORGANISM(S): Methylotuvimicrobium buryatense

PROVIDER: GSE162089 | GEO | 2021/05/24

REPOSITORIES: GEO

Similar Datasets

A methanotrophic bacterium to enable direct methane capture for climate mitigation

Project description:We report here a methanotroph, Methylotuvimicrobium buryatense 5GB1C, that consumes methane at 500 ppm at rates several times higher than any previously published. Analyses of bioreactor-based performance and RNA-seq-based transcriptomics suggest that this superior ability to utilize low methane is based at least in part on an extremely low non-growth-associated maintenance energy and on a 5-fold higher methane specific affinity than previously reported.

2023-08-02 | GSE221011 | GEO

OXYGEN-LIMITED METABOLISM IN THE METHANOTROPH METHYLOMICROBIUM BURYATENSE 5GB1C

Project description:The bacteria that grow on methane aerobically (methanotrophs) support populations of non-methanotrophs in the natural environment by excreting methane-derived carbon. One group of excreted compounds is short-chain organic acids, generated in highest abundance when cultures are grown under O2-starvation. We examined this O2-starvation condition in the methanotroph Methylomicrobium buryatense 5GB1C. Under prolonged O2-starvation in a closed vial, this methanotroph increases the amount of acetate excreted about 10-fold, but the formate, lactate, and succinate excreted do not respond to this culture condition. In bioreactor cultures, the amount of each excreted product is similar across a range of growth rates and limiting substrates, including O2-limitation. A set of mutants was generated in genes predicted to be involved in generating or regulating excretion of these compounds and tested for growth defects and changes in excretion products. The phenotypes and associated metabolic flux modeling suggested that in M. buryatense 5GB1C, formate and acetate are excreted in response to redox imbalance, and the resulting metabolic state represents a combination of fermentation and respiration metabolism.

2017-08-24 | GSE101981 | GEO

Core metabolism shifts of methanol vs. methane growth in the methanotroph Methylomicrobium buryatense 5GB1

Project description:Methylomicrobium buryatense 5GB1 is an obligate methylotroph, which grows on methane or methanol with similar growth rates. Core metabolic pathways are similar on both substrates, but recent studies of methane metabolism suggest that growth on methanol might have significant differences from growth on methane. In this study, both a targeted metabolomics approach as well as a 13C tracer approach have been taken to understand core carbon metabolism in M. buryatense 5GB1 during methanol growth, to determine whether such differences occur. Targeted metabolomics analyses were performed on both methane and methanol cultures to identify metabolic nodes with altered fluxes. Several key metabolites showed significant differences in pool size. Noticeably, 2-keto-3-deoxy-6-phosphogluconate (KDPG) showed much larger pools under methanol culture, suggesting the Entner-Doudoroff (ED) pathway was more active. Intermediates in other parts of metabolism also showed differences in pool sizes under methanol growth. A systematic shift of active core metabolism is proposed to explain the changes. In order to distinguish flux partition differences at the C3-C4 node, 13C tracer analysis was also applied to methanol-grown cultures. Using the experimental results as constraints, we applied flux balance analysis to determine the metabolic flux phenotype of M. buryatense 5GB1 growing on methanol. The resulting new insights into core metabolism of this methanotroph provide an improved basis for future strain design.

2019-04-17 | GSE110541 | GEO

Determination of the transcription start sites of heterologous promoters in Actinoplanes sp. SE50/110 by 5'-end specific transcriptome sequencing

Project description:The promoter structure influences binding and clearance of RNA polymerase and therefore substantially influences expression of a gene. A promoter usually consists of a -10 and a -35-region, an extended -10-motif and A+T-rich upstream promoter elements. Most of these elements are optional, whereas the -10-region is essential (Albersmeier et al. 2017). Knowledge about the transcription start sites (TSS) of genes allows genome-wide localization and determination of the promoter regions. In our group, a special protocol for the amplification of primary transcripts was developed, including the capture of primary transcripts, rewriting them into cDNA (complementary DNA) and amplification in the further course of the protocol (Pfeifer-Sancar et al. 2013). Here, TSS were manually determined with special regard to the heterologous promoters. For each construct, at least one and up to three different TSS were found, leading to the identification of one or several -10-core-hexamers. These were located mostly 6 to 7 nucleotides upstream of each TSS, which corresponds to the average distance of 6.4 nt described for Actinoplanes sp. SE50/110 by Schwientek et al. (2014).

2019-10-26 | E-MTAB-8433 | biostudies-arrayexpress

Methylotuvimicrobium buryatense

Project description:OXYGEN-LIMITED METABOLISM IN THE METHANOTROPH METHYLOMICROBIUM BURYATENSE 5GB1C

| PRJNA396065 | ENA

Core metabolism shifts of methanol vs. methane growth in the methanotroph Methylomicrobium buryatense 5GB1

Project description:Core metabolism shifts of methanol vs. methane growth in the methanotroph Methylomicrobium buryatense 5GB1

| PRJNA433951 | ENA

Transcription profiling of Mycobacterium tuberculosis wild type and Rv3133c/DosR mutants grown in hypoxic conditions

Project description:Unlike many pathogens that are overtly harmful to their hosts, Mycobacterium tuberculosis can persist for years within humans in a clinically latent state. Latency is often linked to hypoxic conditions within the host. Among M. tuberculosis genes induced by hypoxia is a putative transcription factor, Rv3133c/DosR. We performed targeted disruption of this locus followed by transcriptome analysis of wild-type and mutant bacilli. Nearly all the genes powerfully regulated by hypoxia require Rv3133c/DosR for their induction. Computer analysis identified a consensus motif, a variant of which is located upstream of nearly all M. tuberculosis genes rapidly induced by hypoxia. Further, Rv3133c/DosR binds to the two copies of this motif upstream of the hypoxic response gene alpha-crystallin. Mutations within the binding sites abolish both Rv3133c/DosR binding as well as hypoxic induction of a downstream reporter gene. Also, mutation experiments with Rv3133c/DosR confirmed sequence-based predictions that the C-terminus is responsible for DNA binding and that the aspartate at position 54 is essential for function. Together, these results demonstrate that Rv3133c/DosR is a transcription factor of the two-component response regulator class, and that it is the primary mediator of a hypoxic signal within M. tuberculosis.

2007-09-29 | E-SMDB-4099 | biostudies-arrayexpress

A genome-wide map of CTCF multivalency redefines the CTCF code

Project description:The “CTCF code” hypothesis posits that CTCF pleiotropic functions are driven by recognition of diverse DNA sequences through combinatorial use of its 11 zinc fingers (ZFs). This model, however, is supported by in vitro binding studies of a limited number of sequences. To directly test CTCF multivalency in vivo, we here define ZF binding requirements at ~50,000 genomic sites in primary lymphocytes. We find that CTCF reads sequence diversity through ZF clustering. ZFs4-7 anchor CTCF to ~80% of targets containing the 20 bp core motif. Non-conserved flanking sequences are recognized by ZFs1-2 and ZFs8-11 clusters, which also stabilize CTCF broadly. Alternatively, CTCF employs ZFs9-11 to associate with a second phylogenetically conserved upstream motif at ~15% of its sites. Individually, ZFs increase overall binding affinity and chromatin residence time. Unexpectedly, we also uncover a conserved downstream DNA motif that destabilizes CTCF occupancy. CTCF thus associates with a wide array of DNA modules via combinatorial clustering of its 11 ZFs.

2011-11-19 | GSE33819 | GEO

De novo inference of thermodynamic binding energies using deep learning models of in vivo transcription factor binding

Project description:We introduce Affinity Distillation (AD), a method for extracting thermodynamic affinities de-novo from in-vivo immunoprecipitation experiments using deep learning. We show that neural networks modeling base-resolution in-vivo binding profiles of yeast and mammalian TFs can accurately predict energetic impacts of varying underlying DNA sequence on TF binding. Systematic comparisons between Affinity Distillation predictions and other predictive algorithms consistently show that Affinity Distillation more accurately predicts affinities across a wide range of TF structural classes and DNA sequences. Affinity Distillation relies on in-silico marginalization against many sequence backgrounds, resulting in a higher dynamic range and more accurate predictions than motif discovery algorithms. Moreover, we show that Affinity Distillation can learn differential paralog-specific affinities, thereby making it possible to more accurately reconstruct regulatory networks in cells.

2022-06-29 | GSE207001 | GEO

Identification of AflR binding sites in the genome of Aspergillus flavus by ChIP-seq

Project description:We report here the AflR binding motif of Aspergillus flavus for the first time with the aid of ChIP-seq analysis. Of the 540 peak sequences associated with AflR binding events, 66.8% were located within 2 kb upstream (promoter region) of translational start sites. The identified 18-bp binding motif was a perfect palindromic sequence, 5′-CSSGGGWTCGAWCCCSSG-3′, with S representing G or C and W representing A or T. On closer examination, we hypothesized that the 18-bp motif sequence identified contained two identical parts (here called motif A and motif B). Motif A was in positions 8–18 on the upper strand, while motif B was in positions 11–1 on the bottom strand. The inferred length and sequence of the putative motif identified in A. flavus were similar to previous findings in A. parasiticus and A. nidulans. Gene ontology analysis indicated that AflR bound to other genes outside the aflatoxin biosynthetic gene cluster.

2020-05-02 | GSE149696 | GEO
