Project description:Hypervariable regions V3-V5 of bacterial 16S rRNA genes. This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/
Project description:16S rRNA gene sequences are commonly analyzed for taxonomic and phylogenetic studies because they contain variable regions that can help distinguish different genera. However, intra-genus distinction using variable region homology is often impossible due to the high overall sequence identities among closely related species, even though some residues may be conserved within respective species. Using a computational method that included the allelic diversity within individual genomes, we discovered that certain Escherichia and Shigella species can be distinguished by a multi-allelic 16S rRNA variable region single nucleotide polymorphism (SNP). To evaluate the performance of 16S rRNAs with altered variable regions, we developed an in vivo system that measures the acceptance and distribution of variant 16S rRNAs into a large pool of natural versions supporting normal translation and growth. We found that 16S rRNAs containing evolutionarily disparate variable regions were underpopulated both in ribosomes and in active translation pools, even for an SNP. Overall, this study revealed that variable region sequences can substantially influence the performance of 16S rRNAs and that this biological constraint can be leveraged to justify refining taxonomic assignments of variable region sequence data. IMPORTANCE This study reevaluates the notion that 16S rRNA gene variable region sequences are uninformative for intra-genus classification and that single nucleotide variations within them have no consequence to strains that bear them. We demonstrated that the performance of 16S rRNAs in Escherichia coli can be negatively impacted by sequence changes in variable regions, even for single nucleotide changes that are native to closely related Escherichia and Shigella species; thus, biological performance is likely constraining the evolution of variable regions in bacteria. Further, the native nucleotide variations we tested occur in all strains of their respective species and across their multiple 16S rRNA gene copies, suggesting that these species evolved beyond what would be discerned from a consensus sequence comparison. Therefore, this work also reveals that the multiple 16S rRNA gene alleles found in most bacteria can provide more informative phylogenetic and taxonomic detail than a single reference allele.
Project description:Cells devote a significant effort toward the production of multiple modified nucleotides in rRNAs, which fine tune the ribosome function. Here, we report that two methyltransferases, RsmB and RsmF, are responsible for all four 5-methylcytidine (m(5)C) modifications in 16S rRNA of Thermus thermophilus. Like Escherichia coli RsmB, T. thermophilus RsmB produces m(5)C967. In contrast to E. coli RsmF, which introduces a single m(5)C1407 modification, T. thermophilus RsmF modifies three positions, generating m(5)C1400 and m(5)C1404 in addition to m(5)C1407. These three residues are clustered near the decoding site of the ribosome, but are situated in distinct structural contexts, suggesting a requirement for flexibility in the RsmF active site that is absent from the E. coli enzyme. Two of these residues, C1400 and C1404, are sufficiently buried in the mature ribosome structure so as to require extensive unfolding of the rRNA to be accessible to RsmF. In vitro, T. thermophilus RsmF methylates C1400, C1404, and C1407 in a 30S subunit substrate, but only C1400 and C1404 when naked 16S rRNA is the substrate. The multispecificity of T. thermophilus RsmF is potentially explained by three crystal structures of the enzyme in a complex with cofactor S-adenosyl-methionine at up to 1.3 A resolution. In addition to confirming the overall structural similarity to E. coli RsmF, these structures also reveal that key segments in the active site are likely to be dynamic in solution, thereby expanding substrate recognition by T. thermophilus RsmF.
Project description:Contaminated aquifer (Dusseldorf-Flinger, Germany) templates extracted from 5 sediment depths ranging between 6.4 and 8.4 m below ground and over 3 years of sampling were amplified for amplicon pyrosequencing using the primers Ba27f (5’-aga gtt tga tcm tgg ctc ag-3’) and Ba519r (5’- tat tac cgc ggc kgc tg-3’), extended as amplicon fusion primers with respective primer A or B adapters, key sequence and multiplex identifiers (MID) as recommended by 454/Roche. Amplicons were purified and pooled as specified by the manufacturer. Emulsion PCR (emPCR), purification of DNA-enriched beads and sequencing run were performed following protocols and using a 2nd generation pyrosequencer (454 GS FLX Titanium, Roche) as recommended by the developer. Quality filtering of the pyrosequencing reads was performed using the automatic amplicon pipeline of the GS Run Processor (Roche), with a slight modification concerning the valley filter (vfScanAllFlows false instead of TiOnly) to extract the sequences. Demultiplexed raw reads were furhter trimmed for quality and lenght (>250 bp). 15 samples examined in total from important plume zones of the aquifer sampled in Feb. 2006, Sep. 2008 and Jun. 2009 (5 every year of sampling).
Project description:Contaminated aquifer (Dusseldorf-Flinger, Germany) templates extracted from 5 sediment depths ranging between 6.4 and 8.4 m below ground and over 3 years of sampling were amplified for amplicon pyrosequencing using the primers Ba27f (5’-aga gtt tga tcm tgg ctc ag-3’) and Ba519r (5’- tat tac cgc ggc kgc tg-3’), extended as amplicon fusion primers with respective primer A or B adapters, key sequence and multiplex identifiers (MID) as recommended by 454/Roche. Amplicons were purified and pooled as specified by the manufacturer. Emulsion PCR (emPCR), purification of DNA-enriched beads and sequencing run were performed following protocols and using a 2nd generation pyrosequencer (454 GS FLX Titanium, Roche) as recommended by the developer. Quality filtering of the pyrosequencing reads was performed using the automatic amplicon pipeline of the GS Run Processor (Roche), with a slight modification concerning the valley filter (vfScanAllFlows false instead of TiOnly) to extract the sequences. Demultiplexed raw reads were furhter trimmed for quality and lenght (>250 bp).
Project description:Microbial functions in the host physiology are a result of co-evolution between microbial communities and their hosts. Here we show that cold exposure leads to marked shift of the microbiota composition, referred to as cold microbiota. Transplantation of the cold microbiota to germ-free mice is sufficient to increase the insulin sensitivity of the host, and enable complete tolerance to cold partly by promoting the white fat browning, leading to increased energy expenditure and fat loss. During prolonged cold however, the body weight loss is attenuated, caused by adaptive mechanisms maximising caloric uptake and increasing intestinal, villi and microvilli lengths. This increased absorptive surface is promoted by the cold microbiota - effect that can be diminished by co-transplanting the most downregulated bacterial strain from the Verrucomicrobia phylum, Akkermansia muciniphila, during the cold microbiota transfer. Our results demonstrate the microbiota as a key factor orchestrating the overall energy homeostasis during increased demand.
Project description:Sphingomonads comprise a physiologically versatile group many of which appear to be adapted to oligotrophic environments, but several also had features in their genomes indicative of host associations. In this study, the extent variability of the 16S-23S rDNA intergenic spacer (ITS) sequences of 14 ATCC reference sphingomonad strains and 23 isolates recovered from drinking water was investigated through PCR amplification and sequencing. Sequencing analysis of the 16S-23S rRNA gene ITS region revealed that the ITS sizes for all studied isolates varied between 415 and 849 bp, while their G+C content was 42.2-57.9 mol%. Five distinct ITS types were identified: ITS(none) (without tRNA genes), ITS(Ala(TGC)), ITS(Ala(TGC)+Ile(GAT)), ITS(Ile(GAT)+Ala(TGC)), and ITS (Ile(GAT)+Pseudo). All of the identified tRNA(Ala(TGC)) molecules consisted of 73 bases, and all of the tRNA(Ile(GAT)) molecules consisted of 74 bases. We also detected striking variability in the size of the ITS region among the various examined isolates. Highest variability was detected within the ITS-2. The importance of this study is that this is the first comparison of the 16S-23S rDNA ITS sequence similarities and tRNA genes from sphingomonads. Collectively the data obtained in this study revealed the heterogeneity and extent of variability within the ITS region compared to the 16S rRNA gene within closely related isolates. Sequence and length polymorphisms within the ITS region along with the ITS types (tRNA-containing or lacking and the type of tRNA) and ITS-2 size and sequence similarities allowed us to overcome the limitation we previously encountered in resolving closely related isolates based on the 16S rRNA gene sequence.
Project description:PCR-ribotyping, a typing method based on size variation in 16S-23S rRNA intergenic spacer region (ISR), has been used widely for molecular epidemiological investigations of C. difficile infections. In the present study, we describe the sequence diversity of ISRs from 43 C. difficile strains, representing different PCR-ribotypes and suggest homologous recombination as a possible mechanism driving the evolution of 16S-23S rRNA ISRs. ISRs of 45 different lengths (ranging from 185 bp to 564 bp) were found among 458 ISRs. All ISRs could be described with one of the 22 different structural groups defined by the presence or absence of different sequence modules; tRNAAla genes and different combinations of spacers of different lengths (33 bp, 53 bp or 20 bp) and 9 bp direct repeats separating the spacers. The ISR structural group, in most cases, coincided with the sequence length. ISRs that were of the same lengths had also very similar nucleotide sequence, suggesting that ISRs were not suitable for discriminating between different strains based only on the ISR sequence. Despite large variations in the length, the alignment of ISR sequences, based on the primary sequence and secondary structure information, revealed many conserved regions which were mainly involved in maturation of pre-rRNA. Phylogenetic analysis of the ISR alignment yielded strong evidence for intra- and inter-homologous recombination which could be one of the mechanisms driving the evolution of C. difficile 16S-23S ISRs. The modular structure of the ISR, the high sequence similarities of ISRs of the same sizes and the presence of homologous recombination also suggest that different copies of C. difficile 16S-23S rRNA ISR are evolving in concert.