Project description:Accurate description of a microbial community is an important first step in understanding the role of its components in ecosystem function. A method for surveying microbial communities termed Serial Analysis of Ribosomal DNA (SARD) is described here. Through a series of molecular cloning steps, short DNA sequence tags are recovered from the fifth variable (V5) region of the prokaryotic 16S rRNA gene from microbial communities. These tags are ligated to form concatemers comprised of 20-40 tags which are cloned and identified by DNA sequencing. Four agricultural soil samples were profiled with SARD to assess the method’s utility. A total of 37,008 SARD tags comprising 3,127 unique sequences were identified. Comparison of duplicate profiles from one soil genomic DNA preparation revealed the method was highly reproducible. The large numbers of singleton tags together with non-parametric richness estimates indicated a significant amount of sequence tag diversity remained undetected with this level of sampling. The abundance classes of the observed tags were scale-free and conformed to a power law distribution. Numerically, the majority of the total tags observed belonged to abundance classes that were each present at less than 1% of the community. Over 99% of the unique tags individually made up less than 1% of the community. Therefore, from either numerical or diversity standpoints, low abundant taxa comprised a significant proportion of the microbial communities examined and could potentially make a large contribution to ecosystem function. SARD may provide a means to explore the ecological role of these rare members of microbial communities in qualitative and quantitative terms. Keywords: SARD profiles, culture-independent study, microbial community survey, microbial census
Project description:Identifying all essential genomic components is critical for the assembly of minimal artificial life. In the genome-reduced bacterium Mycoplasma pneumoniae, we found, that small ORFs (smORFs; <100 residues), accounting for 10% of all ORFs, are the most frequently essential genomic components (53%). Essentiality of smORFs may be explained by their function as members of protein and/or DNA/RNA complexes. In larger proteins, essentiality applied to individual domains and not entire proteins, a notion we could confirm by expression of truncated domains. The fraction of essential non-coding RNAs non-overlapping with essential genes is 5% higher than of non-transcribed regions (0.9%), pointing to the important functions of the former. The data highlights the minimal genome (33%, 269410 bp of M. pneumoniae genome), with an unexpected hidden layer of smORFs with essential functions.