Dataset Information

Identification of putative essential protein domains from high-density transposon insertion sequencing.

ABSTRACT: A first clue to gene function can be obtained by examining whether a gene is required for life in certain standard conditions, that is, whether a gene is essential. In bacteria, essential genes are usually identified by high-density transposon mutagenesis followed by sequencing of insertion sites (Tn-seq). These studies assign the term "essential" to whole genes rather than the protein domain sequences that encode the essential functions. However, genes can code for multiple protein domains that evolve their functions independently. Therefore, when essential genes code for more than one protein domain, only one of them could be essential. In this study, we defined this subset of genes as "essential domain-containing" (EDC) genes. Using a Tn-seq data set built-in Burkholderia cenocepacia K56-2, we developed an in silico pipeline to identify EDC genes and the essential protein domains they encode. We found forty candidate EDC genes and demonstrated growth defect phenotypes using CRISPR interference (CRISPRi). This analysis included two knockdowns of genes encoding the protein domains of unknown function DUF2213 and DUF4148. These putative essential domains are conserved in more than two hundred bacterial species, including human and plant pathogens. Together, our study suggests that essentiality should be assigned to individual protein domains rather than genes, contributing to a first functional characterization of protein domains of unknown function.

SUBMITTER: Rahman ASMZ

PROVIDER: S-EPMC8770471 | biostudies-literature | 2022 Jan

REPOSITORIES: biostudies-literature

ACCESS DATA

Publications

Identification of putative essential protein domains from high-density transposon insertion sequencing.

Rahman A S M Zisanur ASMZ Timmerman Lukas L Gallardo Flyn F Cardona Silvia T ST

Scientific reports 20220119 1

A first clue to gene function can be obtained by examining whether a gene is required for life in certain standard conditions, that is, whether a gene is essential. In bacteria, essential genes are usually identified by high-density transposon mutagenesis followed by sequencing of insertion sites (Tn-seq). These studies assign the term "essential" to whole genes rather than the protein domain sequences that encode the essential functions. However, genes can code for multiple protein domains that ...[more]

PMID: 35046497

Similar Datasets

Project description:Gene essentiality studies have been performed on numerous bacterial pathogens, but essential gene sets have been determined for only a few plant-associated bacteria. Pseudomonas protegens Pf-5 is a plant-commensal, biocontrol bacterium that can control disease-causing pathogens on a wide range of crops. Work on Pf-5 has mostly focused on secondary metabolism and biocontrol genes, but genome-wide approaches such as high-throughput transposon mutagenesis have not yet been used for this species. In this study, we generated a dense P. protegens Pf-5 transposon mutant library and used transposon-directed insertion site sequencing (TraDIS) to identify 446 genes essential for growth on rich media. Genes required for fundamental cellular machinery were enriched in the essential gene set, while genes related to nutrient biosynthesis, stress responses, and transport were underrepresented. The majority of Pf-5 essential genes were part of the P. protegens core genome. Comparison of the essential gene set of Pf-5 with those of two plant-associated pseudomonads, P. simiae and P. syringae, and the well-studied opportunistic human pathogen P. aeruginosa PA14 showed that the four species share a large number of essential genes, but each species also had uniquely essential genes. Comparison of the Pf-5 in silico-predicted and in vitro-determined essential gene sets highlighted the essential cellular functions that are over- and underestimated by each method. Expanding essentiality studies into bacteria with a range of lifestyles may improve our understanding of the biological processes important for bacterial survival and growth.IMPORTANCE Essential genes are those crucial for survival or normal growth rates in an organism. Essential gene sets have been identified in numerous bacterial pathogens but only a few plant-associated bacteria. Employing genome-wide approaches, such as transposon insertion sequencing, allows for the concurrent analyses of all genes of a bacterial species and rapid determination of essential gene sets. We have used transposon insertion sequencing to systematically analyze thousands of Pseudomonas protegens Pf-5 genes and gain insights into gene functions and interactions that are not readily available using traditional methods. Comparing Pf-5 essential genes with those of three other pseudomonads highlights how gene essentiality varies between closely related species.

Project description:Many plant-associated bacteria have the ability to positively affect plant growth and there is growing interest in utilizing such bacteria in agricultural settings to reduce reliance on pesticides and fertilizers. However, our capacity to utilize microbes in this way is currently limited due to patchy understanding of bacterial-plant interactions at a molecular level. Traditional methods of studying molecular interactions have sought to characterize the function of one gene at a time, but the slow pace of this work means the functions of the vast majority of bacterial genes remain unknown or poorly understood. New approaches to improve and speed up investigations into the functions of bacterial genes in agricultural systems will facilitate efforts to optimize microbial communities and develop microbe-based products. Techniques enabling high-throughput gene functional analysis, such as transposon insertion sequencing analyses, have great potential to be widely applied to determine key aspects of plant-bacterial interactions. Transposon insertion sequencing combines saturation transposon mutagenesis and high-throughput sequencing to simultaneously investigate the function of all the non-essential genes in a bacterial genome. This technique can be used for both in vitro and in vivo studies to identify genes involved in microbe-plant interactions, stress tolerance and pathogen virulence. The information provided by such investigations will rapidly accelerate the rate of bacterial gene functional determination and provide insights into the genes and pathways that underlie biotic interactions, metabolism, and survival of agriculturally relevant bacteria. This knowledge could be used to select the most appropriate plant growth promoting bacteria for a specific set of conditions, formulating crop inoculants, or developing crop protection products. This review provides an overview of transposon insertion sequencing, outlines how this approach has been applied to study plant-associated bacteria, and proposes new applications of these techniques for the benefit of agriculture.

Project description:Escherichia coli K1 strains are major causative agents of invasive disease of newborn infants. The age dependency of infection can be reproduced in neonatal rats. Colonization of the small intestine following oral administration of K1 bacteria leads rapidly to invasion of the blood circulation; bacteria that avoid capture by the mesenteric lymphatic system and evade antibacterial mechanisms in the blood may disseminate to cause organ-specific infections such as meningitis. Some E. coli K1 surface constituents, in particular the polysialic acid capsule, are known to contribute to invasive potential, but a comprehensive picture of the factors that determine the fully virulent phenotype has not emerged so far. We constructed a library and constituent sublibraries of ∼775,000 Tn5 transposon mutants of E. coli K1 strain A192PP and employed transposon-directed insertion site sequencing (TraDIS) to identify genes required for fitness for infection of 2-day-old rats. Transposon insertions were lacking in 357 genes following recovery on selective agar; these genes were considered essential for growth in nutrient-replete medium. Colonization of the midsection of the small intestine was facilitated by 167 E. coli K1 gene products. Restricted bacterial translocation across epithelial barriers precluded TraDIS analysis of gut-to-blood and blood-to-brain transits; 97 genes were required for survival in human serum. This study revealed that a large number of bacterial genes, many of which were not previously associated with systemic E. coli K1 infection, are required to realize full invasive potential.IMPORTANCEEscherichia coli K1 strains cause life-threatening infections in newborn infants. They are acquired from the mother at birth and colonize the small intestine, from where they invade the blood and central nervous system. It is difficult to obtain information from acutely ill patients that sheds light on physiological and bacterial factors determining invasive disease. Key aspects of naturally occurring age-dependent human infection can be reproduced in neonatal rats. Here, we employ transposon-directed insertion site sequencing to identify genes essential for the in vitro growth of E. coli K1 and genes that contribute to the colonization of susceptible rats. The presence of bottlenecks to invasion of the blood and cerebrospinal compartments precluded insertion site sequencing analysis, but we identified genes for survival in serum.

Dataset Information

Identification of putative essential protein domains from high-density transposon insertion sequencing.

Publications

Identification of putative essential protein domains from high-density transposon insertion sequencing.

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure

Tweets