Project description:<p><strong>BACKGROUND:</strong> Plants exhibit wide chemical diversity due to the production of specialized metabolites that function as pollinator attractants, defensive compounds, and signaling molecules. Lamiaceae (mints) are known for their chemodiversity and have been cultivated for use as culinary herbs, as well as sources of insect repellents, health-promoting compounds, and fragrance.</p><p><strong>FINDINGS:</strong> We report the chromosome-scale genome assembly of Callicarpa americana L. (American beautyberry), a species within the early-diverging Callicarpoideae clade of Lamiaceae, known for its metallic purple fruits and use as an insect repellent due to its production of terpenoids. Using long-read sequencing and Hi-C scaffolding, we generated a 506.1-Mb assembly spanning 17 pseudomolecules with N50 contig and N50 scaffold sizes of 7.5 and 29.0 Mb, respectively. In all, 32,164 genes were annotated, including 53 candidate terpene synthases and 47 putative clusters of specialized metabolite biosynthetic pathways. Our analyses revealed 3 putative whole-genome duplication events, which, together with local tandem duplications, contributed to gene family expansion of terpene synthases. Kolavenyl diphosphate is a gateway to many of the bioactive terpenoids in C. americana; experimental validation confirmed that CamTPS2 encodes kolavenyl diphosphate synthase. Syntenic analyses with Tectona grandis L. f. (teak), a member of the Tectonoideae clade of Lamiaceae known for exceptionally strong wood resistant to insects, revealed 963 collinear blocks and 21,297 C. americana syntelogs.</p><p><strong>CONCLUSIONS:</strong> Access to the C. americana genome provides a road map for rapid discovery of genes encoding plant-derived agrichemicals and a key resource for understanding the evolution of chemical diversity in Lamiaceae.</p>
Project description:Background: Germ-free or axenic organisms are valuable tools for studying immunity, digestion, and development in different hosts. Although most of these studies have been conducted on mice, recently, germ-free invertebrate models (e.g. Drosophila and Apis) are used due to their easy husbandry, low cost for production, maintenance and the high number of individuals per generation they produce. However, a limitation of using these insects is the simple bacterial community present in their guts. The gut of the American cockroach Periplaneta americana displays a complex gut bacterial community composed of hundreds of species. Using P. americana, we developed a germ-free omnivorous invertebrate model to investigate how gut bacteria stimulate and shape normal gut development and metabolism. To determine if the insect host is directly affected by the presence of specific members of their bacterial community, gnotobiotic cockroaches were generated by inoculating a set of various P. americana gut-endemic Gram-negative (Bacteroidetes; n=11) and Gram-positive (Firmicutes; n=2) bacterial strains into germ-free insects. Additionally, we were able to recover the ‘normal’ bacterial-induced gut phenotype by co-housing germ-free cockroaches with wildtype P. americana to produce gut-bacteria conventionalized insects. Changes in gene expression profiles from two distinct regions (midgut and hindgut) of P. americana guts were quantified by RNA-Seq analysis of the germfree, gnotobiotic and conventionalized insects. Basic transcriptomics description: High-resolution transcriptome profiling of germ-free, gnotobiotic, and conventionalized treated P. americana midgut and hindguts. Ca. 43 million reads were obtained for each treatment. A de-novo assembly of all sequence reads was performed by Trinity assembler. Transcriptome assembly yielded 369,082 gene models and 554,155 isoforms. After running Trinotate pipeline, 65,047 (12 %) these transcripts matched an annotated product in at least one of the reference databases used (Uniprot, pfam, KEGG, COG). Additionally, 1,008 putative bacterial genes were annotated in the P. americana genome and ultimately excluded from these analyses. After bacteria decontamination, 553,147 assembled isoforms were used for transcript quantification and differential expression analysis using the DESeq2 pipeline. DESeq2 analysis detected 6,730 and 3,958 differentially expressed transcripts among the germ-free, gnotobiotic and conventionalized treatments in P. americana hindgut and midgut, respectively.