Project description:<p><strong>BACKGROUND:</strong> Plants exhibit wide chemical diversity due to the production of specialized metabolites that function as pollinator attractants, defensive compounds, and signaling molecules. Lamiaceae (mints) are known for their chemodiversity and have been cultivated for use as culinary herbs, as well as sources of insect repellents, health-promoting compounds, and fragrance.</p><p><strong>FINDINGS:</strong> We report the chromosome-scale genome assembly of Callicarpa americana L. (American beautyberry), a species within the early-diverging Callicarpoideae clade of Lamiaceae, known for its metallic purple fruits and use as an insect repellent due to its production of terpenoids. Using long-read sequencing and Hi-C scaffolding, we generated a 506.1-Mb assembly spanning 17 pseudomolecules with N50 contig and N50 scaffold sizes of 7.5 and 29.0 Mb, respectively. In all, 32,164 genes were annotated, including 53 candidate terpene synthases and 47 putative clusters of specialized metabolite biosynthetic pathways. Our analyses revealed 3 putative whole-genome duplication events, which, together with local tandem duplications, contributed to gene family expansion of terpene synthases. Kolavenyl diphosphate is a gateway to many of the bioactive terpenoids in C. americana; experimental validation confirmed that CamTPS2 encodes kolavenyl diphosphate synthase. Syntenic analyses with Tectona grandis L. f. (teak), a member of the Tectonoideae clade of Lamiaceae known for exceptionally strong wood resistant to insects, revealed 963 collinear blocks and 21,297 C. americana syntelogs.</p><p><strong>CONCLUSIONS:</strong> Access to the C. americana genome provides a road map for rapid discovery of genes encoding plant-derived agrichemicals and a key resource for understanding the evolution of chemical diversity in Lamiaceae.</p>
Project description:Small RNAs (21-24 nt) are pivotal regulators of gene expression that guide both transcriptional and post-transcriptional silencing mechanisms in diverse eukaryotes, including most if not all plants. MicroRNAs (miRNAs) and short interfering RNAs (siRNAs) are the two major types, both of which have a demonstrated and important role in plant development, stress responses and pathogen resistance. In this work, we used a deep sequencing approach (Sequencing-By-Synthesis, or SBS) to develop sequence resources of small RNAs from Persea americana tissues (including leaves, flowers and fruit). The high depth of the resulting datasets enabled us to examine in detail critical small RNA features as size distribution, tissue-specific regulation and sequence conservation between different organs in this species. We also developed database resources and a dedicated website (http://smallrna.udel.edu/) with computational tools for allowing other users to identify new miRNAs or siRNAs involved in specific regulatory pathways, verify the degree of conservation of these sequences in other plant species and map small RNAs on genes or larger regions of the maize genome under study.