Project description: Background: Soapberry (Sapindus mukorossi) is an economically important multifunctional tree species. Triterpenoid saponins have many functions in soapberry. However, the types of uridine diphosphate (UDP) glucosyltransferases (UGTs) involved in the synthesis of triterpenoid saponins in soapberry have not been clarified. Results: In this study, 42 SmUGTs were identified in soapberry, which were unevenly distributed on 12 chromosomes and had sequence lengths of 450 bp to 1638 bp, with an average of 1388 bp. The number of amino acids in SmUGTs was 149 to 545, with an average of 462. Most SmUGTs were acidic and hydrophilic unstable proteins, and their secondary structures were mainly α-helices and random coils. All had conserved UDPGT and PSPG-box domains. Phylogenetic analysis divided them into four subclasses, which glycosylated different carbon atoms. Prediction of cis-acting elements suggested roles of SmUGTs in plant development and responses to environmental stresses. The expression patterns of SmUGTs differed according to the developmental stage of fruits, as determined by transcriptomics and RT-qPCR. Co-expression network analysis of SmUGTs and related genes/transcription factors in the triterpenoid saponin synthesis pathway was also performed. The results indicated potential roles for many transcription factors, such as SmERFs, SmGATAs and SmMYBs. A correlation analysis showed that 42 SmUGTs were crucial in saponin synthesis in soapberry. Conclusions: Our findings suggest optimal targets for manipulating glycosylation in soapberry triterpenoid saponin biosynthesis; they also provide a theoretical foundation for further evaluation of the functions of SmUGTs and analyses of their biosynthetic mechanisms.
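The reported sequence lengths and amino acid counts can be cross-checked with simple arithmetic. The sketch below assumes that each reported length is a complete coding sequence ending in a single stop codon; the description does not state this, and the function name `protein_length` is hypothetical.

```python
def protein_length(cds_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming it ends in exactly one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon is 3 bp; the final (stop) codon encodes no amino acid.
    return cds_bp // 3 - 1


shortest = protein_length(450)    # shortest reported sequence
longest = protein_length(1638)    # longest reported sequence

# The reported mean (1388 bp) is presumably rounded, so it is handled as a float.
mean_aa = 1388 / 3 - 1

print(shortest, longest, round(mean_aa))
```

Under that assumption the three values agree with the reported 149, 545, and 462 amino acids.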
Project description: Sapindus mukorossi is an environmentally friendly plant and renewable energy source whose fruit has been widely used for biomedicine, biodiesel, and biological chemicals due to its high saponin and oil contents. Here, we report the first chromosome-scale genome assembly of S. mukorossi (covering ~391 Mb with a scaffold N50 of 24.66 Mb) and characterize its genetic architecture and evolution by resequencing 104 S. mukorossi accessions. Population genetic analyses showed that genetic diversity in the southwestern distribution area was higher than that in the northeastern distribution area. Gene flow events indicated that the southwestern population may be the donor population for the distribution areas in China. Genome-wide selective sweep analysis showed that a large number of genes are involved in defense responses and in growth and development, including SmRPS2, SmRPS4, SmRPS7, SmNAC2, SmNAC23, SmNAC102, SmWRKY6, SmWRKY26, and SmWRKY33. We also identified several candidate genes controlling six agronomic traits by genome-wide association studies, including SmPCBP2, SmbHLH1, SmCSLD1, SmPP2C, SmLRR-RKs, and SmAHP. Our study not only provides a rich genomic resource for further basic research on Sapindaceae woody trees but also identifies several economically significant genes for genomics-enabled improvements in molecular breeding.
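For readers unfamiliar with the scaffold N50 statistic quoted above, the following is a minimal sketch of the standard definition: the length of the scaffold at which the running total, taken from longest to shortest, first reaches half of the assembly size. The scaffold lengths used here are invented toy values, not data from this assembly, and the function name `n50` is hypothetical.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold lengths."""
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first scaffold that brings the running sum to half the total is the N50.
        if running >= half_total:
            return length
    raise ValueError("no scaffold lengths given")


# Toy example (hypothetical lengths, arbitrary units).
toy_scaffolds = [40, 30, 20, 10]
print(n50(toy_scaffolds))
```

In the toy example the total is 100, so the running sum first reaches 50 at the second scaffold and the N50 is 30.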