Project description:

Background
The genus Caragana encompasses multiple plant species of medicinal and ecological value. However, some Caragana species are morphologically very similar, which makes species identification within the genus based on morphological characteristics alone difficult. In this study, Illumina paired-end sequencing was used to investigate the organization and structure of the chloroplast genomes of Caragana tibetica and Caragana turkestanica, which were analyzed together with the previously published chloroplast genome sequences of 7 other Caragana species.

Results
The chloroplast genomes of C. tibetica and C. turkestanica were 128,433 bp and 129,453 bp long, respectively. Both genomes lack the inverted repeat, which places the two species in the inverted repeat loss clade (IRLC). They encode 110 and 111 genes, respectively (4/4 rRNA genes, 30/31 tRNA genes, and 76/76 protein-coding genes). Comparison of the chloroplast genomes of C. tibetica and C. turkestanica with those of the 7 other Caragana species revealed high overall sequence similarity. However, some divergence was observed in certain intergenic regions (matK-rbcL, psbD-psbM, and atpA-psbI, among others). Nucleotide diversity (π) analysis identified five highly variable regions: rps2-atpI, accD-psaI-ycf4, cemA-petA, psbN-psbH, and rpoA-rps11. Phylogenetic analysis showed that C. tibetica is sister to Caragana jubata, whereas the closest relative of C. turkestanica is Caragana arborescens.

Conclusions
This study provides useful information about the chloroplast genomes of C. tibetica and C. turkestanica, which will aid the identification and classification of Caragana species.
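
The nucleotide diversity (π) analysis mentioned in the Results can be illustrated with a minimal sketch. The description does not state which software, window size, or step size was used, so everything below is an assumption for illustration only: the function names (`pairwise_diff`, `nucleotide_diversity`, `sliding_pi`) are hypothetical, the input is assumed to be a set of already-aligned sequences of equal length, and π is taken as the average proportion of differing sites over all sequence pairs, skipping positions with a gap or N.

```python
from itertools import combinations


def pairwise_diff(a, b):
    """Proportion of differing sites between two aligned sequences.

    Positions where either sequence has a gap ('-') or 'N' are skipped
    (an assumption; other tools may treat gaps differently).
    """
    compared = 0
    diffs = 0
    for x, y in zip(a.upper(), b.upper()):
        if x in "-N" or y in "-N":
            continue
        compared += 1
        if x != y:
            diffs += 1
    return diffs / compared if compared else 0.0


def nucleotide_diversity(seqs):
    """Average pairwise difference per site (pi) over all sequence pairs."""
    pairs = list(combinations(seqs, 2))
    if not pairs:
        raise ValueError("need at least two sequences")
    return sum(pairwise_diff(a, b) for a, b in pairs) / len(pairs)


def sliding_pi(seqs, window, step):
    """Return (window_start, pi) tuples along an alignment.

    `window` and `step` are hypothetical parameters; the values used in
    the study are not given in the description.
    """
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned to the same length")
    results = []
    for start in range(0, length - window + 1, step):
        chunk = [s[start:start + window] for s in seqs]
        results.append((start, nucleotide_diversity(chunk)))
    return results


if __name__ == "__main__":
    # Toy alignment, not real Caragana data.
    toy = ["AAAA", "AAAT", "AATT"]
    print(nucleotide_diversity(toy))
    print(sliding_pi(toy, window=2, step=2))
```

Windows with the highest π values in such a scan correspond to candidate variable regions, which is how regions such as rps2-atpI or cemA-petA would stand out against the otherwise highly similar genomes.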