Project description:Spinacia is a genus of important leafy vegetable crops worldwide and includes cultivated Spinacia oleracea and two wild progenitors, Spinacia turkestanica and Spinacia tetrandra. However, the chloroplast genomes of the two wild progenitors remain unpublished, limiting our knowledge of chloroplast genome evolution among these three Spinacia species. Here, we reported the complete chloroplast genomes of S. oleracea, S. turkestanica, and S. tetrandra obtained via Illumina sequencing. The three chloroplast genomes exhibited a typical quadripartite structure and were 150,739, 150,747, and 150,680 bp in size, respectively. Only three variants were identified between S. oleracea and S. turkestanica, whereas 690 variants were obtained between S. oleracea and S. tetrandra, strongly demonstrating the close relationship between S. turkestanica and S. oleracea. This was further supported by phylogenetic analysis. We reported a comprehensive variant dataset including 503 SNPs and 83 Indels using 85 Spinacia accessions containing 61 S. oleracea, 16 S. turkestanica, and eight S. tetrandra accessions. Thirteen S. oleracea accessions were derived through introgression from S. turkestanica that acts as the maternal parent. Together, these results provide a valuable resource for spinach breeding programs and improve our understanding of the phylogenetic relationships within Amaranthaceae.
Project description:BackgroundThe orchids of the subtribe Coelogyninae are among the most morphologically diverse and economically important groups within the subfamily Epidendroideae. Previous molecular studies have revealed that Coelogyninae is an unambiguously monophyletic group. However, intergeneric and infrageneric relationships within Coelogyninae are largely unresolved. There has been long controversy over the classification among the genera within the subtribe.ResultsThe complete chloroplast (cp.) genomes of 15 species in the subtribe Coelogyninae were newly sequenced and assembled. Together with nine available cp. genomes in GenBank from representative clades of the subtribe, we compared and elucidated the characteristics of 24 Coelogyninae cp. genomes. The results showed that all cp. genomes shared highly conserved structure and contained 135 genes arranged in the same order, including 89 protein-coding genes, 38 tRNAs, and eight rRNAs. Nevertheless, structural variations in relation to particular genes at the IR/SC boundary regions were identified. The diversification pattern of the cp. genomes showed high consistency with the phylogenetic placement of Coelogyninae. The number of different types of SSRs and long repeats exhibited significant differences in the 24 Coelogyninae cp. genomes, wherein mononucleotide repeats (A/T), and palindromic repeats were the most abundant. Four mutation hotspot regions (ycf1a, ndhF-rp132, psaC-ndhE, and rp132-trnL) were determined, which could serve as effective molecular markers. Selection pressure analysis revealed that three genes (ycf1a, rpoC2 and ycf2 genes) might have experienced apparent positive selection during the evolution. Using the alignments of whole cp. genomes and protein-coding sequences, this study presents a well-resolved phylogenetic framework of Coelogyninae.ConclusionThe inclusion of 55 plastid genome data from a nearly complete generic-level sampling provide a comprehensive view of the phylogenetic relationships among genera and species in subtribe Coelogyninae and illustrate the diverse genetic variation patterns of plastid genomes in this species-rich plant group. The inferred relationships and informally recognized major clades within the subtribe are presented. The genetic markers identified here will facilitate future studies on the genetics and phylogeny of subtribe Coelogyninae.
Project description:BackgroundCapsicum (Solanaceae) is a globally important vegetable crop and is also used therapeutically in traditional medicine systems. However, little is known of the genetic variation within the commonly grown cultivars, the evolutionary relationships and differences in the chloroplast (cp.) genomes between Capsicum species remain unclear.ResultsThe cp. genomes of 32 Capsicum varieties in three species from 6 countries were investigated. The cp. genome of Capsicum was found to be ~ 156 kb in length and to contain 113 unique genes, of which 79 encoded proteins, 30 encoded transfer tRNAs, and 4 were for ribosomal RNAs. The 32 varieties that we chose for study represented 13 genotypes, containing a total of 608 indels, 83 SNPs, 47 SSRs and 281-306 repeat sequences. We then included several previously sequenced Capsicum cp. genomes, and found that the nine investigated species showed a number of differences in the characteristics of the four IR boundaries, and it was the non-coding regions that contained the most variable regions. We conducted a phylogenetic reconstruction using the cp. genomes of 43 representative species of Solanaceae, and the resulting phylogeny generally reflected the currently accepted classification, with the species of the pungent group having close relationship with one another.ConclusionsThis study provides a comprehensive analysis of Capsicum chloroplast genomes, revealing significant variations in IR boundaries and other genomic features. These findings enhance our understanding of Capsicum evolution and genetic diversity.
Project description:Lamiales, comprising over 23,755 species across 24 families, stands as a highly diverse and prolific plant group, playing a significant role in the cultivation of horticultural, ornamental, and medicinal plant varieties. Whole-genome duplication (WGD) and its subsequent post-polyploid diploidization (PPD) process represent the most drastic type of karyotype evolution, injecting significant potential for promoting the diversity of this lineage. However, polyploidization histories, as well as genome and subgenome fractionation following WGD events in Lamiales species, are still not well investigated. In this study, we constructed a chromosome-level genome assembly of Lindenbergia philippensis (Orobanchaceae) and conducted comparative genomic analyses with 14 other Lamiales species. L. philippensis is positioned closest to the parasitic lineage within Orobanchaceae and has a conserved karyotype. Through a combination of Ks analysis and syntenic depth analysis, we reconstructed and validated polyploidization histories of Lamiales species. Our results indicated that Primulina huaijiensis underwent three rounds of diploidization events following the γ-WGT event, rather than two rounds as reported. Besides, we reconfirmed that most Lamiales species shared a common diploidization event (L-WGD). Subsequently, we constructed the Lamiales Ancestral Karyotype (LAK), comprising 11 proto-chromosomes, and elucidated its evolutionary trajectory, highlighting the highly flexible reshuffling of the Lamiales paleogenome. We identified biased fractionation of subgenomes following the L-WGD event across eight species, and highlighted the positive impacts of non-WGD genes on gene family expansion. This study provides novel genomic resources and insights into polyploidy and karyotype remodeling of Lamiales species, essential for advancing our understanding of species diversification and genome evolution.
Project description:Bougainvillea (Nyctaginaceae) is a popular ornamental plant group primarily grown for its striking colorful bracts. However, despite its established horticultural value, limited genomic resources and molecular studies have been reported for this genus. Thus, to address this existing gap, complete chloroplast genomes of four species (Bougainvillea glabra, Bougainvillea peruviana, Bougainvilleapachyphylla, Bougainvillea praecox) and one Bougainvillea cultivar were sequenced and characterized. The Bougainvillea cp genomes range from 153,966 bp to 154,541 bp in length, comprising a large single-copy region (85,159 bp-85,708 bp) and a small single-copy region (18,014 bp-18,078 bp) separated by a pair of inverted repeats (25,377-25,427 bp). All sequenced plastomes have 131 annotated genes, including 86 protein-coding, eight rRNA, and 37 tRNA genes. These five newly sequenced Bougainvillea cp genomes were compared to the Bougainvillea spectabilis cp genome deposited in GeBank. The results showed that all cp genomes have highly similar structures, contents, and organization. They all exhibit quadripartite structures and all have the same numbers of genes and introns. Codon usage, RNA editing sites, and repeat analyses also revealed highly similar results for the six cp genomes. The amino acid leucine has the highest proportion and almost all favored synonymous codons have either an A or U ending. Likewise, out of the 42 predicted RNA sites, most conversions were from serine (S) to leucine (L). The majority of the simple sequence repeats detected were A/T mononucleotides, making the cp genomes A/T-rich. The contractions and expansions of the IR boundaries were very minimal as well, hence contributing very little to the differences in genome size. In addition, sequence variation analyses showed that Bougainvillea cp genomes share nearly identical genomic profiles though several potential barcodes, such as ycf1, ndhF, and rpoA were identified. Higher variation was observed in both B. peruviana and B.pachyphylla cp sequences based on SNPs and indels analysis. Phylogenetic reconstructions further showed that these two species appear to be the basal taxa of Bougainvillea. The rarely cultivated and wild species of Bougainvillea (B.pachyphylla, B. peruviana, B. praecox) diverged earlier than the commonly cultivated species and cultivar (B. spectabilis, B. glabra, B. cv.). Overall, the results of this study provide additional genetic resources that can aid in further phylogenetic and evolutionary studies in Bougainvillea. Moreover, genetic information from this study is potentially useful in identifying Bougainvillea species and cultivars, which is essential for both taxonomic and plant breeding studies.
Project description:BackgroundOat (Avena sativa L.) is a recognized health-food, and the contributions of its different candidate A-genome progenitor species remain inconclusive. Here, we report chloroplast genome sequences of eleven Avena species, to examine the plastome evolutionary dynamics and analyze phylogenetic relationships between oat and its congeneric wild related species.ResultsThe chloroplast genomes of eleven Avena species (size range of 135,889-135,998 bp) share quadripartite structure, comprising of a large single copy (LSC; 80,014-80,132 bp), a small single copy (SSC; 12,575-12,679 bp) and a pair of inverted repeats (IRs; 21,603-21,614 bp). The plastomes contain 131 genes including 84 protein-coding genes, eight ribosomal RNAs and 39 transfer RNAs. The nucleotide sequence diversities (Pi values) range from 0.0036 (rps19) to 0.0093 (rpl32) for ten most polymorphic genes and from 0.0084 (psbH-petB) to 0.0240 (petG-trnW-CCA) for ten most polymorphic intergenic regions. Gene selective pressure analysis shows that all protein-coding genes have been under purifying selection. The adjacent position relationships between tandem repeats, insertions/deletions and single nucleotide polymorphisms support the evolutionary importance of tandem repeats in causing plastome mutations in Avena. Phylogenomic analyses, based on the complete plastome sequences and the LSC intermolecular recombination sequences, support the monophyly of Avena with two clades in the genus.ConclusionsDiversification of Avena plastomes is explained by the presence of highly diverse genes and intergenic regions, LSC intermolecular recombination, and the co-occurrence of tandem repeat and indels or single nucleotide polymorphisms. The study demonstrates that the A-genome diploid-polyploid lineage maintains two subclades derived from different maternal ancestors, with A. longiglumis as the first diverging species in clade I. These genome resources will be helpful in elucidating the chloroplast genome structure, understanding the evolutionary dynamics at genus Avena and family Poaceae levels, and are potentially useful to exploit plastome variation in making hybrids for plant breeding.
Project description:Carotenoids, such as β-carotene, accumulate in chromoplasts of various fleshy fruits, awarding them with colors, aromas, and nutrients. The Orange (CmOr) gene controls β-carotene accumulation in melon fruit by posttranslationally enhancing carotenogenesis and repressing β-carotene turnover in chromoplasts. Carotenoid isomerase (CRTISO) isomerizes yellow prolycopene into red lycopene, a prerequisite for further metabolism into β-carotene. We comparatively analyzed the developing fruit transcriptomes of orange-colored melon and its two isogenic EMS-induced mutants, low-β (Cmor) and yofi (Cmcrtiso). The Cmor mutation in low-β caused a major transcriptomic change in the mature fruit. In contrast, the Cmcrtiso mutation in yofi significantly changed the transcriptome only in early fruit developmental stages. These findings indicate that melon fruit transcriptome is primarily altered by changes in carotenoid metabolic flux and plastid conversion, but minimally by carotenoid composition in the ripe fruit. Clustering of the differentially expressed genes into functional groups revealed an association between fruit carotenoid metabolic flux with the maintenance of the photosynthetic apparatus in fruit chloroplasts. Moreover, large numbers of thylakoid localized photosynthetic genes were differentially expressed in low-β. CmOR family proteins were found to physically interact with light-harvesting chlorophyll a-b binding proteins, suggesting a new role of CmOR for chloroplast maintenance in melon fruit. This study brings more insights into the cellular and metabolic processes associated with fruit carotenoid accumulation in melon fruit and reveals a new maintenance mechanism of the photosynthetic apparatus for plastid development.
Project description:Coffee is a high value agricultural commodity grown in about 80 countries. Sustainable coffee cultivation is hampered by multiple biotic and abiotic stress conditions predominantly driven by climate change. The NAC proteins are plants specific transcription factors associated with various physiological functions in plants which include cell division, secondary wall formation, formation of shoot apical meristem, leaf senescence, flowering embryo and seed development. Besides, they are also involved in biotic and abiotic stress regulation. Due to their ubiquitous influence, studies on NAC transcription factors have gained momentum in different crop plant species. In the present study, NAC25 like transcription factor was isolated and characterized from two cultivated coffee species, Coffea arabica and Coffea canephora and five Indian wild coffee species for the first time. The full-length NAC25 gene varied from 2,456 bp in Coffea jenkinsii to 2,493 bp in C. arabica. In all the seven coffee species, sequencing of the NAC25 gene revealed 3 exons and 2 introns. The NAC25 gene is characterized by a highly conserved 377 bp NAM domain (N-terminus) and a highly variable C terminus region. The sequence analysis revealed an average of one SNP per every 40.92 bp in the coding region and 37.7 bp in the intronic region. Further, the non-synonymous SNPs are 8-11 fold higher compared to synonymous SNPs in the non-coding and coding region of the NAC25 gene, respectively. The expression of NAC25 gene was studied in six different tissue types in C. canephora and higher expression levels were observed in leaf and flower tissues. Further, the relative expression of NAC25 in comparison with the GAPDH gene revealed four folds and eight folds increase in expression levels in green fruit and ripen fruit, respectively. The evolutionary relationship revealed the independent evolution of the NAC25 gene in coffee.
Project description:BackgroundChloroplast genome sequences are extremely informative about species-interrelationships owing to its non-meiotic and often uniparental inheritance over generations. The subject of our study, Fagopyrum esculentum, is a member of the family Polygonaceae belonging to the order Caryophyllales. An uncertainty remains regarding the affinity of Caryophyllales and the asterids that could be due to undersampling of the taxa. With that background, having access to the complete chloroplast genome sequence for Fagopyrum becomes quite pertinent.ResultsWe report the complete chloroplast genome sequence of a wild ancestor of cultivated buckwheat, Fagopyrum esculentum ssp. ancestrale. The sequence was rapidly determined using a previously described approach that utilized a PCR-based method and employed universal primers, designed on the scaffold of multiple sequence alignment of chloroplast genomes. The gene content and order in buckwheat chloroplast genome is similar to Spinacia oleracea. However, some unique structural differences exist: the presence of an intron in the rpl2 gene, a frameshift mutation in the rpl23 gene and extension of the inverted repeat region to include the ycf1 gene. Phylogenetic analysis of 61 protein-coding gene sequences from 44 complete plastid genomes provided strong support for the sister relationships of Caryophyllales (including Polygonaceae) to asterids. Further, our analysis also provided support for Amborella as sister to all other angiosperms, but interestingly, in the bayesian phylogeny inference based on first two codon positions Amborella united with Nymphaeales.ConclusionComparative genomics analyses revealed that the Fagopyrum chloroplast genome harbors the characteristic gene content and organization as has been described for several other chloroplast genomes. However, it has some unique structural features distinct from previously reported complete chloroplast genome sequences. Phylogenetic analysis of the dataset, including this new sequence from non-core Caryophyllales supports the sister relationship between Caryophyllales and asterids.
Project description:Determination and comparisons of complete mitochondrial genomes (mitogenomes) are important to understand the origin and evolution of mitochondria. Mitogenomes of unicellular protists are particularly informative in this regard because they are gene-rich and display high structural diversity. Ciliates are a highly diverse assemblage of protists and their mitogenomes (linear structure with high A+T content in general) were amongst the first from protists to be characterized and have provided important insights into mitogenome evolution. Here, we report novel mitogenome sequences from three representatives (Strombidium sp., Strombidium cf. sulcatum, and Halteria grandinella) in two dominant ciliate lineages. Comparative and phylogenetic analyses of newly sequenced and previously published ciliate mitogenomes were performed and revealed a number of important insights. We found that the mitogenomes of these three species are linear molecules capped with telomeric repeats that differ greatly among known species. The genomes studied here are highly syntenic, but larger in size and more gene-rich than those of other groups. They also all share an AT-rich tandem repeat region which may serve as the replication origin and modulate initiation of bidirectional transcription. More generally we identified a split version of ccmf, a cytochrome c maturation-related gene that might be a derived character uniting taxa in the subclasses Hypotrichia and Euplotia. Finally, our mitogenome comparisons and phylogenetic analyses support to reclassify Halteria grandinella from the subclass Oligotrichia to the subclass Hypotrichia. These results add to the growing literature on the unique features of ciliate mitogenomes, shedding light on the diversity and evolution of their linear molecular architecture.