Project description:BACKGROUND:Chromosome evolution is an important driver of speciation and species evolution. Previous studies have detected chromosome rearrangement events among different Carnivora species using chromosome painting strategies. However, few of these studies have focused on chromosome evolution at a nucleotide resolution due to the limited availability of chromosome-level Carnivora genomes. Although the de novo genome assembly of the giant panda is available, current short read-based assemblies are limited to moderately sized scaffolds, making the study of chromosome evolution difficult. RESULTS:Here, we present a chromosome-level giant panda draft genome with a total size of 2.29 Gb. Based on the giant panda genome and published chromosome-level dog and cat genomes, we conduct six large-scale pairwise synteny alignments and identify evolutionary breakpoint regions. Interestingly, gene functional enrichment analysis shows that for all of the three Carnivora genomes, some genes located in evolutionary breakpoint regions are significantly enriched in pathways or terms related to sensory perception of smell. In addition, we find that the sweet receptor gene TAS1R2, which has been proven to be a pseudogene in the cat genome, is located in an evolutionary breakpoint region of the giant panda, suggesting that interchromosomal rearrangement may play a role in the cat TAS1R2 pseudogenization. CONCLUSIONS:We show that the combined strategies employed in this study can be used to generate efficient chromosome-level genome assemblies. Moreover, our comparative genomics analyses provide novel insights into Carnivora chromosome evolution, linking chromosome evolution to functional gene evolution.
Project description:Gynostemma pentaphyllum (Thunb.) Makino is an economically valuable medicinal plant belonging to the Cucurbitaceae family that produces the bioactive compound gypenoside. Despite several transcriptomes having been generated for G. pentaphyllum, a reference genome is still unavailable, which has limited the understanding of the gypenoside biosynthesis and regulatory mechanism. Here, we report a high-quality G. pentaphyllum genome with a total length of 582 Mb comprising 1,232 contigs and a scaffold N50 of 50.78 Mb. The G. pentaphyllum genome comprised 59.14% repetitive sequences and 25,285 protein-coding genes. Comparative genome analysis revealed that G. pentaphyllum was related to Siraitia grosvenorii, with an estimated divergence time dating to the Paleogene (∼48 million years ago). By combining transcriptome data from seven tissues, we reconstructed the gypenoside biosynthetic pathway and potential regulatory network using tissue-specific gene co-expression network analysis. Four UDP-glucuronosyltransferases (UGTs), belonging to the UGT85 subfamily and forming a gene cluster, were involved in catalyzing glycosylation in leaf-specific gypenoside biosynthesis. Furthermore, candidate biosynthetic genes and transcription factors involved in the gypenoside regulatory network were identified. The genetic information obtained in this study provides insights into gypenoside biosynthesis and lays the foundation for further exploration of the gypenoside regulatory mechanism.
Project description:The evolutionary history of horseshoe crabs, spanning approximately 500 million years, is characterized by remarkable morphological stasis and a low species diversity with only four extant species. Here we report a chromosome-level genome assembly for the mangrove horseshoe crab (Carcinoscorpius rotundicauda) using PacBio reads and Hi-C data. The assembly spans 1.67 Gb with contig N50 of 7.8 Mb and 98% of the genome assigned to 16 chromosomes. The genome contains five Hox clusters with 34 Hox genes, the highest number reported in any invertebrate. Detailed analysis of the genome provides evidence that suggests three rounds of whole-genome duplication (WGD), raising questions about the relationship between WGD and species radiation. Several gene families, particularly those involved in innate immunity, have undergone extensive tandem duplication. These expanded gene families may be important components of the innate immune system of horseshoe crabs, whose amebocyte lysate is a sensitive agent for detecting endotoxin contamination.
Project description:Abstract Fallopia multiflora (Thunb.) Harald, a vine belonging to the Polygonaceae family, is used in traditional medicine. The stilbenes contained in it have significant pharmacological activities in anti-oxidation and anti-aging. This study describes the assembly of the F. multiflora genome and presents its chromosome-level genome sequence containing 1.46 gigabases of data (with a contig N50 of 1.97 megabases), 1.44 gigabases of which was assigned to 11 pseudochromosomes. Comparative genomics confirmed that F. multiflora shared a whole-genome duplication event with Tartary buckwheat and then underwent different transposon evolution after separation. Combining genomics, transcriptomics, and metabolomics data to map a network of associated genes and metabolites, we identified two FmRS genes responsible for the catalysis of one molecule of p-coumaroyl-CoA and three molecules of malonyl-CoA to resveratrol in F. multiflora. These findings not only serve as the basis for revealing the stilbene biosynthetic pathway but will also contribute to the development of tools for increasing the production of bioactive stilbenes through molecular breeding in plants or metabolic engineering in microbes. Moreover, the reference genome of F. multiflora is a useful addition to the genomes of the Polygonaceae family.
Project description:BackgroundThe grass carp has great economic value and occupies an important evolutionary position. Genomic information regarding this species could help better understand its rapid growth rate as well as its unique body plan and environmental adaptation.ResultsWe assembled the chromosome-level grass carp genome using the PacBio sequencing and chromosome structure capture technique. The final genome assembly has a total length of 893.2 Mb with a contig N50 of 19.3 Mb and a scaffold N50 of 35.7 Mb. About 99.85% of the assembled contigs were anchored into 24 chromosomes. Based on the prediction, this genome contained 30,342 protein-coding genes and 43.26% repetitive sequences. Furthermore, we determined that the large genome size can be attributed to the DNA-mediated transposable elements which accounted for 58.9% of the repetitive sequences in grass carp. We identified that the grass carp has only 24 pairs of chromosomes due to the fusion of two ancestral chromosomes. Enrichment analyses of significantly expanded and positively selected genes reflected evolutionary adaptation of grass carp to the feeding habits. We also detected the loss of conserved non-coding regulatory elements associated with the development of the immune system, nervous system, and digestive system, which may be critical for grass carp herbivorous traits.ConclusionsThe high-quality reference genome reported here provides a valuable resource for the genetic improvement and molecular-guided breeding of the grass carp.
Project description:BackgroundThe highly eusocial stingless bees are crucial pollinators of native and agricultural ecosystems. Nevertheless, genomic studies within this bee tribe remain scarce. We present the genome assembly of the stingless bee Melipona bicolor. This bee is a remarkable exception to the typical single-queen colony structure, since in this species, multiple queens may coexist and share reproductive duties, resulting in genetically diverse colonies with weak kinship connections. As the only known genuinely polygynous bee, M. bicolor's genome provides a valuable resource for investigating sociality beyond kin selection.ResultsThe genome was assembled employing a hybrid approach combining short and long reads, resulting in 241 contigs spanning 259 Mb (N50 of 6.2 Mb and 97.5% complete BUSCOs). Comparative analyses shed light on some evolutionary aspects of stingless bee genomics, including multiple chromosomal rearrangements in Melipona. Additionally, we explored the evolution of venom genes in M. bicolor and other stingless bees, revealing that, apart from two genes, the conserved repertoire of venom components remains under purifying selection in this clade.ConclusionThis study advances our understanding of stingless bee genomics, contributing to the conservation efforts of these vital pollinators and offering insights into the evolutionary mechanisms driving their unique adaptations.
Project description:Gentiana dahurica Fisch. is a perennial herb of the family Gentianaceae. This species is used as a traditional Tibetan medicine because of its rich gentiopicroside constituents. Here, we generate a high-quality, chromosome-level genome of G. dahurica with a total length of 1,416.54 Mb. Comparative genomic analyses showed that G. dahurica shared one whole-genome duplication (WGD) event with Gelsemium sempervirens of the family Gelsemiaceaei and had one additional species-specific WGD after the ancient whole-genome triplication with other eudicots. Further transcriptome analyses identified numerous enzyme coding genes and the transcription factors related to gentiopicroside biosynthesis. A set of candidate cytochrome P450 genes were identified for being involved in biosynthetic shifts from swertiamarin to gentiopicroside. Both gene expressions and the contents measured by high-performance liquid chromatography indicated that the gentiopicrosides were mainly synthesized in the rhizomes with the highest contents. In addition, we found that two above-mentioned WGDs, contributed greatly to the identified candidate genes involving in gentiopicroside biosynthesis. The first reference genome of Gentianaceae we generated here will definitely accelerate evolutionary, ecological, and pharmaceutical studies of this family.
Project description:Pennisetum alopecuroides is an important forage grass resource, which plays a vital role in ecological environment improvement. Therefore, the acquisition of P. alopecuroides genome resources is conducive to the study of the adaptability of Pennisetum species in ecological remediation and forage breeding development. Here we assembled a P. alopecuroides cv. 'Liqiu' genome at the chromosome level with a size of approximately 845.71 Mb, contig N50 of 84.83Mb, and genome integrity of 99.13% as assessed by CEGMA. A total of 833.41-Mb sequences were mounted on nine chromosomes by Hi-C technology. In total, 60.66% of the repetitive sequences and 34,312 genes were predicted. The genomic evolution analysis showed that P. alopecuroides cv. 'Liqiu' was isolated from Setaria 7.53-13.80 million years ago and from Cenchrus 5.33-8.99 million years ago, respectively. The whole-genome event analysis showed that P. alopecuroides cv. 'Liqiu' underwent two whole-genome duplication (WGD) events in the evolution process, and the duplication events occurred at a similar time to that of Oryza sativa and Setaria viridis. The completion of the genome sequencing of P. alopecuroides cv. 'Liqiu' provides data support for mining high-quality genetic resources of P. alopecuroides and provides a theoretical basis for the origin and evolutionary characteristics of Pennisetum.
Project description:Japanese chestnut (Castanea crenata Sieb. et Zucc) is an economically and ecologically important chestnut species in East Asia. Here, we presented a high-quality chromosome-level reference genome of the Japanese chestnut cultivar 'Tsukuba' by combining Nanopore long reads and Hi-C sequencing. The final assembly has a size of 718.30 Mb and consists of 12 pseudochromosomes ranging from 41.03 to 92.03 Mb, with a BUSCO complete gene percentage of 97.6%. A total of 421.37 Mb repetitive sequences and 46,744 gene models encoding 46,463 proteins were predicted in the genome. Genome evolution analysis showed that Japanese chestnut is closely related to Chinese chestnut and these species shared a common ancestor ~6.5 million years ago. This high-quality Japanese chestnut genome represents an important resource for the chestnut genomics community and will improve our understanding of chestnut biology and evolution.
Project description:Potentilla anserina is a perennial stoloniferous plant with edible tuberous roots in Rosaceae, served as important food and medicine sources for Tibetans in the Qinghai-Tibetan Plateau (QTP), China, over thousands of years. However, a lack of genome information hindered the genetic study. Here, we presented a chromosome-level genome assembly using single-molecule long-read sequencing, and the Hi-C technique. The assembled genome was 454.28 Mb, containing 14 chromosomes, with contig N50 of 2.14 Mb. A total of 46,495 protein-coding genes, 169.74 Mb repeat regions, and 31.76 Kb non-coding RNA were predicted. P. anserina diverged from Potentilla micrantha ∼28.52 million years ago (Mya). Furthermore, P. anserina underwent a recent tetraploidization ∼6.4 Mya. The species-specific genes were enriched in Starch and sucrose metabolism and Galactose metabolism pathways. We identified the sub-genome structures of P. anserina, with A sub-genome was larger than B sub-genome and closer to P. micrantha phylogenetically. Despite lacking significant genome-wide expression dominance, the A sub-genome had higher homoeologous gene expression in shoot apical meristem, flower and tuberous root. The resistance genes was contracted in P. anserina genome. Key genes involved in starch biosynthesis were expanded and highly expressed in tuberous roots, which probably drives the tuber formation. The genomics and transcriptomics data generated in this study advance our understanding of the genomic landscape of P. anserina, and will accelerate genetic studies and breeding programs.