Unknown

Dataset Information

0

Reference genome of the nutrition-rich orphan crop chia (Salvia hispanica) and its implications for future breeding.


ABSTRACT: Chia (Salvia hispanica L.) is one of the most popular nutrition-rich foods and pseudocereal crops of the family Lamiaceae. Chia seeds are a rich source of proteins, polyunsaturated fatty acids (PUFAs), dietary fibers, and antioxidants. In this study, we present the assembly of the chia reference genome, which spans 303.6 Mb and encodes 48,090 annotated protein-coding genes. Our analysis revealed that ~42% of the chia genome harbors repetitive content, and identified ~3 million single nucleotide polymorphisms (SNPs) and 15,380 simple sequence repeat (SSR) marker sites. By investigating the chia transcriptome, we discovered that ~44% of the genes undergo alternative splicing with a higher frequency of intron retention events. Additionally, we identified chia genes associated with important nutrient content and quality traits, such as the biosynthesis of PUFAs and seed mucilage fiber (dietary fiber) polysaccharides. Notably, this is the first report of in-silico annotation of a plant genome for protein-derived small bioactive peptides (biopeptides) associated with improving human health. To facilitate further research and translational applications of this valuable orphan crop, we have developed the Salvia genomics database (SalviaGDB), accessible at https://salviagdb.org.

SUBMITTER: Gupta P 

PROVIDER: S-EPMC10757625 | biostudies-literature | 2023

REPOSITORIES: biostudies-literature

altmetric image

Publications

Reference genome of the nutrition-rich orphan crop chia (<i>Salvia hispanica</i>) and its implications for future breeding.

Gupta Parul P   Geniza Matthew M   Elser Justin J   Al-Bader Noor N   Baschieri Rachel R   Phillips Jeremy Levi JL   Haq Ebaad E   Preece Justin J   Naithani Sushma S   Jaiswal Pankaj P  

Frontiers in plant science 20231214


Chia (<i>Salvia hispanica L.</i>) is one of the most popular nutrition-rich foods and pseudocereal crops of the family Lamiaceae. Chia seeds are a rich source of proteins, polyunsaturated fatty acids (PUFAs), dietary fibers, and antioxidants. In this study, we present the assembly of the chia reference genome, which spans 303.6 Mb and encodes 48,090 annotated protein-coding genes. Our analysis revealed that ~42% of the chia genome harbors repetitive content, and identified ~3 million single nucl  ...[more]

Similar Datasets

| S-EPMC4189032 | biostudies-literature
| S-EPMC6611817 | biostudies-literature
| S-EPMC10406817 | biostudies-literature
| S-EPMC6994964 | biostudies-literature
| S-EPMC8877361 | biostudies-literature
| S-EPMC4395390 | biostudies-literature
2020-01-01 | E-MTAB-5515 | biostudies-arrayexpress
| S-EPMC7236935 | biostudies-literature
| PRJEB58694 | ENA
| S-EPMC11358080 | biostudies-literature