Project description:In monocots other than the cereals maize and rice, the repertoire and diversity of microRNAs (miRNAs) and the populations of phased, secondary, small interfering RNAs (phasiRNAs) are poorly characterized. To remedy this, we sequenced small RNAs (sRNAs) from vegetative and dissected inflorescence tissue in 28 phylogenetically diverse monocots and from several early-diverging angiosperm lineages, as well as publicly available data from 10 additional monocot species. We annotated miRNAs, siRNAs and phasiRNAs across the monocot phylogeny, identifying miRNAs apparently lost or gained in the grasses relative to other monocot families, as well as a number of tRNA fragments misannotated as miRNAs. Using our miRNA database cleaned of these misannotations, we identified conservation at the 8th, 9th, 19th and 3’ end positions that we hypothesize are signatures of selection for processing, targeting, or Argonaute sorting. We show that 21-nt reproductive phasiRNAs are far more numerous in grass genomes than other monocots. Based on sequenced monocot genomes and transcriptomes, DICER-LIKE 5 (DCL5), important to 24-nt phasiRNA biogenesis, likely originated via gene duplication before the diversification of the grasses. This curated database of phylogenetically diverse monocot miRNAs, siRNAs, and phasiRNAs is the largest collection to date, and should facilitate continued exploration of small RNA diversification in flowering plants.
Project description:Salvia is an important genus from the Lamiaceae with approximately 1000 species distributed globally. Several Salvia species are commercially important because of their medicinal and culinary properties. We report the construction of the first fingerprinting array for Salvia species enriched with polymorphic and divergent DNA sequences and demonstrate the potential of this array for fingerprinting several economically important members of this genus.