Project description:Rhododendron hybridum Hort. (Ericaceae) is an important ornamental species with striking continuous flowering feature. However, few genomic resources are currently available in this species, and the breeding programs were handicapped by the lack of basic genetic information. Here, we established a transcriptomic profiling study from four different tissues using RNA-Seq to gain insight on the functional genes and to isolate EST-SSR markers for breeding and conservation purposes. In total 38,050,296 high-quality sequence reads were obtained, and 56,120 unigenes (with N50 = 1,236bp) were assembled. Of which, 32,580 (58.05 %) and 8,788 (15.66 %) were annotated to GO and KEGG database, respectively. Additionally, 38,775 (69.09 %) and 37,409 (66.66 %) R. hybridum unigenes were aligned to the Arabidopsis thaliana and Oryza sativa genome, respectively. A total of 21,103 simple sequence repeat (SSR) motifs were identified in 15,050 contigs. Among them, dinucleotide repeats account for the largest proportion for 49.27%, followed by mono- (35.94%) and trinucleotide (21.5%). This study represents the first transcriptome data of R. hybridum and confirms that the transcriptome assembly data are a useful resource for EST-SSR loci development. Such vast sequence data and markers will be robust tools for genomic research and breeding of R. hybridum and related species.
2020-04-11 | GSE97630 | GEO
Project description:Perilla transcriptome analysis for developed EST-SSR markers
| PRJNA714787 | ENA
Project description:Development of EST-SSR markers in Schisandra chinensis using transcriptome sequencing
Project description:Transcriptome from high throughput sequencing-by-synthesis is a good resource of molecular markers. In this study, we present utility of massively parallel sequencing by synthesis for profiling the transcriptome of red pepper (Capsicum annuum L. TF68) by 454 GS-FLX pyrosequencing. Through the generation of approximately 30.63 megabases (Mb) of Expressed Sequence Tags (ESTs) data with the average length of 375 base pairs (bp), 9,818 contigs and 23,712 singletons were obtained by assembly. Using BLAST alignment against NCBI non-redundant and a UniProt protein database, 30% of the tentative consensus sequences were assigned to specific function annotation, while 24% returned alignments of unknown function, leaving up to 46% with no alignment. Functional classification using FunCat revealed that sequences with putative known function were distributed cross 18 categories. Furthermore, over 200 high quality single nucleotide discrepancies were discovered using the Bukang cDNA collection as a reference database. Moreover, 758 simple sequence repeat (SSR) motif loci were mined from over 600 contigs, from which 572 primer sets were designed. The SSR motifs corresponded to di- and tri- nucleotide motifs (27.03 and 61.92%, respectively). These molecular markers may be of great value for application in linkage mapping and association mapping research.