Unknown

Dataset Information

0

De novo transcriptome assembly from inflorescence of Orchis italica: analysis of coding and non-coding transcripts.


ABSTRACT: The floral transcriptome of Orchis italica, a wild orchid species, was obtained using Illumina RNA-seq technology and specific de novo assembly and analysis tools. More than 100 million raw reads were processed resulting in 132,565 assembled transcripts and 86,079 unigenes with an average length of 606 bp and N50 of 956 bp. Functional annotation assigned 38,984 of the unigenes to records present in the NCBI non-redundant protein database, 32,161 of them to Gene Ontology terms, 15,775 of them to Eukaryotic Orthologous Groups (KOG) and 7,143 of them to Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. The in silico expression analysis based on the Fragments Per Kilobase of transcript per Million mapped reads (FPKM) was confirmed by real-time RT-PCR experiments on 10 selected unigenes, which showed high and statistically significant positive correlation with the RNA-seq based expression data. The prediction of putative long non-coding RNAs was assessed using two different software packages, CPC and Portrait, resulting in 7,779 unannotated unigenes that matched the threshold values for both of the analyses. Among the predicted long non-coding RNAs, one is the homologue of TAS3, a long non-coding RNA precursor of trans-acting small interfering RNAs (ta-siRNAs). The differential expression pattern observed for the selected putative long non-coding RNAs suggests their possible functional role in different floral tissues.

SUBMITTER: De Paolo S 

PROVIDER: S-EPMC4099010 | biostudies-literature | 2014

REPOSITORIES: biostudies-literature

altmetric image

Publications

De novo transcriptome assembly from inflorescence of Orchis italica: analysis of coding and non-coding transcripts.

De Paolo Sofia S   Salvemini Marco M   Gaudio Luciano L   Aceto Serena S  

PloS one 20140715 7


The floral transcriptome of Orchis italica, a wild orchid species, was obtained using Illumina RNA-seq technology and specific de novo assembly and analysis tools. More than 100 million raw reads were processed resulting in 132,565 assembled transcripts and 86,079 unigenes with an average length of 606 bp and N50 of 956 bp. Functional annotation assigned 38,984 of the unigenes to records present in the NCBI non-redundant protein database, 32,161 of them to Gene Ontology terms, 15,775 of them to  ...[more]

Similar Datasets

| S-EPMC4878839 | biostudies-literature
| S-EPMC6396907 | biostudies-literature
| S-EPMC4632031 | biostudies-literature
| S-EPMC5968133 | biostudies-literature
| S-EPMC6141965 | biostudies-literature
| S-EPMC5200869 | biostudies-literature
| S-EPMC4892416 | biostudies-literature
| S-EPMC3804621 | biostudies-literature
| S-EPMC5302821 | biostudies-literature
| S-EPMC3543268 | biostudies-literature