
Dataset Information


A practical guide to build de-novo assemblies for single tissues of non-model organisms: the example of a Neotropical frog.


ABSTRACT: Whole genome sequencing (WGS) is a valuable resource for understanding the evolutionary history of poorly known species. However, in organisms with large genomes, such as most amphibians, WGS is still excessively challenging, and transcriptome sequencing (RNA-seq) represents a cost-effective tool to explore genome-wide variability. Non-model organisms do not usually have a reference genome, so the transcriptome must be assembled de-novo. We used RNA-seq to obtain the transcriptomic profile of Oreobates cruralis, a poorly known South American direct-developing frog. In total, 550,871 transcripts were assembled, corresponding to 422,999 putative genes. Of those, we identified 23,500, 37,349, 38,120 and 45,885 genes present in the Pfam, EggNOG, KEGG and GO databases, respectively. Interestingly, our results suggested that genes related to the immune system and defense mechanisms are abundant in the transcriptome of O. cruralis. We also present a pipeline to assist with pre-processing, assembling, evaluating and functionally annotating a de-novo transcriptome from RNA-seq data of non-model organisms. Our pipeline guides the inexperienced user in an intuitive way through all the steps needed to build de-novo transcriptome assemblies using readily available software, and is freely available at: https://github.com/biomendi/TRANSCRIPTOME-ASSEMBLY-PIPELINE/wiki.
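
To put the counts quoted in the abstract in proportion, the short Python sketch below computes the number of transcripts per putative gene and the share of putative genes with a hit in each database. The figures are copied from the abstract; using the 422,999 putative genes as the denominator is an assumption made here for illustration, and the names `annotation_fractions`, `TRANSCRIPTS`, `PUTATIVE_GENES` and `ANNOTATED` are hypothetical and not part of the authors' pipeline.

```python
# Counts quoted in the abstract.
TRANSCRIPTS = 550_871
PUTATIVE_GENES = 422_999
ANNOTATED = {"Pfam": 23_500, "EggNOG": 37_349, "KEGG": 38_120, "GO": 45_885}


def annotation_fractions(annotated, total_genes):
    """Return the share of putative genes with a hit in each database.

    Assumption: the annotated counts are subsets of the putative genes,
    so total_genes is the appropriate denominator.
    """
    return {db: n / total_genes for db, n in annotated.items()}


transcripts_per_gene = TRANSCRIPTS / PUTATIVE_GENES
fractions = annotation_fractions(ANNOTATED, PUTATIVE_GENES)

if __name__ == "__main__":
    print(f"transcripts per putative gene: {transcripts_per_gene:.2f}")
    for db, frac in fractions.items():
        print(f"{db}: {frac:.1%} of putative genes")
```

Under that assumption, only a minority of the putative genes receive an annotation in any single database, which is worth keeping in mind when reading the functional summary.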

SUBMITTER: Montero-Mendieta S 

PROVIDER: S-EPMC5582611 | biostudies-literature | 2017

REPOSITORIES: biostudies-literature


Publications

A practical guide to build de-novo assemblies for single tissues of non-model organisms: the example of a Neotropical frog.

Montero-Mendieta S, Grabherr M, Lantz H, De la Riva I, Leonard JA, Webster MT, Vilà C

PeerJ, 2017-09-01



Similar Datasets

| S-EPMC9713437 | biostudies-literature
| S-EPMC4489290 | biostudies-literature
| S-EPMC10146838 | biostudies-literature
| S-EPMC5148890 | biostudies-literature
| S-EPMC7496956 | biostudies-literature
| S-EPMC10994295 | biostudies-literature
| S-EPMC3146844 | biostudies-literature
| S-EPMC10868617 | biostudies-literature
| S-EPMC7169969 | biostudies-literature
| PRJEB76276 | ENA