Dataset Information

A comparison across non-model animals suggests an optimal sequencing depth for de novo transcriptome assembly.

ABSTRACT: The lack of genomic resources can present challenges for studies of non-model organisms. Transcriptome sequencing offers an attractive method to gather information about genes and gene expression without the need for a reference genome. However, it is unclear what sequencing depth is adequate to assemble the transcriptome de novo for these purposes. We assembled transcriptomes of animals from six different phyla (Annelids, Arthropods, Chordates, Cnidarians, Ctenophores, and Molluscs) at regular increments of reads using Velvet/Oases and Trinity to determine how read count affects the assembly. This included an assembly of mouse heart reads because we could compare those against the reference genome that is available. We found qualitative differences in the assemblies of whole-animals versus tissues. With increasing reads, whole-animal assemblies show rapid increase of transcripts and discovery of conserved genes, while single-tissue assemblies show a slower discovery of conserved genes though the assembled transcripts were often longer. A deeper examination of the mouse assemblies shows that with more reads, assembly errors become more frequent but such errors can be mitigated with more stringent assembly parameters. These assembly trends suggest that representative assemblies are generated with as few as 20 million reads for tissue samples and 30 million reads for whole-animals for RNA-level coverage. These depths provide a good balance between coverage and noise. Beyond 60 million reads, the discovery of new genes is low and sequencing errors of highly-expressed genes are likely to accumulate. Finally, siphonophores (polymorphic Cnidarians) are an exception and possibly require alternate assembly strategies.
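
The abstract describes assembling each transcriptome "at regular increments of reads". The record does not say how those read subsets were drawn, so the following is only a minimal sketch of one plausible approach: shuffle the reads once, then take nested prefixes of increasing size so that each deeper subset contains all shallower ones. The function names (`parse_fastq`, `incremental_subsets`), the nesting strategy, and the fixed seed are assumptions for illustration, not the authors' pipeline. Paired-end handling and the Velvet/Oases or Trinity runs themselves are left out.

```python
import random
from typing import Dict, List, Sequence, Tuple

# One FASTQ record: (header, sequence, separator, quality).
FastqRecord = Tuple[str, str, str, str]


def parse_fastq(lines: Sequence[str]) -> List[FastqRecord]:
    """Group FASTQ text lines into 4-line records (hypothetical helper)."""
    cleaned = [line.rstrip("\n") for line in lines]
    if len(cleaned) % 4 != 0:
        raise ValueError("FASTQ input must contain a multiple of 4 lines")
    return [tuple(cleaned[i:i + 4]) for i in range(0, len(cleaned), 4)]


def incremental_subsets(
    records: Sequence[FastqRecord], step: int, seed: int = 0
) -> Dict[int, List[FastqRecord]]:
    """Return nested read subsets at depths step, 2*step, ... (assumed scheme).

    Reads are shuffled once with a fixed seed; each subset is a prefix of
    that shuffled order, so deeper subsets are supersets of shallower ones.
    """
    if step <= 0:
        raise ValueError("step must be a positive number of reads")
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)
    return {
        depth: shuffled[:depth]
        for depth in range(step, len(shuffled) + 1, step)
    }


if __name__ == "__main__":
    # Toy input: ten identical-length reads with distinct headers.
    toy_lines: List[str] = []
    for i in range(10):
        toy_lines += [f"@read{i}", "ACGT", "+", "IIII"]

    subsets = incremental_subsets(parse_fastq(toy_lines), step=3, seed=1)
    for depth in sorted(subsets):
        print(depth, [rec[0] for rec in subsets[depth]])
```

In a real analysis the step would be in the millions of reads, and each subset would be written to disk and passed to the assembler separately. Nesting the subsets is a design choice that makes changes between depths attributable to the added reads rather than to a fresh random draw.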

SUBMITTER: Francis WR

PROVIDER: S-EPMC3655071 | biostudies-literature | 2013 Mar

REPOSITORIES: biostudies-literature

Publications

A comparison across non-model animals suggests an optimal sequencing depth for de novo transcriptome assembly.

Francis, Warren R; Christianson, Lynne M; Kiko, Rainer; Powers, Meghan L; Shaner, Nathan C; Haddock, Steven H D

BMC Genomics, 2013-03-12

PMID: 23496952

Similar Datasets

Impact of sequencing depth and technology on de novo RNA-Seq assembly. | S-EPMC6651908 | biostudies-literature

Sequencing and de novo transcriptome assembly of Brachypodium sylvaticum (Poaceae). | S-EPMC4105277 | biostudies-literature

De Novo Sequencing and Assembly Analysis of the Pseudostellaria heterophylla Transcriptome. | S-EPMC5072632 | biostudies-literature

Sequencing and de novo assembly of a Dahlia hybrid cultivar transcriptome. | S-EPMC4101353 | biostudies-literature

Sequencing, de novo assembly and comparative analysis of Raphanus sativus transcriptome. | S-EPMC4428447 | biostudies-literature

Optimization of de novo transcriptome assembly from next-generation sequencing data. | S-EPMC2945192 | biostudies-literature

High-throughput sequencing and De Novo assembly of the Isatis indigotica transcriptome. | S-EPMC4178013 | biostudies-literature

Sequencing and De Novo Assembly of the Toxicodendron radicans (Poison Ivy) Transcriptome. | S-EPMC5704230 | biostudies-literature

Transcriptome Sequencing and De Novo Assembly of Golden Cuttlefish Sepia esculenta Hoyle. | S-EPMC5085775 | biostudies-literature

IDP-denovo: de novo transcriptome assembly and isoform annotation by hybrid sequencing. | S-EPMC6022631 | biostudies-literature