Unknown

Dataset Information

0

Single-molecule long-read sequencing facilitates shrimp transcriptome research.


ABSTRACT: Although shrimp are of great economic importance, few full-length shrimp transcriptomes are available. Here, we used Pacific Biosciences single-molecule real-time (SMRT) long-read sequencing technology to generate transcripts from the Pacific white shrimp (Litopenaeus vannamei). We obtained 322,600 full-length non-chimeric reads, from which we generated 51,367 high-quality unique full-length transcripts. We corrected errors in the SMRT sequences by comparison with Illumina-produced short reads. We successfully annotated 81.72% of all unique SMRT transcripts against the NCBI non-redundant database, 58.63% against Swiss-Prot, 45.38% against Gene Ontology, 32.57% against Clusters of Orthologous Groups of proteins (COG), and 47.83% against Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Across all transcripts, we identified 3,958 long non-coding RNAs (lncRNAs) and 80,650 simple sequence repeats (SSRs). Our study provides a rich set of full-length cDNA sequences for L. vannamei, which will greatly facilitate shrimp transcriptome research.

SUBMITTER: Zeng D 

PROVIDER: S-EPMC6240054 | biostudies-literature | 2018 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Single-molecule long-read sequencing facilitates shrimp transcriptome research.

Zeng Digang D   Chen Xiuli X   Peng Jinxia J   Yang Chunling C   Peng Min M   Zhu Weilin W   Xie Daxiang D   He Pingping P   Wei Pinyuan P   Lin Yong Y   Zhao Yongzhen Y   Chen Xiaohan X  

Scientific reports 20181116 1


Although shrimp are of great economic importance, few full-length shrimp transcriptomes are available. Here, we used Pacific Biosciences single-molecule real-time (SMRT) long-read sequencing technology to generate transcripts from the Pacific white shrimp (Litopenaeus vannamei). We obtained 322,600 full-length non-chimeric reads, from which we generated 51,367 high-quality unique full-length transcripts. We corrected errors in the SMRT sequences by comparison with Illumina-produced short reads.  ...[more]

Similar Datasets

2022-05-31 | PXD031213 | Pride
| S-EPMC9435740 | biostudies-literature
| S-EPMC6627194 | biostudies-literature
| S-EPMC4931018 | biostudies-literature
| S-EPMC5550469 | biostudies-literature
| S-EPMC9626564 | biostudies-literature
| S-EPMC6105124 | biostudies-literature
| S-EPMC7174332 | biostudies-literature
| S-EPMC7942025 | biostudies-literature
| S-EPMC6711964 | biostudies-literature