Dataset Information

Long-read transcriptome data for improved gene prediction in Lentinula edodes.


ABSTRACT: Lentinula edodes is one of the most popular edible mushrooms in the world and contains useful medicinal components such as lentinan. The whole-genome sequence of L. edodes has been determined with the objective of discovering candidate genes associated with agronomic traits, but experimental verification of gene models with correction of gene prediction errors is lacking. To improve the accuracy of gene prediction, we produced 12.6 Gb of long-read transcriptome data of variable lengths using PacBio single-molecule real-time (SMRT) sequencing and generated 36,946 transcript clusters with an average length of 2.2 kb. Evidence-driven gene prediction on the basis of long- and short-read RNA sequencing data was performed; a total of 16,610 protein-coding genes were predicted with error correction. Of the predicted genes, 42.2% were verified to be covered by full-length transcript clusters. The raw reads have been deposited in the NCBI SRA database under accession number PRJNA396788.
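The percentages and averages quoted in the abstract can be turned into approximate absolute figures. The short Python sketch below does only that arithmetic; the variable names are illustrative and not part of the dataset, and because the source values are rounded (42.2%, 2.2 kb), the results are estimates rather than reported counts.

```python
# Figures quoted in the abstract (rounded values as published).
PREDICTED_GENES = 16_610          # protein-coding genes predicted
FULL_LENGTH_FRACTION = 0.422      # share covered by full-length transcript clusters
TRANSCRIPT_CLUSTERS = 36_946      # transcript clusters generated
MEAN_CLUSTER_LENGTH_KB = 2.2      # average cluster length in kb

# Approximate number of predicted genes covered by full-length clusters.
covered_genes = round(PREDICTED_GENES * FULL_LENGTH_FRACTION)

# Approximate total sequence represented by the clusters, in Mb.
total_cluster_mb = TRANSCRIPT_CLUSTERS * MEAN_CLUSTER_LENGTH_KB / 1000

print(f"Genes covered by full-length clusters: about {covered_genes}")
print(f"Total transcript cluster sequence: about {total_cluster_mb:.1f} Mb")
```

Under these assumptions, roughly 7,000 of the predicted genes are supported by full-length transcript clusters, and the clusters together represent on the order of 81 Mb of sequence.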

SUBMITTER: Park SG 

PROVIDER: S-EPMC5961913 | biostudies-literature | 2017 Dec

REPOSITORIES: biostudies-literature

Publications

Long-read transcriptome data for improved gene prediction in <i>Lentinula edodes</i>.

Park Sin-Gi, Yoo Seung Il, Ryu Dong Sung, Lee Hyunsung, Ahn Yong Ju, Ryu Hojin, Ko Junsu, Hong Chang Pyo

Data in Brief, 2017-09-27


Similar Datasets

| S-EPMC5411510 | biostudies-literature
| S-EPMC8110776 | biostudies-literature
2005-01-20 | GSE2167 | GEO
2006-02-09 | GSE4202 | GEO
| S-EPMC7100940 | biostudies-literature
| S-EPMC6368761 | biostudies-literature
2005-01-19 | E-GEOD-2167 | biostudies-arrayexpress
2006-02-08 | E-GEOD-4202 | biostudies-arrayexpress
2010-11-24 | GSE25463 | GEO
| S-EPMC10057243 | biostudies-literature