Unknown

Dataset Information

0

A chromosome-scale genome assembly of cucumber (Cucumis sativus L.).


ABSTRACT: BACKGROUND:Accurate and complete reference genome assemblies are fundamental for biological research. Cucumber is an important vegetable crop and model system for sex determination and vascular biology. Low-coverage Sanger sequences and high-coverage short Illumina sequences have been used to assemble draft cucumber genomes, but the incompleteness and low quality of these genomes limit their use in comparative genomics and genetic research. A high-quality and complete cucumber genome assembly is therefore essential. FINDINGS:We assembled single-molecule real-time (SMRT) long reads to generate an improved cucumber reference genome. This version contains 174 contigs with a total length of 226.2 Mb and an N50 of 8.9 Mb, and provides 29.0 Mb more sequence data than previous versions. Using 10X Genomics and high-throughput chromosome conformation capture (Hi-C) data, 89 contigs (?211.0 Mb) were directly linked into 7 pseudo-chromosome sequences. The newly assembled regions show much higher guanine-cytosine or adenine-thymine content than found previously, which is likely to have been inaccessible to Illumina sequencing. The new assembly contains 1,374 full-length long terminal retrotransposons and 1,078 novel genes including 239 tandemly duplicated genes. For example, we found 4 tandemly duplicated tyrosylprotein sulfotransferases, in contrast to the single copy of the gene found previously and in most other plants. CONCLUSION:This high-quality genome presents novel features of the cucumber genome and will serve as a valuable resource for genetic research in cucumber and plant comparative genomics.

SUBMITTER: Li Q 

PROVIDER: S-EPMC6582320 | biostudies-literature | 2019 Jun

REPOSITORIES: biostudies-literature

altmetric image

Publications

A chromosome-scale genome assembly of cucumber (Cucumis sativus L.).

Li Qing Q   Li Hongbo H   Huang Wu W   Xu Yuanchao Y   Zhou Qian Q   Wang Shenhao S   Ruan Jue J   Huang Sanwen S   Zhang Zhonghua Z  

GigaScience 20190601 6


<h4>Background</h4>Accurate and complete reference genome assemblies are fundamental for biological research. Cucumber is an important vegetable crop and model system for sex determination and vascular biology. Low-coverage Sanger sequences and high-coverage short Illumina sequences have been used to assemble draft cucumber genomes, but the incompleteness and low quality of these genomes limit their use in comparative genomics and genetic research. A high-quality and complete cucumber genome ass  ...[more]

Similar Datasets

| S-EPMC7917098 | biostudies-literature
| S-EPMC3091718 | biostudies-literature
2015-05-01 | GSE57294 | GEO
| S-EPMC7148364 | biostudies-literature
| S-EPMC1163627 | biostudies-other
| S-EPMC7346245 | biostudies-literature
| S-EPMC3470563 | biostudies-literature
| S-EPMC6051622 | biostudies-literature
| S-EPMC5399834 | biostudies-literature
| S-EPMC3539955 | biostudies-literature