Unknown

Dataset Information

0

De novo genome assembly of two tomato ancestors, Solanum pimpinellifolium and Solanum  lycopersicum var. cerasiforme, by long-read sequencing.


ABSTRACT: The ancestral tomato species are known to possess genes that are valuable for improving traits in breeding. Here, we aimed to construct high-quality de novo genome assemblies of Solanum pimpinellifolium 'LA1670' and S. lycopersicum var. cerasiforme 'LA1673', originating from Peru. The Pacific Biosciences (PacBio) long-read sequences with 110× and 104× coverages were assembled and polished to generate 244 and 202 contigs spanning 808.8 Mbp for 'LA1670' and 804.5 Mbp for 'LA1673', respectively. After chromosome-level scaffolding with reference guiding, 14 scaffold sequences corresponding to 12 tomato chromosomes and 2 unassigned sequences were constructed. High-quality genome assemblies were confirmed using the Benchmarking Universal Single-Copy Orthologs and long terminal repeat assembly index. The protein-coding sequences were then predicted, and their transcriptomes were confirmed. The de novo assembled genomes of S. pimpinellifolium and S. lycopersicum var. cerasiforme were predicted to have 71,945 and 75,230 protein-coding genes, including 29,629 and 29,185 non-redundant genes, respectively, as supported by the transcriptome analysis results. The chromosome-level genome assemblies coupled with transcriptome data sets of the two accessions would be valuable for gaining insights into tomato domestication and understanding genome-scale breeding.

SUBMITTER: Takei H 

PROVIDER: S-EPMC7934570 | biostudies-literature | 2021 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

De novo genome assembly of two tomato ancestors, Solanum pimpinellifolium and Solanum  lycopersicum var. cerasiforme, by long-read sequencing.

Takei Hitomi H   Shirasawa Kenta K   Kuwabara Kosuke K   Toyoda Atsushi A   Matsuzawa Yuma Y   Iioka Shinji S   Ariizumi Tohru T  

DNA research : an international journal for rapid publication of reports on genes and genomes 20210101 1


The ancestral tomato species are known to possess genes that are valuable for improving traits in breeding. Here, we aimed to construct high-quality de novo genome assemblies of Solanum pimpinellifolium 'LA1670' and S. lycopersicum var. cerasiforme 'LA1673', originating from Peru. The Pacific Biosciences (PacBio) long-read sequences with 110× and 104× coverages were assembled and polished to generate 244 and 202 contigs spanning 808.8 Mbp for 'LA1670' and 804.5 Mbp for 'LA1673', respectively. Af  ...[more]

Similar Datasets

| S-EPMC3411241 | biostudies-literature
| S-EPMC6286132 | biostudies-literature
| S-EPMC11246243 | biostudies-literature
| S-EPMC2657798 | biostudies-other
| S-EPMC7603519 | biostudies-literature
| PRJNA303052 | ENA
| PRJNA699750 | ENA
| PRJNA268123 | ENA
2019-09-10 | GSE125476 | GEO
| S-EPMC10748600 | biostudies-literature