Unknown

Dataset Information

0

A high-quality chromosome-level genome assembly of the Chinese medaka Oryzias sinensis.


ABSTRACT: Oryzias sinensis, also known as Chinese medaka or Chinese ricefish, is a commonly used animal model for aquatic environmental assessment in the wild as well as gene function validation or toxicology research in the lab. Here, a high-quality chromosome-level genome assembly of O. sinensis was generated using single-tube long fragment read (stLFR) reads, Nanopore long-reads, and Hi-C sequencing data. The genome is 796.58 Mb, and a total of 712.17 Mb of the assembled sequences were anchored to 23 pseudo-chromosomes. A final set of 22,461 genes were annotated, with 98.67% being functionally annotated. The Benchmarking Universal Single-Copy Orthologs (BUSCO) benchmark of genome assembly and gene annotation reached 95.1% (93.3% single-copy) and 94.6% (91.7% single-copy), respectively. Furthermore, we also use ATAC-seq to uncover chromosome transposase-accessibility as well as related genome area function enrichment for Oryzias sinensis. This study offers a new improved foundation for future genomics research in Chinese medaka.

SUBMITTER: Dong Z 

PROVIDER: S-EPMC10978949 | biostudies-literature | 2024 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

A high-quality chromosome-level genome assembly of the Chinese medaka Oryzias sinensis.

Dong Zhongdian Z   Wang Jiangman J   Chen Guozhu G   Guo Yusong Y   Zhao Na N   Wang Zhongduo Z   Zhang Bo B  

Scientific data 20240328 1


Oryzias sinensis, also known as Chinese medaka or Chinese ricefish, is a commonly used animal model for aquatic environmental assessment in the wild as well as gene function validation or toxicology research in the lab. Here, a high-quality chromosome-level genome assembly of O. sinensis was generated using single-tube long fragment read (stLFR) reads, Nanopore long-reads, and Hi-C sequencing data. The genome is 796.58 Mb, and a total of 712.17 Mb of the assembled sequences were anchored to 23 p  ...[more]

Similar Datasets

| S-EPMC6877568 | biostudies-literature
2024-11-10 | GSE281536 | GEO
| S-EPMC7486949 | biostudies-literature
| S-EPMC10961820 | biostudies-literature
| S-EPMC8645875 | biostudies-literature
| S-EPMC11246460 | biostudies-literature
| S-EPMC11785749 | biostudies-literature
| S-EPMC9635629 | biostudies-literature
| S-EPMC10341846 | biostudies-literature
| S-EPMC7493909 | biostudies-literature