
Dataset Information


A Chromosome-Level Genome Assembly of the Spotted Scat (Scatophagus argus).


ABSTRACT: The spotted scat, Scatophagus argus, is a member of the family Scatophagidae found in Indo-Pacific coastal waters. It is an emerging commercial aquaculture species, particularly in East and Southeast Asia. In this study, the first chromosome-level genome of S. argus was constructed using PacBio and Hi-C sequencing technologies. The genome is 572.42 Mb, with a scaffold N50 of 24.67 Mb. Using Hi-C data, 563.28 Mb of sequence (98.67% of the genome) was anchored and oriented onto 24 chromosomes, ranging from 12.57 Mb to 30.38 Mb. The assembly is highly complete, containing 94.26% of conserved single-copy orthologues based on BUSCO analysis. A total of 24,256 protein-coding genes were predicted in the genome, and 96.30% of the predicted genes were functionally annotated. Evolutionary analysis showed that S. argus diverged from its common ancestor with the Japanese puffer (Takifugu rubripes) approximately 114.8 Ma. The chromosomes of S. argus showed significant correlation with T. rubripes chromosomes. A comparative genomic analysis identified 49 unique and 90 expanded gene families. These genomic resources provide a solid foundation for functional genomics studies to decipher the economic traits of this species.

SUBMITTER: Huang Y 

PROVIDER: S-EPMC8214404 | biostudies-literature | 2021 Jun

REPOSITORIES: biostudies-literature


Publications

A Chromosome-Level Genome Assembly of the Spotted Scat (Scatophagus argus).

Huang Yuanqing, Mustapha Umar Farouk, Huang Yang, Tian Changxu, Yang Wei, Chen Huapu, Deng Siping, Zhu Chunhua, Jiang Dongneng, Li Guangli

Genome Biology and Evolution, 2021 Jun 1 (issue 6)



Similar Datasets

| S-EPMC6940847 | biostudies-literature
| S-EPMC7073721 | biostudies-literature
| S-EPMC9403048 | biostudies-literature
| S-EPMC8001731 | biostudies-literature
| S-EPMC5114590 | biostudies-literature
| S-EPMC10607709 | biostudies-literature
| S-EPMC8909180 | biostudies-literature
| S-EPMC11773731 | biostudies-literature
| S-EPMC9732426 | biostudies-literature
| S-EPMC6145014 | biostudies-literature