Unknown

Dataset Information

0

The draft genome sequence of the grove snail Cepaea nemoralis.


ABSTRACT: Studies on the shell color and banding polymorphism of the grove snail Cepaea nemoralis and the sister taxon Cepaea hortensis have provided compelling evidence for the fundamental role of natural selection in promoting and maintaining intraspecific variation. More recently, Cepaea has been the focus of citizen science projects on shell color evolution in relation to climate change and urbanization. C. nemoralis is particularly useful for studies on the genetics of shell polymorphism and the evolution of "supergenes," as well as evo-devo studies of shell biomineralization, because it is relatively easily maintained in captivity. However, an absence of genomic resources for C. nemoralis has generally hindered detailed genetic and molecular investigations. We therefore generated ∼23× coverage long-read data for the ∼3.5 Gb genome, and produced a draft assembly composed of 28,537 contigs with the N50 length of 333 kb. Genome completeness, estimated by BUSCO using the metazoa dataset, was 91%. Repetitive regions cover over 77% of the genome. A total of 43,519 protein-coding genes were predicted in the assembled genome, and 97.3% of these were functionally annotated from either sequence homology or protein signature searches. This first assembled and annotated genome sequence for a helicoid snail, a large group that includes edible species, agricultural pests, and parasite hosts, will be a core resource for identifying the loci that determine the shell polymorphism, as well as in a wide range of analyses in evolutionary and developmental biology, and snail biology in general.

SUBMITTER: Saenko SV 

PROVIDER: S-EPMC8022989 | biostudies-literature |

REPOSITORIES: biostudies-literature

Similar Datasets

| S-EPMC4023409 | biostudies-literature
| S-EPMC6629550 | biostudies-literature
| S-EPMC1690693 | biostudies-literature
| PRJNA184044 | ENA
| PRJNA377398 | ENA
| PRJNA730503 | ENA
| PRJNA241957 | ENA
| PRJNA183949 | ENA
| S-EPMC4956449 | biostudies-literature
| S-EPMC5607911 | biostudies-literature