Unknown

Dataset Information

0

The genome of the American groundhog, Marmota monax.


ABSTRACT: We sequenced the genome of the North American groundhog, Marmota monax, also known as the woodchuck. Our sequencing strategy included a combination of short, high-quality Illumina reads plus long reads generated by both Pacific Biosciences and Oxford Nanopore instruments. Assembly of the combined data produced a genome of 2.74 Gbp in total length, with an N50 contig size of 1,094,236 bp. To annotate the genome, we mapped the genes from another M. monax genome and from the closely related Alpine marmot, Marmota marmota, onto our assembly, resulting in 20,559 annotated protein-coding genes and 28,135 transcripts. The genome assembly and annotation are available in GenBank under BioProject PRJNA587092.

SUBMITTER: Puiu D 

PROVIDER: S-EPMC7682491 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

altmetric image

Publications

The genome of the American groundhog, <i>Marmota monax</i>.

Puiu Daniela D   Zimin Aleksey A   Shumate Alaina A   Ge Yuchen Y   Qiu Jiabin J   Bhaskaran Manoj M   Salzberg Steven L SL  

F1000Research 20200916


We sequenced the genome of the North American groundhog, <i>Marmota monax</i>, also known as the woodchuck. Our sequencing strategy included a combination of short, high-quality Illumina reads plus long reads generated by both Pacific Biosciences and Oxford Nanopore instruments. Assembly of the combined data produced a genome of 2.74 Gbp in total length, with an N50 contig size of 1,094,236 bp. To annotate the genome, we mapped the genes from another <i>M. monax</i> genome and from the closely r  ...[more]

Similar Datasets

| S-EPMC5091844 | biostudies-literature
| S-EPMC4610724 | biostudies-literature
| PRJNA155585 | ENA
| PRJNA155581 | ENA
| PRJNA281246 | ENA
| PRJNA162681 | ENA
| PRJNA984759 | ENA
| PRJNA291589 | ENA
| PRJNA85555 | ENA
| PRJNA290841 | ENA