Unknown

Dataset Information

0

De Novo Genome and Transcriptome Assembly of the Canadian Beaver (Castor canadensis).


ABSTRACT: The Canadian beaver (Castor canadensis) is the largest indigenous rodent in North America. We report a draft annotated assembly of the beaver genome, the first for a large rodent and the first mammalian genome assembled directly from uncorrected and moderate coverage (< 30 ×) long reads generated by single-molecule sequencing. The genome size is 2.7 Gb estimated by k-mer analysis. We assembled the beaver genome using the new Canu assembler optimized for noisy reads. The resulting assembly was refined using Pilon supported by short reads (80 ×) and checked for accuracy by congruency against an independent short read assembly. We scaffolded the assembly using the exon-gene models derived from 9805 full-length open reading frames (FL-ORFs) constructed from the beaver leukocyte and muscle transcriptomes. The final assembly comprised 22,515 contigs with an N50 of 278,680 bp and an N50-scaffold of 317,558 bp. Maximum contig and scaffold lengths were 3.3 and 4.2 Mb, respectively, with a combined scaffold length representing 92% of the estimated genome size. The completeness and accuracy of the scaffold assembly was demonstrated by the precise exon placement for 91.1% of the 9805 assembled FL-ORFs and 83.1% of the BUSCO (Benchmarking Universal Single-Copy Orthologs) gene set used to assess the quality of genome assemblies. Well-represented were genes involved in dentition and enamel deposition, defining characteristics of rodents with which the beaver is well-endowed. The study provides insights for genome assembly and an important genomics resource for Castoridae and rodent evolutionary biology.

SUBMITTER: Lok S 

PROVIDER: S-EPMC5295618 | biostudies-literature | 2017 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

<i>De Novo</i> Genome and Transcriptome Assembly of the Canadian Beaver (<i>Castor canadensis</i>).

Lok Si S   Paton Tara A TA   Wang Zhuozhi Z   Kaur Gaganjot G   Walker Susan S   Yuen Ryan K C RK   Sung Wilson W L WW   Whitney Joseph J   Buchanan Janet A JA   Trost Brett B   Singh Naina N   Apresto Beverly B   Chen Nan N   Coole Matthew M   Dawson Travis J TJ   Ho Karen K   Hu Zhizhou Z   Pullenayegum Sanjeev S   Samler Kozue K   Shipstone Arun A   Tsoi Fiona F   Wang Ting T   Pereira Sergio L SL   Rostami Pirooz P   Ryan Carol Ann CA   Tong Amy Hin Yan AH   Ng Karen K   Sundaravadanam Yogi Y   Simpson Jared T JT   Lim Burton K BK   Engstrom Mark D MD   Dutton Christopher J CJ   Kerr Kevin C R KC   Franke Maria M   Rapley William W   Wintle Richard F RF   Scherer Stephen W SW  

G3 (Bethesda, Md.) 20170209 2


The Canadian beaver (<i>Castor canadensis</i>) is the largest indigenous rodent in North America. We report a draft annotated assembly of the beaver genome, the first for a large rodent and the first mammalian genome assembled directly from uncorrected and moderate coverage (< 30 ×) long reads generated by single-molecule sequencing. The genome size is 2.7 Gb estimated by k-mer analysis. We assembled the beaver genome using the new Canu assembler optimized for noisy reads. The resulting assembly  ...[more]

Similar Datasets

| S-EPMC6456477 | biostudies-literature
| S-EPMC7014947 | biostudies-literature
2019-07-02 | E-MTAB-8038 | biostudies-arrayexpress
2017-12-24 | E-MTAB-6258 | biostudies-arrayexpress
| S-EPMC5831098 | biostudies-literature
| S-EPMC4881982 | biostudies-literature
| S-EPMC3746961 | biostudies-literature
| S-EPMC4914502 | biostudies-literature
| S-EPMC4440962 | biostudies-literature
| S-EPMC5742341 | biostudies-literature