Unknown

Dataset Information

0

A reference library for Canadian invertebrates with 1.5 million barcodes, voucher specimens, and DNA samples.


ABSTRACT: The reliable taxonomic identification of organisms through DNA sequence data requires a well parameterized library of curated reference sequences. However, it is estimated that just 15% of described animal species are represented in public sequence repositories. To begin to address this deficiency, we provide DNA barcodes for 1,500,003 animal specimens collected from 23 terrestrial and aquatic ecozones at sites across Canada, a nation that comprises 7% of the planet's land surface. In total, 14 phyla, 43 classes, 163 orders, 1123 families, 6186 genera, and 64,264 Barcode Index Numbers (BINs; a proxy for species) are represented. Species-level taxonomy was available for 38% of the specimens, but higher proportions were assigned to a genus (69.5%) and a family (99.9%). Voucher specimens and DNA extracts are archived at the Centre for Biodiversity Genomics where they are available for further research. The corresponding sequence and taxonomic data can be accessed through the Barcode of Life Data System, GenBank, the Global Biodiversity Information Facility, and the Global Genome Biodiversity Network Data Portal.

SUBMITTER: deWaard JR 

PROVIDER: S-EPMC6897906 | biostudies-literature | 2019 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

A reference library for Canadian invertebrates with 1.5 million barcodes, voucher specimens, and DNA samples.

deWaard Jeremy R JR   Ratnasingham Sujeevan S   Zakharov Evgeny V EV   Borisenko Alex V AV   Steinke Dirk D   Telfer Angela C AC   Perez Kate H J KHJ   Sones Jayme E JE   Young Monica R MR   Levesque-Beaudin Valerie V   Sobel Crystal N CN   Abrahamyan Arusyak A   Bessonov Kyrylo K   Blagoev Gergin G   deWaard Stephanie L SL   Ho Chris C   Ivanova Natalia V NV   Layton Kara K S KKS   Lu Liuqiong L   Manjunath Ramya R   McKeown Jaclyn T A JTA   Milton Megan A MA   Miskie Renee R   Monkhouse Norm N   Naik Suresh S   Nikolova Nadya N   Pentinsaari Mikko M   Prosser Sean W J SWJ   Radulovici Adriana E AE   Steinke Claudia C   Warne Connor P CP   Hebert Paul D N PDN  

Scientific data 20191206 1


The reliable taxonomic identification of organisms through DNA sequence data requires a well parameterized library of curated reference sequences. However, it is estimated that just 15% of described animal species are represented in public sequence repositories. To begin to address this deficiency, we provide DNA barcodes for 1,500,003 animal specimens collected from 23 terrestrial and aquatic ecozones at sites across Canada, a nation that comprises 7% of the planet's land surface. In total, 14  ...[more]

Similar Datasets

| S-EPMC10290705 | biostudies-literature
| S-EPMC6711938 | biostudies-literature
| S-EPMC8758633 | biostudies-literature
| S-EPMC4414572 | biostudies-literature
| S-EPMC9864147 | biostudies-literature
| S-EPMC4355675 | biostudies-literature
| S-EPMC3278308 | biostudies-literature
| S-EPMC5117606 | biostudies-literature
| S-EPMC4971185 | biostudies-literature
| S-EPMC7380090 | biostudies-literature