
Dataset Information


Highly contiguous assemblies of 101 drosophilid genomes.


ABSTRACT: Over 100 years of studies in Drosophila melanogaster and related species in the genus Drosophila have facilitated key discoveries in genetics, genomics, and evolution. While high-quality genome assemblies exist for several species in this group, they only encompass a small fraction of the genus. Recent advances in long-read sequencing allow high-quality genome assemblies for tens or even hundreds of species to be efficiently generated. Here, we utilize Oxford Nanopore sequencing to build an open community resource of genome assemblies for 101 lines of 93 drosophilid species encompassing 14 species groups and 35 sub-groups. The genomes are highly contiguous and complete, with an average contig N50 of 10.5 Mb and greater than 97% BUSCO completeness in 97/101 assemblies. We show that Nanopore-based assemblies are highly accurate in coding regions, particularly with respect to coding insertions and deletions. These assemblies, along with a detailed laboratory protocol and assembly pipelines, are released as a public resource and will serve as a starting point for addressing broad questions of genetics, ecology, and evolution at the scale of hundreds of species.
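The contig N50 quoted in the abstract is a standard contiguity statistic: the length of the shortest contig in the smallest set of longest contigs that together cover at least half of the total assembly length. As a minimal sketch of that definition, and not the authors' pipeline, the function below computes it from a list of contig lengths. The name `contig_n50` and the toy lengths are hypothetical and are not taken from the dataset.

```python
def contig_n50(lengths):
    """Return the N50 of a collection of contig lengths (in bp).

    Hypothetical helper for illustration; not part of the published pipeline.
    """
    if not lengths:
        raise ValueError("need at least one contig length")
    # Walk contigs from longest to shortest, accumulating length.
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first contig that pushes the running sum to half of the
        # assembly length defines the N50.
        if running >= half_total:
            return length


# Toy contig lengths in bp, invented for illustration only.
toy_lengths = [12_000_000, 9_000_000, 5_000_000, 3_000_000, 1_000_000]
print(contig_n50(toy_lengths))  # prints 9000000
```

In the toy case the total is 30 Mb, so the two longest contigs (21 Mb together) are the first to reach the 15 Mb halfway point, and the N50 is the shorter of the two.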

SUBMITTER: Kim BY 

PROVIDER: S-EPMC8337076 | biostudies-literature | 2021 Jul

REPOSITORIES: biostudies-literature


Publications

Highly contiguous assemblies of 101 drosophilid genomes.

Kim, Bernard Y; Wang, Jeremy R; Miller, Danny E; Barmina, Olga; Delaney, Emily; Thompson, Ammon; Comeault, Aaron A; Peede, David; D'Agostino, Emmanuel R R; Pelaez, Julianne; Aguilar, Jessica M; Haji, Diler; Matsunaga, Teruyuki; Armstrong, Ellie E; Zych, Molly; Ogawa, Yoshitaka; Stamenković-Radak, Marina; Jelić, Mihailo; Veselinović, Marija Savić; Tanasković, Marija; Erić, Pavle; Gao, Jian-Jun; Katoh, Takehiro K; Toda, Masanori J; Watabe, Hideaki; Watada, Masayoshi; Davis, Jeremy S; Moyle, Leonie C; Manoli, Giulia; Bertolini, Enrico; Košťál, Vladimír; Hawley, R Scott; Takahashi, Aya; Jones, Corbin D; Price, Donald K; Whiteman, Noah; Kopp, Artyom; Matute, Daniel R; Petrov, Dmitri A

eLife, 2021-07-19


Similar Datasets

| S-EPMC8933002 | biostudies-literature
| S-EPMC9346566 | biostudies-literature
| S-EPMC6169393 | biostudies-literature
| S-EPMC10763503 | biostudies-literature
| S-EPMC1950901 | biostudies-literature
| S-EPMC2770083 | biostudies-literature
| S-EPMC8442456 | biostudies-literature
| S-EPMC7484070 | biostudies-literature
| S-EPMC6893191 | biostudies-literature
| S-EPMC10563150 | biostudies-literature