Unknown

Dataset Information

0

Sixteen diverse laboratory mouse reference genomes define strain-specific haplotypes and novel functional loci.


ABSTRACT: We report full-length draft de novo genome assemblies for 16 widely used inbred mouse strains and find extensive strain-specific haplotype variation. We identify and characterize 2,567 regions on the current mouse reference genome exhibiting the greatest sequence diversity. These regions are enriched for genes involved in pathogen defence and immunity and exhibit enrichment of transposable elements and signatures of recent retrotransposition events. Combinations of alleles and genes unique to an individual strain are commonly observed at these loci, reflecting distinct strain phenotypes. We used these genomes to improve the mouse reference genome, resulting in the completion of 10 new gene structures. Also, 62 new coding loci were added to the reference genome annotation. These genomes identified a large, previously unannotated, gene (Efcab3-like) encoding 5,874 amino acids. Mutant Efcab3-like mice display anomalies in multiple brain regions, suggesting a possible role for this gene in the regulation of brain development.

SUBMITTER: Lilue J 

PROVIDER: S-EPMC6205630 | biostudies-literature | 2018 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Sixteen diverse laboratory mouse reference genomes define strain-specific haplotypes and novel functional loci.

Lilue Jingtao J   Doran Anthony G AG   Fiddes Ian T IT   Abrudan Monica M   Armstrong Joel J   Bennett Ruth R   Chow William W   Collins Joanna J   Collins Stephan S   Czechanski Anne A   Danecek Petr P   Diekhans Mark M   Dolle Dirk-Dominik DD   Dunn Matt M   Durbin Richard R   Earl Dent D   Ferguson-Smith Anne A   Flicek Paul P   Flint Jonathan J   Frankish Adam A   Fu Beiyuan B   Gerstein Mark M   Gilbert James J   Goodstadt Leo L   Harrow Jennifer J   Howe Kerstin K   Ibarra-Soria Ximena X   Kolmogorov Mikhail M   Lelliott Chris J CJ   Logan Darren W DW   Loveland Jane J   Mathews Clayton E CE   Mott Richard R   Muir Paul P   Nachtweide Stefanie S   Navarro Fabio C P FCP   Odom Duncan T DT   Park Naomi N   Pelan Sarah S   Pham Son K SK   Quail Mike M   Reinholdt Laura L   Romoth Lars L   Shirley Lesley L   Sisu Cristina C   Sjoberg-Herrera Marcela M   Stanke Mario M   Steward Charles C   Thomas Mark M   Threadgold Glen G   Thybert David D   Torrance James J   Wong Kim K   Wood Jonathan J   Yalcin Binnaz B   Yang Fengtang F   Adams David J DJ   Paten Benedict B   Keane Thomas M TM  

Nature genetics 20181001 11


We report full-length draft de novo genome assemblies for 16 widely used inbred mouse strains and find extensive strain-specific haplotype variation. We identify and characterize 2,567 regions on the current mouse reference genome exhibiting the greatest sequence diversity. These regions are enriched for genes involved in pathogen defence and immunity and exhibit enrichment of transposable elements and signatures of recent retrotransposition events. Combinations of alleles and genes unique to an  ...[more]

Similar Datasets

| S-EPMC4686825 | biostudies-literature
| S-EPMC3121819 | biostudies-other
2014-10-15 | E-ERAD-328 | biostudies-arrayexpress
| S-EPMC6553538 | biostudies-literature
| S-EPMC6726654 | biostudies-literature
| S-EPMC1449890 | biostudies-literature
| S-EPMC4175211 | biostudies-literature
| S-EPMC4534496 | biostudies-literature
| S-EPMC8743550 | biostudies-literature
| PRJEB49424 | ENA