Unknown

Dataset Information

0

The first near-complete assembly of the hexaploid bread wheat genome, Triticum aestivum.


ABSTRACT: Common bread wheat, Triticum aestivum, has one of the most complex genomes known to science, with 6 copies of each chromosome, enormous numbers of near-identical sequences scattered throughout, and an overall haploid size of more than 15 billion bases. Multiple past attempts to assemble the genome have produced assemblies that were well short of the estimated genome size. Here we report the first near-complete assembly of T. aestivum, using deep sequencing coverage from a combination of short Illumina reads and very long Pacific Biosciences reads. The final assembly contains 15 344 693 583 bases and has a weighted average (N50) contig size of 232 659 bases. This represents by far the most complete and contiguous assembly of the wheat genome to date, providing a strong foundation for future genetic studies of this important food crop. We also report how we used the recently published genome of Aegilops tauschii, the diploid ancestor of the wheat D genome, to identify 4 179 762 575 bp of T. aestivum that correspond to its D genome components.

SUBMITTER: Zimin AV 

PROVIDER: S-EPMC5691383 | biostudies-literature | 2017 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

The first near-complete assembly of the hexaploid bread wheat genome, Triticum aestivum.

Zimin Aleksey V AV   Puiu Daniela D   Hall Richard R   Kingan Sarah S   Clavijo Bernardo J BJ   Salzberg Steven L SL  

GigaScience 20171101 11


Common bread wheat, Triticum aestivum, has one of the most complex genomes known to science, with 6 copies of each chromosome, enormous numbers of near-identical sequences scattered throughout, and an overall haploid size of more than 15 billion bases. Multiple past attempts to assemble the genome have produced assemblies that were well short of the estimated genome size. Here we report the first near-complete assembly of T. aestivum, using deep sequencing coverage from a combination of short Il  ...[more]

Similar Datasets

| S-EPMC8360199 | biostudies-literature
| S-EPMC3981729 | biostudies-literature
| S-EPMC7922369 | biostudies-literature
| S-EPMC4588698 | biostudies-literature
| S-EPMC6230953 | biostudies-literature
| S-EPMC5316916 | biostudies-literature
| S-EPMC5072575 | biostudies-literature
| S-EPMC3119610 | biostudies-literature
| S-EPMC4995049 | biostudies-literature
2021-12-21 | PXD022231 | Pride