Unknown

Dataset Information

0

Sub genome anchored physical frameworks of the allotetraploid Upland cotton (Gossypium hirsutum L.) genome, and an approach toward reference-grade assemblies of polyploids.


ABSTRACT: Like those of many agricultural crops, the cultivated cotton is an allotetraploid and has a large genome (~2.5 gigabase pairs). The two sub genomes, A and D, are highly similar but unequally sized and repeat-rich, which pose significant challenges for accurate genome reconstruction using standard approaches. Here we report the development of BAC libraries, sub genome specific physical maps, and a new-generation sequencing approach that will lead to a reference-grade genome assembly for Upland cotton. Three BAC libraries were constructed, fingerprinted, and integrated with BAC-end sequences (BES) to produce a de novo whole-genome physical map. The BAC map was partitioned by sub genomes through alignment to the diploid progenitor D-genome reference sequence with densely spaced BES anchor points and computational filtering. The physical maps were validated with FISH and genetic mapping of SNP markers derived from BES. Two pairs of homeologous chromosomes, A11/D11 and A12/D12, were used to assess multiplex sequencing approaches for completeness and scalability. The results represent the first sub genome anchored physical maps of Upland cotton, and a new-generation approach to the whole-genome sequencing, which will lead to the reference-grade assembly of allopolyploid cotton and serve as a general strategy for sequencing other polyploid species.

SUBMITTER: Saski CA 

PROVIDER: S-EPMC5681701 | biostudies-literature | 2017 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Sub genome anchored physical frameworks of the allotetraploid Upland cotton (Gossypium hirsutum L.) genome, and an approach toward reference-grade assemblies of polyploids.

Saski Christopher A CA   Scheffler Brian E BE   Hulse-Kemp Amanda M AM   Liu Bo B   Song Qingxin Q   Ando Atsumi A   Stelly David M DM   Scheffler Jodi A JA   Grimwood Jane J   Jones Don C DC   Peterson Daniel G DG   Schmutz Jeremy J   Chen Z Jeffery ZJ  

Scientific reports 20171110 1


Like those of many agricultural crops, the cultivated cotton is an allotetraploid and has a large genome (~2.5 gigabase pairs). The two sub genomes, A and D, are highly similar but unequally sized and repeat-rich, which pose significant challenges for accurate genome reconstruction using standard approaches. Here we report the development of BAC libraries, sub genome specific physical maps, and a new-generation sequencing approach that will lead to a reference-grade genome assembly for Upland co  ...[more]

Similar Datasets

| S-EPMC5785356 | biostudies-other
| S-EPMC6736783 | biostudies-literature
| S-EPMC5459830 | biostudies-literature
| S-EPMC3306275 | biostudies-literature
| S-EPMC5579058 | biostudies-literature
| S-EPMC6013946 | biostudies-literature
| S-EPMC5841646 | biostudies-literature
| S-EPMC3438127 | biostudies-literature
| S-EPMC5698051 | biostudies-literature
| S-EPMC8342446 | biostudies-literature