Project description:Cotton is an agriculturally important crop. Because of its importance, a genome sequence of a diploid cotton species (Gossypium raimondii, D-genome) was first assembled using Sanger sequencing data in 2012. Improvements to DNA sequencing technology have improved accuracy and correctness of assembled genome sequences. Here we report a new de novo genome assembly of G. raimondii and its close relative G. turneri The two genomes were assembled to a chromosome level using PacBio long-read technology, HiC, and Bionano optical mapping. This report corrects some minor assembly errors found in the Sanger assembly of G. raimondii We also compare the genome sequences of these two species for gene composition, repetitive element composition, and collinearity. Most of the identified structural rearrangements between these two species are due to intra-chromosomal inversions. More inversions were found in the G. turneri genome sequence than the G. raimondii genome sequence. These findings and updates to the D-genome sequence will improve accuracy and translation of genomics to cotton breeding and genetics.
Project description:Purpose: The goal of this experiment was to use RNA-seq to compare the two commercial cotton species Gossypium hirsutum and Gossypium barbadense and determine what transcripts may account for the better fiber quality in the latter. Methods: RNA was extracted from Gossypium barbadense or Gossypium hirsutum fibers at 10, 15, 18, 21, and 28 days post anthesis. Paired-end, 100-bp RNA-seq was performed on an Illumina HiSeq2000 and the reads were mapped to the Gossypium raimondii genome at www.phytozome.net and non-homologous contig assemblies from Gossypium arboreum. Results from RNA-seq were combined with non-targeted metabolomics. Results: Approximately 38,000 transcripts were expressed (RPKM>2) in each fiber type and approximately 2,000 of these transcripts were differentially expressed in a cross-species comparison at each timepoint. Enriched Gene Ontology biological processes in differentially expressed transcripts suggested that Gh fibers were more stressed. Conclusions: Both metabolomic and transcriptomic data suggest that better mechanisms for managing reactive oxygen species contribute to the increased fiber length in Gossypium barbadense. This appears to result from enhanced ascorbate biosynthesis via gulono-1,4-lactone oxidase and ascorbate recycling via dehydroascorbate reductase.
Project description:Purpose: The goal of this experiment was to use RNA-seq to compare the two commercial cotton species Gossypium hirsutum and Gossypium barbadense and determine what transcripts may account for the better fiber quality in the latter. Methods: RNA was extracted from Gossypium barbadense or Gossypium hirsutum fibers at 10, 15, 18, 21, and 28 days post anthesis. Paired-end, 100-bp RNA-seq was performed on an Illumina HiSeq2000 and the reads were mapped to the Gossypium raimondii genome at www.phytozome.net and non-homologous contig assemblies from Gossypium arboreum. Results from RNA-seq were combined with non-targeted metabolomics. Results: Approximately 38,000 transcripts were expressed (RPKM>2) in each fiber type and approximately 2,000 of these transcripts were differentially expressed in a cross-species comparison at each timepoint. Enriched Gene Ontology biological processes in differentially expressed transcripts suggested that Gh fibers were more stressed. Conclusions: Both metabolomic and transcriptomic data suggest that better mechanisms for managing reactive oxygen species contribute to the increased fiber length in Gossypium barbadense. This appears to result from enhanced ascorbate biosynthesis via gulono-1,4-lactone oxidase and ascorbate recycling via dehydroascorbate reductase. See Bioproject PRJNA263926 and SRA accession SRP049330 for study design and raw sequencing data and Bioproject PRJNA269608 and TSA accession GBYK00000000 for Gossypium arboreum assembled contig sequences used for transcriptome mapping - Cotton fiber mRNA from 10,15,18,21 and 28 day post anthesis fiber from either Gossypium hirusutm or Gossypium barbadense was sequenced and differential gene expression analysis was conducted between species for each timepoint and between adjacent timepoints. Each timepoint was representative of fiber from 9 individual plants processed as 3 biological replicate pools (material from 3 individual plants per pool).