Project description:The Zika outbreak, spread by the Aedes aegypti mosquito, highlights the need to create high-quality assemblies of large genomes in a rapid and cost-effective fashion. Here, we combine Hi-C data with existing draft assemblies to generate chromosome-length scaffolds. We validate this method by assembling a human genome, de novo, from short reads alone (67X coverage, Sample GSM1551550). We then combine our method with draft sequences to create genome assemblies of the mosquito disease vectors Aedes aegypti and Culex quinquefasciatus, each consisting of three scaffolds corresponding to the three chromosomes in each species. These assemblies indicate that virtually all genomic rearrangements among these species occur within, rather than between, chromosome arms. The genome assembly procedure we describe is fast, inexpensive, accurate, and can be applied to many species.
Project description:The Atlantic killifish (Fundulus heteroclitus) is an ideal model species to study physiological and toxicological adaptations to stressors. Killifish inhabiting the PCB-contaminated Superfund site in New Bedford Harbor, MA (NBH) have evolved resistance to toxicity and activation of the aryl hydrocarbon receptor (AHR) signaling pathway after exposure to PCBs and other AHR agonists. Until recently, a lack of genomic information has limited efforts to understand the molecular mechanisms underlying environmental adaptation to stressors. The advent of high throughput sequencing has facilitated an unbiased assessment of coding as well as non-coding RNAs in any species of interest. Among non-coding RNAs, microRNAs (miRNAs) are important regulators of gene expression and play crucial roles in development and physiology. The objective of this study is to catalog the miRNAs in killifish and determine their expression patterns in the embryos from contaminated (NBH) and pristine (Scorton Creek, MA (SC)) sites. Embryos from NBH and SC were collected daily from 1 to 15 days post-fertilization and RNA from pooled samples from each site was sequenced using SOLiD sequencing. We obtained 7.5 and 11 million raw reads from pooled SC and NBH samples, respectively. Analysis of the sequencing data identified 216 conserved mature miRNA sequences that are expressed during development. Using the draft killifish genome, we retrieved the miRNA precursor sequences. Based on the capacity of these putative precursor sequences to form the characteristic hairpin loop (assessed using RNAfold), we identified 197 conserved miRNA sequences in the genome.
2015-07-20 | GSE70953 | GEO
Project description:Draft Nuclear Genome Sequences for Three Ambrosial Ceratocystidaceae Species
Project description:Frogs are an ecologically diverse and phylogenetically ancient group of anuran amphibians that include important vertebrate cell and developmental model systems, notably the genus Xenopus. Here we report a high-quality reference genome sequence for the western clawed frog, Xenopus tropicalis, along with draft chromosome-scale sequences of three distantly related emerging model frog species, Eleutherodactylus coqui, Engystomops pustulosus and Hymenochirus boettgeri. Frog chromosomes have remained remarkably stable since the Mesozoic Era, with limited Robertsonian (i.e., centric) translocations and end-to-end fusions found among the smaller chromosomes. Conservation of synteny includes conservation of centromere locations, marked by centromeric tandem repeats associated with Cenp-a binding, surrounded by pericentromeric LINE/L1 elements. We explored chromosome structure across frogs, using a dense meiotic linkage map for X. tropicalis and chromatin conformation capture (Hi-C) data for all species. Abundant satellite repeats occupy the unusually long (~20 megabase) terminal regions of each chromosome that coincide with high rates of recombination. Both embryonic and differentiated cells show reproducible association of centromeric chromatin, and of telomeres, reflecting a Rabl-like configuration. Our comparative analyses reveal 13 conserved ancestral anuran chromosomes from which contemporary frog genomes were constructed.
Project description:We present a draft genome assembly built from 200 Gb of Illumina reads, 4 Gb of Moleculo synthetic long reads, and 108 Gb of Chicago libraries, with a final size matching the estimated genome size of 2.7 Gb and a scaffold N50 of 4.8 Mb. We also present an alternative assembly that incorporates 27 Gb of raw reads generated on the Pacific Biosciences platform. In addition, we sequenced the proteome of the same individual, as well as RNA from three different tissue types from three other squid species (Onychoteuthis banksii, Dosidicus gigas, and Sthenoteuthis oualaniensis), to assist genome annotation. We annotated 33,406 protein-coding genes supported by evidence, and genome completeness as estimated by BUSCO reached 92%. Repetitive regions cover 49.17% of the genome.
Project description:The draft genome of L. sativa (lettuce) cv. Tizian was sequenced in two Illumina sequencing runs, one mate-pair and one shotgun. This entry contains the raw sequencing data.