Project description:The draft genome of L. sativa (lettuce) cv. Tizian was sequenced in two Illumina sequencing runs, mate pair and shotgun. This entry contains the RAW sequencing data.
Project description:Indian sandalwood (Santalum album) is an economically important plant known for its aromatic wood. This highly valued plant has also been reported as an endangered species. Despite its economic value, the genome sequence of this plant is not yet available. In the current study,we report the draft genome sequence of sandalwood generated using Illumina HiSeq1000 sequencing platform. Genome annotation was carried out using InterProScan tool and Uniprot database,which was further facilitated using in-house RNA-Seq data. Further, we carried out in-depth proteome analysis of samples derived from four tissues viz., shoot meristem, leaf, stem and fruit using high-resolution tandem mass spectrometry. Proteogenomics analysis was performed to identify novel gene models, revise the predicted gene structures and provide experimental evidence for the predicted genes. Our analysis resulted in the identification of 72,325 peptides mapping to 10,076 genes predicted in the sandalwood genome thereby validating the expression of these gene models. Additionally, this study also provides evidence for 53 novel protein coding genes and revision of 121existing gene models.
Project description:We present a draft genome assembly that includes 200 Gb of Illumina reads, 4 Gb of Moleculo synthetic long-reads and 108 Gb of Chicago libraries, with a final size matching the estimated genome size of 2.7 Gb, and a scaffold N50 of 4.8 Mb. We also present an alternative assembly including 27 Gb raw reads generated using the Pacific Biosciences platform. In addition, we sequenced the proteome of the same individual and RNA from three different tissue types from three other species of squid species (Onychoteuthis banksii, Dosidicus gigas, and Sthenoteuthis oualaniensis) to assist genome annotation. We annotated 33,406 protein coding genes supported by evidence and the genome completeness estimated by BUSCO reached 92%. Repetitive regions cover 49.17% of the genome.
Project description:Background Trombidid mites have a unique lifecycle in which only the larval stage is ectoparasitic. In the superfamily Trombiculoidea (“chiggers”), the larvae feed preferentially on vertebrates, including humans. Species in the genus Leptotrombidium are vectors of a potentially fatal bacterial infection, scrub typhus, which affects 1 million people annually. Moreover, chiggers can cause pruritic dermatitis (trombiculiasis) in humans and domesticated animals. In the Trombidioidea (velvet mites), the larvae feed on other arthropods and are potential biological control agents for agricultural pests. Here, we present the first trombidid mites genomes, obtained both for a chigger, Leptotrombidium deliense, and for a velvet mite, Dinothrombium tinctorium. Results Sequencing was performed on the Illumina MiSeq platform. A 180 Mb draft assembly for D. tinctorium was generated from two paired-end and one mate-pair library using a single adult specimen. For L. deliense, a lower-coverage draft assembly (117 Mb) was obtained using pooled, engorged larvae with a single paired-end library. Remarkably, both genomes exhibited evidence of ancient lateral gene transfer from soil-derived bacteria or fungi. The transferred genes confer functions that are rare in animals, including terpene and carotenoid synthesis. Thirty-seven allergenic protein families were predicted in the L. deliense genome, of which nine were unique. Preliminary proteomic analyses identified several of these putative allergens in larvae. Conclusions Trombidid mite genomes appear to be more dynamic than those of other acariform mites. A priority for future research is to determine the biological function of terpene synthesis in this taxon and its potential for exploitation in disease control. Project was jointly supervised by Stuart Armstrong and Ben Makepeace.
Project description:“Bathymodiolus” childressi is a species of deep-sea mussels found predominantly in the Gulf of Mexico harbouring a methane oxidizing symbiont (MOX). Ca. Endonucleobacter is a gammaproteobacterial intranuclear parasite that infects the ciliated edge cells of the deep-sea mussel “B.” childressi. After a single Ca. Endonucleobacter cell invades the host nucleus, it proliferates massively, increasing the volume of the nucleus up to 50 fold. It is hypothesized that intranuclear parasites use host chromatin as a nutritional source, however, this would lead to a destabilization of the replication niche. The aim of this study was to investigate how Ca. Endonucleobacter thrives in the nucleus and how it affects the host cell using various “-omics' ' approaches. In addition to protein identification we generated a high quality genome draft for Ca. Endonucleobacter using a hybrid assembly pipeline (PacBio + Illumina reads). We annotated this high quality genome draft and used it as a reference to identify the proteins responsible for the molecular interaction between Ca. Endonucleobacter and its host cell. The 12 specimens of “B.” childressi analyzed in this study were collected during five dives from the Mississippi Canyon site (MC853, 28º07’ N; -089º08’ W) and the Green Canyon site (GC234, 27º45’ N; -091º13’ W) at a water depth of 1,070 and 540 m, respectively. The mussels were selected according to their degree of Ca. Endonucleobacter infection. According to this criterion, the “B.” childressi specimens were divided into two conditions: 8 were classified as “infected”, while 4 were classified as “non-infected” (negative control). For each specimen, we prepare microdissected ciliated edge samples in triplicates.
Project description:A new haloalkaliphilic species of Wenzhouxiangella, strain AB-CW3 was isolated from a system of alkaline soda lakes in the Kulunda Steppe. Its complete, circular genome was assembled from combined nanopore and illumina sequencing and its proteome was determined for three different experimental conditions: growth on Staphylococcus cells, casein, or peptone. AB-CW3 is an aerobic bacterium feeding mainly on proteins and peptides.
Project description:Omics approaches are broadly used to explore endocrine and toxicity-related pathways and functions. Nevertheless, there is still a significant gap in knowledge in terms of understanding the endocrine system and its numerous connections and intricate feedback loops, especially in non-model organisms. The fathead minnow (Pimephales promelas) is a widely used small fish model for aquatic toxicology and regulatory testing, particularly in North America. A draft genome has been published but the amount of available genomic or transcriptomic information is still far behind that of other more broadly studied species, such as the zebrafish. Here, we surveyed the tissue-specific proteome and transcriptome profiles in adult male fathead minnow. To do so, we generated a draft transcriptome using short and long sequencing reads. We also performed RNA sequencing and proteomics analysis on the telencephalon, hypothalamus, liver, and gut of male fish. The main purpose of this analysis was to generate tissue-specific omics data in order to support future aquatic ecotoxicogenomic and endocrine-related studies as well as to improve our understanding of the fathead minnow as an ecological model.