Project description:L. helveticus is used to modulate cheese flavor and as a starter organism in certain cheese varieties. Our group has compiled a draft (4x) sequence for the 2.4 Mb genome of an industrial strain L. helveticus CNRZ32. The primary aim was to investigate expression of 168 completely sequenced genes during growth in milk and MRS medium using microarrays. Oligonucleotide probes against each of the completely sequenced genes were compiled on maskless photolithography-based DNA microarrays. Additionally, the entire draft genome sequence was used to produce tiled microarrays where the non-interrupted sequence contigs were covered by consecutive 24-mer probes. Keywords: growth conditions response
Project description:Tremella fuciformis is a popular edible fungus with fruiting bodies that can be produced in large quantities at low costs, while it is easy to transform and cultivate as yeast. This makes it an attractive potential bioreactor. Enhanced heterologous gene expression through codon optimization would be useful, but until now codon usage preferences in T. fuciformis remain unknown. The only available genome of a species in the genus Tremella was that of Tremella mesenterica. To precisely determine the preferred codon usage of T. fuciformis we sequenced the genome of strain Tr26 resulting in a 24.5 Mb draft genome with 10,040 predicted genes. 3288 of the derived predicted proteins matched the UniProtKB/Swiss-Prot databases with 40% or more similarity. Corresponding gene models of this subset were subsequently optimized trough repetitive comparison of alternative start codons and selection of best length matching gene models. For experimental confirmation of gene models, 96 random clones from an existing T. fuciformis cDNA library were sequenced, generating 80 complete CDSs. Calculated optimal codons (RSCU and RFCU values) for the 3288 predicted and the 80 cloned CDSs were highly similar, indicating sufficient accuracy of predicted gene models for codon usage analysis. T. fuciformis showed a strong preference for C and then G (C+G; 66.4 %) at the third base pair position of used codons, while average GC content of predicted genes was slightly higher (58.5 %) than he total genome sequence average (55.9%). Most frequently used codons (optimal codons) all ended in C or G except for one (Ter; AGU), and an increased frequency of C ending codons was observed in genes with higher expression levels. Surprisingly, the preferred codon usage in T. fuciformis strongly differed from T. mesenterica (same genus) and C. neoformans (same family). Instead, optimal codon usage was similar to more distant related species such as Ustilago maydis and Neurospora crassa. Despite much higher overall sequence homology between T. fuciformis and T. mesenterica only 7 out of 21 optimal codons were equal, whereas T. fuciformis shared up to 20 out 21 optimal codons with other species. Clearly, codon usage in Tremella can differ largely and should be estimated for individual species. The precise identification of optimal and high expression related codons is therefore an important step in the development of T. fuciformis as a bioreactor system. Using the Total RNA was extracted from yeast-like cells of T. fuciformis Tr21 in their exponential growth stage, and double-stranded cDNA was synthesized for tag library construction and digital gene expression tag (DGE) sequencing at BGI Tech Solutions Co., Ltd (Shenzhen, China). Image analysis, base calling, extraction of tags, and tag counting were conducted using the Illumina pipeline. Clean tags were mapped to predicted gene models of the draft genome with a mismatch tolerance of 1bp. The number of tags for each CDS was calculated and then normalized to TPM (number of transcripts per million clean tags) digital gene expression (DGE)
Project description:Despite a significant increase in genomic data, our knowledge of gene functions and their transcriptional responses to environmental stimuli remains limited. Here, we use the model keystone species Daphnia pulex to study environmental responses of genes in the context of their gene family history to better understand the relationship between genome structure and gene function in response to environmental stimuli. Daphnia were exposed to five different treatments, each consisting of a diet supplemented with one of five cyanobacterial species, and a control treatment consisting of a diet of only green algae. Differential gene expression profiles of Daphnia exposed to each of these five cyanobacterial species showed that genes with known functions are more likely to be shared by different expression profiles whereas genes specific to the lineage of Daphnia are more likely to be unique to a given expression profile. Furthermore, while only a small number of non-lineage specific genes was conserved across treatment type, there was a high degree of overlap in expression profiles at the functional level. The conservation of functional responses across the different cyanobacterial treatments can be attributed to the treatment specific expression of different paralogous genes within the same gene family. Comparison with available gene expression data in the literature suggests differences in nutritional composition in diets with cyanobacterial species compared to diets of green algae as a primary driver for cyanobacterial effects on Daphnia. We conclude that conserved functional responses in Daphnia across different cyanobacterial treatments are mediated through alternate regulation of paralogous gene families.
Project description:The Zika outbreak, spread by the Aedes aegypti mosquito, highlights the need to create high-quality assemblies of large genomes in a rapid and cost-effective fashion. Here, we combine Hi-C data with existing draft assemblies to generate chromosome-length scaffolds. We validate this method by assembling a human genome, de novo, from short reads alone (67X coverage, Sample GSM1551550). We then combine our method with draft sequences to create genome assemblies of the mosquito disease vectors Aedes aegypti and Culex quinquefasciatus, each consisting of three scaffolds corresponding to the three chromosomes in each species. These assemblies indicate that virtually all genomic rearrangements among these species occur within, rather than between, chromosome arms. The genome assembly procedure we describe is fast, inexpensive, accurate, and can be applied to many species.
Project description:Despite a significant increase in genomic data, our knowledge of gene functions and their transcriptional responses to environmental stimuli remains limited. Here, we use the model keystone species Daphnia pulex to study environmental responses of genes in the context of their gene family history to better understand the relationship between genome structure and gene function in response to environmental stimuli. Daphnia were exposed to five different treatments, each consisting of a diet supplemented with one of five cyanobacterial species, and a control treatment consisting of a diet of only green algae. Differential gene expression profiles of Daphnia exposed to each of these five cyanobacterial species showed that genes with known functions are more likely to be shared by different expression profiles whereas genes specific to the lineage of Daphnia are more likely to be unique to a given expression profile. Furthermore, while only a small number of non-lineage specific genes was conserved across treatment type, there was a high degree of overlap in expression profiles at the functional level. The conservation of functional responses across the different cyanobacterial treatments can be attributed to the treatment specific expression of different paralogous genes within the same gene family. Comparison with available gene expression data in the literature suggests differences in nutritional composition in diets with cyanobacterial species compared to diets of green algae as a primary driver for cyanobacterial effects on Daphnia. We conclude that conserved functional responses in Daphnia across different cyanobacterial treatments are mediated through alternate regulation of paralogous gene families. Whole transcriptome dual color arrays were used to discover differentially expressed genes following sub-lethal exposure to five cyanobacteria in D. pulex. RNA was isolated from eight independent and concurrently replicated exposures of Daphnia to control and five cyanobacteria conditions. RNA was hybridized to microarrays using a standard, control vs. treated design that included dye swaps. Cyanobacteria were Anabaena (ANA), Aphanizomenon (Aph), Cylindrospermopsis (Cyl), Nodularia (Nod) and Oscillatoria (Osl).
Project description:The long-tailed macaque, also referred to as cynomolgus monkey (Macaca fascicularis), is one of the most important non-human primate animal models in basic and applied biomedical research. To improve the predictive power of primate experiments for humans, we determined the genome sequence of a Macaca fascicularis female of Mauritian origin using a whole-genome shotgun sequencing approach. We applied a template switch strategy which employs either the rhesus or the human genome to assemble sequence reads. The 6-fold sequence coverage of the draft genome sequence enabled discovery of about 2.1 million potential single-nucleotide polymorphisms based on occurrence of a dimorphic nucleotide at a given position in the genome sequence. Homology-based annotation allowed us to identify 17,387 orthologs of human protein-coding genes in the M. fascicularis draft genome and the predicted transcripts enabled the design of a M. fascicularis-specific gene expression microarray. Using liver samples from 36 individuals of different geographic origin, we identified 718 genes with highly variable expression in liver, whereas the majority of the transcriptome shows relatively stable and comparable expression. Knowledge of the M. fascicularis draft genome is an important contribution to both the use of this animal in disease models and the safety assessment of drugs and their metabolites. In particular, this information allows high-resolution genotyping and microarray-based gene expression profiling for animal stratification, thereby allowing the use of well-characterized animals for safety testing. Finally, the genome sequence presented here is a significant contribution to the global "3R" animal welfare initiative, which has the goal to reduce, refine and replace animal experiments.
Project description:Since CNVs play a vital role in genomic studies, it is an imperative need to develop a comprehensive, more accurate and higher resolution porcine CNV map with practical significance in follow-up CNV functional analyses To detect CNV of pigs, we performed high density aCGH data of diverse pig breeds in the framework of the pig draft genome sequence (Sscrofa10.2)
Project description:Since CNVs play a vital role in genomic studies, it is an imperative need to develop a comprehensive, more accurate and higher resolution porcine CNV map with practical significance in follow-up CNV functional analyses To detect CNV of pigs, we performed high density aCGH data of diverse pig breeds in the framework of the pig draft genome sequence (Sscrofa10.2)