Unknown,Transcriptomics,Genomics,Proteomics

Dataset Information

0

Whole genome sequencing of Saccharomyces cerevisiae: from genotype to phenotype for improved metabolic engineering applications


ABSTRACT: The needs for rapid and efficient microbial cell factory design and construction are possible through the enabling technology, metabolic engineering, which is now being facilitated by systems biology approaches. Metabolic engineering is often complimented by directed evolution, where selective pressure is applied to a partially genetically engineered strain to confer a desirable phenotype. The exact genetic modification or resulting genotype that leads to the improved phenotype is often not identified or understood to enable further metabolic engineering. In this work we establish proof-of-concept that whole genome high-throughput sequencing and annotation can be used to identify single nucleotide polymorphisms (SNPs) between Saccharomyces cerevisiae strains S288c and CEN.PK113-7D. The yeast strain S288c was the first eukaryote sequenced, serving as the reference genome for the Saccharomyces Genome Database, while CEN.PK113-7D is a preferred laboratory strain for industrial biotechnology research. A total of 13,787 high-quality SNPs were detected between both strains (reference strain: S288c). Considering only metabolic genes (782 of 5,873 annotated genes), a total of 219 metabolism specific SNPs are distributed across 158 metabolic genes, with 85 of the SNPs being non-silent (e.g., encoding amino acid modifications). Amongst metabolic SNPs detected, there was pathway enrichment in the galactose uptake pathway (GAL1, GAL10) and ergosterol biosynthetic pathway (ERG8, ERG9). Physiological characterization confirmed a strong deficiency in galactose uptake and metabolism in S288c compared to CEN.PK113-7D, and similarly, ergosterol content in CEN.PK113-7D was significantly higher in both glucose and galactose supplemented cultivations compared to S288c. Furthermore, DNA microarray profiling of S288c and CEN.PK113-7D in both glucose and galactose batch cultures did not provide a clear hypothesis for major phenotypes observed, suggesting that genotype to phenotype correlations are manifested post-transcriptionally or post-translationally either through protein concentration and/or function. With an intensifying need for microbial cell factories that produce a wide array of target compounds, whole genome high-throughput sequencing and annotation for SNP detection can aid in better reducing and defining the metabolic landscape. This work demonstrates direct correlations between genotype and phenotype that provides clear and high-probability of success metabolic engineering targets. The genome sequence, annotation, and a SNP viewer of CEN.PK113-7D are deposited at www.sysbio.se/cenpk. Keywords: Two strains and two different carbon sources Two conditions (glucose and galactose) with two biological replicates for S. cerevisiae strains S288c and CEN.PK113-7D

ORGANISM(S): Saccharomyces cerevisiae

SUBMITTER: wanwipa vongsangnak 

PROVIDER: E-GEOD-21479 | biostudies-arrayexpress |

REPOSITORIES: biostudies-arrayexpress

Similar Datasets

2010-12-22 | GSE21479 | GEO
2003-06-23 | GSE461 | GEO
2007-07-07 | E-GEOD-461 | biostudies-arrayexpress
2012-04-16 | E-GEOD-30052 | biostudies-arrayexpress
2012-04-16 | E-GEOD-30051 | biostudies-arrayexpress
| PRJNA603441 | ENA
2022-02-15 | PXD014765 | Pride
| PRJNA191134 | ENA
| PRJNA52955 | ENA
2023-02-05 | PXD037944 | Pride