Project description:The skin commensal yeast Malassezia is associated with several skin disorders. To establish a reference resource, we sought to determine the complete genome sequence of Malassezia sympodialis and identify its protein-coding genes. A novel genome annotation workflow combining RNA sequencing, proteomics, and manual curation was developed to determine gene structures with high accuracy.
Project description:This data set is part of a study where the genome of Malassezia sympodialis (strain ATCC 42132) was sequenced using long-read technology and annotated using RNA-seq and proteogenomics. RNA was extracted at two different culture times (2 and 4 days). Seven RNA-seq libraries were prepared from independent samples. Two samples (P2 and P3) were enriched for protein-coding RNA using poly(A)-selection. The remaining five samples were processed with RiboMinus to deplete ribosomal RNA, and thus retain both mRNA and non-ribosomal noncoding RNA for sequencing. In total, we obtained 71 million RNA-seq read pairs mapping to genomic regions other than the highly expressed ribosomal loci.