Unknown

Dataset Information

0

Phased Diploid Genome Assemblies for Three Strains of Candida albicans from Oak Trees.


ABSTRACT: Although normally a harmless commensal, Candida albicans, it is also one of the most common causes of bloodstream infections in the U.S. Candida albicans has long been considered an obligate commensal, however, recent studies suggest it can live outside animal hosts. Here, we have generated PacBio sequences and phased genome assemblies for three C. albicans strains from oak trees (NCYC 4144, NCYC 4145, and NCYC 4146). PacBio datasets are high depth (over 400 fold coverage) and more than half of the sequencing data are contained in reads longer than 15 kb. Primary assemblies showed high contiguity with several chromosomes for each strain recovered as single contigs, and greater than half of the alternative haplotype sequence was assembled in haplotigs at least 174 kb long. Using these assemblies we were able to identify structural polymorphisms, including a polymorphic inversion over 100 kb in length. These results show that phased de novo diploid assemblies for C. albicans can enable the study of genomic variation within and among strains of an important fungal pathogen.

SUBMITTER: Hamlin JAP 

PROVIDER: S-EPMC6829152 | biostudies-literature | 2019 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Phased Diploid Genome Assemblies for Three Strains of <i>Candida albicans</i> from Oak Trees.

Hamlin Jennafer A P JAP   Dias Guilherme B GB   Bergman Casey M CM   Bensasson Douda D  

G3 (Bethesda, Md.) 20191105 11


Although normally a harmless commensal, <i>Candida albicans</i>, it is also one of the most common causes of bloodstream infections in the U.S. <i>Candida albicans</i> has long been considered an obligate commensal, however, recent studies suggest it can live outside animal hosts. Here, we have generated PacBio sequences and phased genome assemblies for three <i>C. albicans</i> strains from oak trees (NCYC 4144, NCYC 4145, and NCYC 4146). PacBio datasets are high depth (over 400 fold coverage) a  ...[more]

Similar Datasets

| S-EPMC409918 | biostudies-literature
| S-EPMC4054093 | biostudies-literature
| S-EPMC7728601 | biostudies-literature
| S-EPMC549318 | biostudies-literature
| S-EPMC3583542 | biostudies-other
2020-08-01 | GSE127072 | GEO
| S-EPMC5503144 | biostudies-literature
| S-EPMC9339290 | biostudies-literature
| S-EPMC7225527 | biostudies-literature
| S-EPMC7645668 | biostudies-literature