Please use this identifier to cite or link to this item:http://hdl.handle.net/20.500.12105/12678
Population genomics of the pathogenic yeast Candida tropicalis identifies hybrid isolates in environmental samples.
PLoS Pathog . 2021 Mar 31;17(3):e1009138.
Candida tropicalis is a human pathogen that primarily infects the immunocompromised. Whereas the genome of one isolate, C. tropicalis MYA-3404, was originally sequenced in 2009, there have been no large-scale, multi-isolate studies of the genetic and phenotypic diversity of this species. Here, we used whole genome sequencing and phenotyping to characterize 77 isolates of C. tropicalis from clinical and environmental sources from a variety of locations. We show that most C. tropicalis isolates are diploids with approximately 2-6 heterozygous variants per kilobase. The genomes are relatively stable, with few aneuploidies. However, we identified one highly homozygous isolate and six isolates of C. tropicalis with much higher heterozygosity levels ranging from 36-49 heterozygous variants per kilobase. Our analyses show that the heterozygous isolates represent two different hybrid lineages, where the hybrids share one parent (A) with most other C. tropicalis isolates, but the second parent (B or C) differs by at least 4% at the genome level. Four of the sequenced isolates descend from an AB hybridization, and two from an AC hybridization. The hybrids are MTLa/α heterozygotes. Hybridization, or mating, between different parents is therefore common in the evolutionary history of C. tropicalis. The new hybrids were predominantly found in environmental niches, including from soil. Hybridization is therefore unlikely to be associated with virulence. In addition, we used genotype-phenotype correlation and CRISPR-Cas9 editing to identify a genome variant that results in the inability of one isolate to utilize certain branched-chain amino acids as a sole nitrogen source.
All sequencing data isavailableat NCBI under Bio Project accession PRJNA604451and C. tropicalis genome assembly B and annotationis available under NCBI accession JAFIQD000000000. rDNA sequences are available at accession numbers MW584905-MW584910.Other data sets (i.e. variant calls and images for phenotype analysis) are available at https://doi.org/10.6084/m9.figshare.13128839.v1.
Files in this item