The Candida genome database incorporates multiple Candida species: multispecies search and analysis tools with curated gene and protein information for Candida albicans and Candida glabrata

D. O. Inglis, M. B. Arnaud, J. Binkley, P. Shah, M. S. Skrzypek, F. Wymore, G. Binkley, S. R. Miyasato, M. Simison, G. Sherlock
2011 Nucleic Acids Research  
The Candida Genome Database (CGD, http://www.candidagenome.org/) is an internet-based resource that provides centralized access to genomic sequence data and manually curated functional information about genes and proteins of the fungal pathogen Candida albicans and other Candida species. As the scope of Candida research and the number of sequenced strains and related species have grown in recent years, the need for expanded genomic resources has also grown. To answer this need, CGD has
expanded beyond storing data solely for C. albicans, now integrating data from multiple species. Herein we describe the incorporation of this multispecies information, which includes curated gene information and the reference sequence for C. glabrata, as well as orthology relationships that interconnect Locus Summary pages, allowing easy navigation between genes of C. albicans and C. glabrata. These orthology relationships are also used to predict Gene Ontology (GO) annotations for the gene products. We have also added protein information pages that display domains, structural information and physicochemical properties; bibliographic pages highlighting important topic areas in Candida biology; and a laboratory strain lineage page that describes the lineage of commonly used laboratory strains. All of these data are freely available at http://www.candidagenome.org/. We welcome feedback from the research community at candida-curator@lists.stanford.edu.
doi:10.1093/nar/gkr945 pmid:22064862 pmcid:PMC3245171 fatcat:qfogwktijnbufkeevqgqiey5ka