PANTHER in 2013: modeling the evolution of gene function, and other gene attributes, in the context of phylogenetic trees

Huaiyu Mi, Anushya Muruganujan, Paul D. Thomas
2012 Nucleic Acids Research  
The data and tools in PANTHER-a comprehensive, curated database of protein families, trees, subfamilies and functions available at http:// pantherdb.org-have undergone continual, extensive improvement for over a decade. Here, we describe the current PANTHER process as a whole, as well as the website tools for analysis of user-uploaded data. The main goals of PANTHER remain essentially unchanged: the accurate inference (and practical application) of gene and protein function over large sequence
more » ... atabases, using phylogenetic trees to extrapolate from the relatively sparse experimental information from a few model organisms. Yet the focus of PANTHER has continually shifted toward more accurate and detailed representations of evolutionary events in gene family histories. The trees are now designed to represent gene family evolution, including inference of evolutionary events, such as speciation and gene duplication. Subfamilies are still curated and used to define HMMs, but gene ontology functional annotations can now be made at any node in the tree, and are designed to represent gain and loss of function by ancestral genes during evolution. Finally, PANTHER now includes stable database identifiers for inferred ancestral genes, which are used to associate inferred gene attributes with particular genes in the common ancestral genomes of extant species.
doi:10.1093/nar/gks1118 pmid:23193289 pmcid:PMC3531194 fatcat:penzalro4zfajejys6bs2cas5u