ProtPhylo: identification of protein–phenotype and protein–protein functional associations via phylogenetic profiling

Yiming Cheng, Fabiana Perocchi
2015 Nucleic Acids Research  
ProtPhylo is a web-based tool to identify proteins that are functionally linked to either a phenotype or a protein of interest based on co-evolution. Prot-Phylo infers functional associations by comparing protein phylogenetic profiles (co-occurrence patterns of orthology relationships) for more than 9.7 million non-redundant protein sequences from all three domains of life. Users can query any of 2048 fully sequenced organisms, including 1678 bacteria, 255 eukaryotes and 115 archaea. In
more » ... , they can tailor ProtPhylo to a particular kind of biological question by choosing among four main orthology inference methods based either on pairwise sequence comparisons (One-way Best Hits and Best Reciprocal Hits) or clustering of orthologous proteins across multiple species (OrthoMCL and eggNOG). Next, ProtPhylo ranks phylogenetic neighbors of query proteins or phenotypic properties using the Hamming distance as a measure of similarity between pairs of phylogenetic profiles. Candidate hits can be easily and flexibly prioritized by complementary clues on subcellular localization, known protein-protein interactions, membrane spanning regions and protein domains. The resulting protein list can be quickly exported into a csv text file for further analyses. ProtPhylo is freely available at http: //www.protphylo.org.
doi:10.1093/nar/gkv455 pmid:25956654 pmcid:PMC4489284 fatcat:ifwzhjf5zrhshjiu34ar6x3txi