A human functional protein interaction network and its application to cancer data analysis

Guanming Wu, Xin Feng, Lincoln Stein
2010 Genome Biology  
Functional protein interaction network A high-quality human functional protein inter-action network is constructed. Its utility is dem-onstrated in the identification of cancer candidate genes. Abstract Background: One challenge facing biologists is to tease out useful information from massive data sets for further analysis. A pathway-based analysis may shed light by projecting candidate genes onto protein functional relationship networks. We are building such a pathway-based analysis system.
more » ... sults: We have constructed a protein functional interaction network by extending curated pathways with noncurated sources of information, including protein-protein interactions, gene coexpression, protein domain interaction, Gene Ontology (GO) annotations and text-mined protein interactions, which cover close to 50% of the human proteome. By applying this network to two glioblastoma multiforme (GBM) data sets and projecting cancer candidate genes onto the network, we found that the majority of GBM candidate genes form a cluster and are closer than expected by chance, and the majority of GBM samples have sequence-altered genes in two network modules, one mainly comprising genes whose products are localized in the cytoplasm and plasma membrane, and another comprising gene products in the nucleus. Both modules are highly enriched in known oncogenes, tumor suppressors and genes involved in signal transduction. Similar network patterns were also found in breast, colorectal and pancreatic cancers. Conclusions: We have built a highly reliable functional interaction network upon expert-curated pathways and applied this network to the analysis of two genome-wide GBM and several other cancer data sets. The network patterns revealed from our results suggest common mechanisms in the cancer biology. Our system should provide a foundation for a network or pathway-based analysis platform for cancer and other diseases. Human PPI [45-47] Fly PPI [45] Domain Interaction [52] Prieto s Gene Expression [50] Lee s Gene Expression [49] GO BP Sharing [51] Yeast PPI [45] Worm PPI [45] PPIs from GeneWays [53] Human PPI [45-47] Fly PPI [45] Domain Interaction [52] Prieto s Gene Expression [50] Lee s Gene Expression [49] GO BP Sharing [51] Yeast PPI [45] Worm PPI [45] PPIs from GeneWays [53] Data sources for predicted FIs Reactome [23] Panther [60] KEGG [63] TRED [64] NCI-BioCarta [62] NCI-Nature [62] CellMap [61] Reactome [23] Panther [60] KEGG [63] TRED [64] NCI-BioCarta [62] NCI-Nature [62] CellMap [61] Na Data sources for annotated FIs Naïve Bayes Classifier r trained by validated by Predicted FIs Annotated FIs t t d FI d F FI Network N t Wu et al. Genome Biology 2010, 11:R53
doi:10.1186/gb-2010-11-5-r53 pmid:20482850 pmcid:PMC2898064 fatcat:vntehsevq5gcrfppuzcikesvde