Protein-Protein Interaction Document Mining

Shing Doong, Shu-Fen Hong
2006 Proceedings of the 9th Joint Conference on Information Sciences (JCIS)  
Protein-protein interactions (PPI) are very important to the understanding of metabolic pathway. Many digital publications are available today; some of them discuss PPI and some of them do not. If machine learning techniques can be used to detect those PPI documents automatically, it would save researchers tremendous amount of time to construct a biological pathway. In this study, we analyze this document mining problem by using different kinds of feature representations and classification
more » ... ithms. Latent semantic indexing (LSI) and information gain (IG) were used to extract features from a document for classification, while support vector machine (SVM) and Naïve Bayesian (NB) were the selected algorithms. It is found that the combination of LSI and SVM provided the best solution.
doi:10.2991/jcis.2006.250 dblp:conf/jcis/DoongH06 fatcat:z52ogu5yezdcxcunpuxymnj7vq