Support vector machines based on K-means clustering for real-time business intelligence systems

Jiaqi Wang, Xindong Wu, Chengqi Zhang
2005 International Journal of Business Intelligence and Data Mining  
Support vector machines (SVM) have been applied to build classifiers, which can help users make well-informed business decisions. Despite their high generalisation accuracy, the response time of SVM classifiers is still a concern when applied into real-time business intelligence systems, such as stock market surveillance and network intrusion detection. This paper speeds up the response of SVM classifiers by reducing the number of support vectors. This is done by the K-means SVM (KMSVM)
more » ... m proposed in this paper. The KMSVM algorithm combines the K-means clustering technique with SVM and requires one more input parameter to be determined: the number of clusters. The criterion and strategy to determine the input parameters in the KMSVM algorithm are given in this paper. Experiments compare the KMSVM algorithm with SVM on real-world databases, and the results show that the KMSVM algorithm can speed up the response time of classifiers by both reducing support vectors and maintaining a similar testing accuracy to SVM. Reference to this paper should be made as follows: Wang, J., Wu, X. and Zhang, C. (2005) 'Support vector machines based on K-means clustering for real-time business intelligence systems', Int. are data mining and multi-agent systems. He has published more than 200 refereed papers, edited nine books, and published three monographs.
doi:10.1504/ijbidm.2005.007318 fatcat:blomb7t3jfh6lhftjbb7xdjafm