Chinese Web Page Classification Based on Vector Space Model

Li Wei, Ling Zhang, Hua Mei Li, Xiao Zhou Chen
2013 Advanced Materials Research  
Chinese web page classification is an active research area in data mining. This paper proposes a Chinese web page classification algorithm based on the vector space model. The algorithm applies supervised machine learning to build a web page classifier. It combines text frequency and methods for feature extraction and improves the traditional TF-IDF weighting formula. The results show that the classifier is feasible and effective.
doi:10.4028/www.scientific.net/amr.846-847.1801 fatcat:nsvtgkpwo5fxjagmz2z4m7cwmq
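
The abstract describes a supervised classifier built on TF-IDF weights in a vector space model but gives no formulas. The following is a minimal sketch of that general approach, not the authors' method: it uses the standard smoothed TF-IDF weighting rather than the paper's improved formula, and it assumes a nearest-centroid classifier with cosine similarity, since the abstract does not name the learning algorithm. It also assumes the page text has already been word-segmented into space-separated tokens. All function names and the toy training data are hypothetical.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Assumption: the Chinese text is already word-segmented, one space per token.
    return text.split()


def fit_idf(docs):
    """Compute smoothed IDF, log((1 + N) / (1 + df)) + 1, for each term."""
    n = len(docs)
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))
    return {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}


def normalize(vec):
    """Scale a sparse vector to unit length; an all-zero vector becomes empty."""
    norm = math.sqrt(sum(w * w for w in vec.values()))
    return {t: w / norm for t, w in vec.items()} if norm else {}


def tfidf_vector(tokens, idf):
    # Terms unseen during training have no IDF and are dropped.
    tf = Counter(t for t in tokens if t in idf)
    return normalize({t: c * idf[t] for t, c in tf.items()})


def cosine(a, b):
    # Both vectors are unit length, so the dot product is the cosine similarity.
    return sum(w * b.get(t, 0.0) for t, w in a.items())


def train(labeled_texts):
    """Return the IDF table and one unit-length centroid per class."""
    docs = [(tokenize(text), label) for text, label in labeled_texts]
    idf = fit_idf([tokens for tokens, _ in docs])
    sums = defaultdict(lambda: defaultdict(float))
    for tokens, label in docs:
        for t, w in tfidf_vector(tokens, idf).items():
            sums[label][t] += w
    centroids = {label: normalize(vec) for label, vec in sums.items()}
    return idf, centroids


def classify(text, idf, centroids):
    """Assign the class whose centroid is most similar to the page vector."""
    vec = tfidf_vector(tokenize(text), idf)
    return max(centroids, key=lambda label: cosine(vec, centroids[label]))


# Toy, hand-made training set (hypothetical data, not from the paper).
TRAIN = [
    ("足球 比赛 球队 进球", "sports"),
    ("篮球 比赛 球员 得分", "sports"),
    ("股票 市场 投资 上涨", "finance"),
    ("银行 利率 投资 贷款", "finance"),
]

idf, centroids = train(TRAIN)
print(classify("球队 比赛 得分", idf, centroids))
```

On real web pages, the tokenizer would be replaced by a Chinese word segmenter and a feature selection step would prune the vocabulary before weighting; both are left out here to keep the sketch self-contained.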