Web document clustering using a hybrid neural network

M. Shamim Khan, Sebastian W. Khor
2004 Applied Soft Computing  
The list of documents returned by Internet search engines in response to a query these days can be quite overwhelming. There is an increasing need for organising this information and presenting it in a more compact and efficient manner. This paper describes a method developed for the automatic clustering of World Wide Web documents, according to their relevance to the user's information needs, by using a hybrid neural network. The objective is to reduce the time and effort the user has to spend to find the information sought after. Clustering documents by features representative of their contents (in this case, key words and phrases) increases the effectiveness and efficiency of the search process. It is shown that a two-dimensional visual presentation of information on retrieved documents, instead of the traditional linear listing, can create a more user-friendly interface between a search engine and the user.
doi:10.1016/j.asoc.2004.02.003 fatcat:wxtx7t6fgjc5jgfb5tp3kiypnq