Web mining in soft computing framework: relevance, state of the art and future directions

S.K. Pal, V. Talwar, P. Mitra
2002 IEEE Transactions on Neural Networks  
This paper summarizes the different characteristics of web data, the basic components of web mining and its different types, and their current states of the art. The reason for considering web mining, a separate field from data mining, is explained. The limitations of some of the existing web mining methods and tools are enunciated, and the significance of soft computing (comprising fuzzy logic (FL), artificial neural networks (ANNs), genetic algorithms (GAs), and rough sets (RSs) highlighted.
more » ... survey of the existing literature on "soft web mining" is provided along with the commercially available systems. The prospective areas of web mining where the application of soft computing needs immediate attention are outlined with justification. Scope for future research in developing "soft web mining" systems is explained. An extensive bibliography is also provided. Index Terms-Artificial neural networks (ANNs), data mining, fuzzy logic (FL), genetic algorithms (GAs), information retrieval (IR), knowledge discovery, pattern recognition, rough sets (RSs), search engines.
doi:10.1109/tnn.2002.1031947 pmid:18244512 fatcat:a2ea5nfnczgjlpwsbwe6ebt5hi