Parallel and Distributed Data Mining: An Introduction [chapter]

Mohammed J. Zaki
2000 Lecture Notes in Computer Science  
The explosive growth in data collection in business and scientific fields has literally forced upon us the need to analyze and mine useful knowledge from it. Data mining refers to the entire process of extracting useful and novel patterns/models from large datasets. Due to the huge size of data and amount of computation involved in data mining, high-performance computing is an essential component for any successful large-scale data mining application. This chapter presents a survey on
more » ... e parallel and distributed data mining algorithms and systems, serving as an introduction to the rest of this volume. It also discusses the issues and challenges that must be overcome for designing and implementing successful tools for large-scale data mining.
doi:10.1007/3-540-46502-2_1 fatcat:3mmcofbadbas7f7r5y5opxxwdy