Clustering of Association Rules for Big Datasets using Hadoop MapReduce

Salahadin A. Moahmmed, Mohamed A., El-Sayed M.
2021 International Journal of Advanced Computer Science and Applications  
Mining association rules is essential to discovering knowledge hidden in datasets. Many efficient association rule mining algorithms exist. However, they may generate a large number of rules when applied to big datasets. A large number of rules makes knowledge discovery a daunting task because too many rules are difficult to understand, interpret, or visualize. To reduce the number of discovered rules, researchers have proposed approaches such as rule pruning, summarizing, or clustering. For the flourishing fields of big data and the Internet of Things (IoT), more effective solutions are crucial to cope with the rapid evolution of data. In this paper, we propose a novel parallel association rule clustering approach based on Hadoop MapReduce. We ran many experiments to study the performance of the proposed approach and obtained promising results, e.g., the lowest scaleup was 77%.
doi:10.14569/ijacsa.2021.0120364 fatcat:id2qnadaf5ef7lolgldw64ewiy