Distributed Data Mining for Multiple Sourced Heterogeneous Datasets: A Survey

Xing-ying LI, Shan-zi LI, Yi-xuan WU, Ai-jia HE, Xiao-ya HUANG, Xin ZHAO
2018 DEStech Transactions on Computer Science and Engineering  
In the information age of the 21st century, a large amount of information is collected and applied. However, due to the heterogeneity of system environment for data storage and computing, how to mine these distributed data sources has become a valuable research topic that attracted more and more attention. In this paper, we firstly presented the problem scenario and main challenges confronting with the problem of distributed data mining on multiple sourced heterogeneous data sets. Then, we
more » ... yed research works related to the problem and elicited their main features on different technology domains to show current distributed solutions for different data mining algorithm categories. Finally, we reviewed in detail the research works and discussed the challenges remained in the distributed data mining problem for multiple sourced heterogeneous data sets.
doi:10.12783/dtcse/cmsam2018/26563 fatcat:yna7ghym5fbpzbfibpsynzsn7m