Research on Parallel Design of DBSCAN Clustering Algorithm in Spatial Data Mining

Gong-jian ZHOU
2018 DEStech Transactions on Engineering and Technology Research  
DBSCAN clustering algorithm uses fixed Eps and Minpts. When density distribution is uneven, the effect of clustering is not ideal, and the time complexity of the algorithm is O (n 2 ). To solve the above problems, this paper proposes a parallel grid clustering algorithm and two cluster merging strategies of DBSCAN based on Spark platform, will find the Eps neighborhood to narrow the scope of the eight adjacent cells within the data object, and the parallel execution of the local clustering data
more » ... with fast global clustering. The experiment shows that the improved DBSCAN parallel algorithm has better acceleration ratio and extensibility.
doi:10.12783/dtetr/ecar2018/26370 fatcat:y6vv77gvr5drvipddlxmnau5nm