Robust Hierarchical Overlapping Community Detection with Personalized PageRank

Yinglong Zhang, Xuewen Xia, Xing Xu, Fei Yu, Hongrun Wu, Ying Yu, Bo Wei
2020 IEEE Access  
Community detection is a fundamental task in graph mining. Despite the fact that most of existing community detection methods are devoted to finding disjoint community structure, communities often overlap with each other and are recursively organized in a hierarchical structure in many real-world networks. Also, finding hierarchical overlapping community structure has significant implications in many real-world applications. Some of the few existing attempts suffer from the problem that the
more » ... ined community structure is sensitive to network changes as they are based heavily on one-hop node proximity to detect communities. To tackle this problem, we propose a robust hierarchical overlapping community detection method with Personalized PageRank (PPR), which is often regarded as a prevalent metric to measure node proximity globally. Specifically, motivated by the agglomerative hierarchical clustering method, we present an effective and efficient mechanism to merge small communities and form hierarchically organized overlapping communities. Experimental results on both synthetic and real-world networks corroborate the effectiveness and robustness of the proposed framework. In addition, we introduce how to make use of the detected community structure to perform various node proximity queries such as the top-k structural hole spanner query and the top-k heterogeneous node query, which can help us gain more insights on the underlying network. INDEX TERMS Community detection, hierarchical overlapping communities, Personalized PageRank. FIGURE 1. A toy example to illustrate the perturbations of the network structure from Phase A to Phase C. Over the whole process, the two dense subgraphs (as illustrated in D) do not change. FIGURE 2. Sensitivity of community detection results of BMLPA w.r.t. the network perturbations on the toy example of Fig. 1, where different colors represent different communities. FIGURE 3. Sensitivity of community detection results of LC w.r.t. the network perturbations on the toy example of Fig. 1, where different colors represent different communities. it necessitates the development of an effective hierarchical overlapping community detection method that is robust to network perturbations.
doi:10.1109/access.2020.2998860 fatcat:4f3x7j5vqjbcbawdphdsqf2tvm