GRAPE: A Graph-Based Framework for Disambiguating People Appearances in Web Search

Lili Jiang, Jianyong Wang, Ning An, Shengyuan Wang, Jian Zhan, Lian Li
2009 2009 Ninth IEEE International Conference on Data Mining  
Finding information about people using search engines is one of the most common activities on the Web. However, search engines usually return a long list of Web pages, which may be relevant to many namesakes, especially given the explosive growth of Web data. To address the challenge caused by name ambiguity in Web people search, this paper proposes a novel graph-based framework, GRAPE (abbr. a Graph-based fRamework for disAmbiguating People appEarances in Web search). In GRAPE, people tag
more » ... PE, people tag information (e.g., people name, organization, and email address) surrounding the queried people name is extracted from the search results, a graphbased unsupervised algorithm is then developed to cluster the extracted tags, where a new method, Cohesion, is introduced to measure the importance of a tag for clustering, and each final cluster of tags represents a unique people entity. Experimental results show that our proposed framework outperforms the state-of-the-art Web people name disambiguation approaches.
doi:10.1109/icdm.2009.25 dblp:conf/icdm/JiangWAWZL09 fatcat:kg6mbwyl5vel3iubgnwda45asm