Beyond co-occurrence

Haipeng Zhang, Mohammed Korayem, Erkang You, David J. Crandall
2012 Proceedings of the fifth ACM international conference on Web search and data mining - WSDM '12  
Studying relationships between keyword tags on social sharing websites has become a popular topic of research, both to improve tag suggestion systems and to discover connections between the concepts that the tags represent. Existing approaches have largely relied on tag co-occurrences. In this paper, we show how to find connections between tags by comparing their distributions over time and space, discovering tags with similar geographic and temporal patterns of use. Geo-spatial, temporal and
more » ... o-temporal distributions of tags are extracted and represented as vectors which can then be compared and clustered. Using a dataset of tens of millions of geo-tagged Flickr photos, we show that we can cluster Flickr photo tags based on their geographic and temporal patterns, and we evaluate the results both qualitatively and quantitatively using a panel of human judges. We also develop visualizations of temporal and geographic tag distributions, and show that they help humans recognize semantic relationships between tags. This approach to finding and visualizing similar tags is potentially useful for exploring any data having geographic and temporal annotations.
doi:10.1145/2124295.2124302 dblp:conf/wsdm/ZhangKYC12 fatcat:ndwts2vjtfhfro3ojbdpvh7cxm