Sampling online social networks

Maksym Gabielkov, Ashwin Rao, Arnaud Legout
2014 Proceedings of the 2014 ACM conference on SIGCOMM - SIGCOMM '14  
Online social networks (OSNs) are an important source of information for scientists in different fields such as computer science, sociology, economics, etc. However, it is hard to study OSNs as they are very large. For instance, Facebook has 1.28 billion active users in March 2014 and Twitter claims 255 million active users in April 2014. Also, companies take measures to prevent crawls of their OSNs and refrain from sharing their data with the research community. For these reasons, we argue
more » ... sampling techniques will be the best technique to study OSNs in the future. In this work, we take an experimental approach to study the characteristics of well-known sampling techniques on a full social graph of Twitter crawled in 2012 [2]. Our contribution is to evaluate the behavior of these techniques on a real directed graph by considering two sampling scenarios: (a) obtaining most popular users (b) obtaining an unbiased sample of users, and to find the most suitable sampling techniques for each scenario.
doi:10.1145/2619239.2631452 dblp:conf/sigcomm/GabielkovRL14 fatcat:jz6s2rq6ifaivajuweeb4gz6wq