Matko Boanjak, Eduardo Oliveira, José Martins, Eduarda Mendes Rodrigues, Luís Sarmento
2012 Proceedings of the 21st international conference companion on World Wide Web - WWW '12 Companion  
Modern social network analysis relies on vast quantities of data to infer new knowledge about human relations and communication. In this paper we describe TwitterEcho, an open source Twitter crawler for supporting this kind of research, which is characterized by a modular distributed architecture. Our crawler enables researchers to continuously collect data from particular user communities, while respecting Twitter's imposed limits. We present the core modules of the crawling server, some of
more » ... ch were specifically designed to focus the crawl on the Portuguese Twittosphere. Additional modules can be easily implemented, thus changing the focus to a different community. Our evaluation of the system shows high crawling performance and coverage.
doi:10.1145/2187980.2188266 dblp:conf/www/BoanjakOMRS12 fatcat:nahcnzkgybdjrat2vc63kcoh4a