On the lack of typical behavior in the global Web traffic network

Mark Meiss, Filippo Menczer, Alessandro Vespignani
2005 Proceedings of the 14th international conference on World Wide Web - WWW '05  
We offer the first large-scale analysis of Web traffic based on network flow data. Using data collected on the Inter-net2 network, we constructed a weighted bipartite clientserver host graph containing more than 18 × 10 6 vertices and 68 × 10 6 edges valued by relative traffic flows. When considered as a traffic map of the World-Wide Web, the generated graph provides valuable information on the statistical patterns that characterize the global information flow on the Web. Statistical analysis
more » ... ows that client-server connections and traffic flows exhibit heavy-tailed probability distributions lacking any typical scale. In particular, the absence of an intrinsic average in some of the distributions implies the absence of a prototypical scale appropriate for server design, Web-centric network design, or traffic modeling. The inspection of the amount of traffic handled by clients and servers and their number of connections highlights non-trivial correlations between information flow and patterns of connectivity as well as the presence of anomalous statistical patterns related to the behavior of users on the Web. The results presented here may impact considerably the modeling, scalability analysis, and behavioral study of Web applications.
doi:10.1145/1060745.1060820 dblp:conf/www/MeissMV05 fatcat:tvoqoyo2j5aw5jxfbfwiueilke