Clustering IP Addresses Using Longest Prefix Matching and Nearest Neighbor Algorithms [chapter]

Asim Karim, Syed Imran Jami, Irfan Ahmad, Mansoor Sarwar, Zartash Uzmi
2004 Lecture Notes in Computer Science  
This paper summarizes a new algorithm for clustering IP addresses. Unlike popular clustering algorithms such as k-means and DBSCAN, this algorithm is designed specifically for IP addresses. In particular, the algorithm employs the longest prefix match as a similarity metric and uses an adaptation of the nearest neighbor algorithm for search to yield meaningful clusters. The algorithm is automatic in that it does not require any input parameters. When applied to a large IP address dataset, the
more » ... gorithm produced 90% correct clusters. Correct cluster analysis is essential for many network design and management tasks including design of web caches and server replications.
doi:10.1007/978-3-540-28633-2_116 fatcat:ax3ehvq33bcafgthndwvfrpmey