XEdge

Panagiotis Antonellis, Christos Makris, Nikos Tsirakis
2008 Proceedings of the 2008 ACM symposium on Applied computing - SAC '08  
In this paper we propose a unified clustering algorithm for both homogeneous and heterogeneous XML documents. Depending on the type of the XML documents, the proposed algorithm modifies its distance metric in order to properly adapt to the special structural characteristics of homogeneous and heterogeneous XML documents. We compare the quality of the formed clusters with those of one of the latest XML clustering algorithms and show that our algorithm outperforms it in the case of both homogeneous and heterogeneous XML documents.
doi:10.1145/1363686.1363940 dblp:conf/sac/AntonellisMT08 fatcat:lf5v6tjbobhz5j5wws5nyz5rra