XClust

Mong Li Lee, Liang Huai Yang, Wynne Hsu, Xia Yang
2002 Proceedings of the eleventh international conference on Information and knowledge management - CIKM '02  
Our experiments to integrate real world DTDs demonstrate the effectiveness of the XClust approach.  ...  We introduce XClust, a novel integration strategy that involves the clustering of DTDs.  ...  We implement XClust in Java, and run  ...  Effectiveness of XClust: In this experiment, we investigate how XClust facilitates the integration process and produces good quality integrated schema.  ... 
doi:10.1145/584792.584841 dblp:conf/cikm/LeeYHY02 fatcat:rsaclpqrcja5djmpdkkygaje2m

XClust

Mong Li Lee, Liang Huai Yang, Wynne Hsu, Xia Yang
2002 Proceedings of the eleventh international conference on Information and knowledge management - CIKM '02  
Our experiments to integrate real world DTDs demonstrate the effectiveness of the XClust approach.  ...  We introduce XClust, a novel integration strategy that involves the clustering of DTDs.  ...  We implement XClust in Java, and run  ...  Effectiveness of XClust: In this experiment, we investigate how XClust facilitates the integration process and produces good quality integrated schema.  ... 
doi:10.1145/584838.584841 fatcat:wlojjqf6ubdmzhlbojfy2vm63a

XML data clustering

Alsayed Algergawy, Marco Mesiti, Richi Nayak, Gunter Saake
2011 ACM Computing Surveys  
In the last few years we have observed a proliferation of approaches for clustering XML documents and schemas based on their structure and content.  ...  We aim at introducing an integrated view that is useful when comparing XML data clustering approaches, when developing a new clustering algorithm, and when implementing an XML clustering component.  ...  We wish to thank Stefanie Quade for improving the quality of the paper.  ... 
doi:10.1145/1978802.1978804 fatcat:zgparleb6nbkdnoxlcxn3vyrhm

Extensible User-Based XML Grammar Matching [chapter]

Joe Tekli, Richard Chbeir, Kokou Yetongnon
2009 Lecture Notes in Computer Science  
In this paper, we provide an approach for automatic XML grammar matching and comparison aiming to minimize the amount of user effort required to perform the match task.  ...  XML grammar matching has found considerable interest recently due to the growing number of heterogeneous XML documents on the web and the increasing need to integrate, and consequently search and retrieve  ...  Acknowledgements We are grateful to Phil Bernstein and Sabine Maßmann for providing us with their test schemas in order to conduct our matching experiments.  ... 
doi:10.1007/978-3-642-04840-1_23 fatcat:ptuf2lzuprc5tlrsalqry7qefi

XML Matchers: Approaches and challenges

Santa Agreste, Pasquale De Meo, Emilio Ferrara, Domenico Ursino
2014 Knowledge-Based Systems  
Finally, we analyze commercial tools implementing XML Matchers and introduce two challenging issues strictly related to this topic, namely XML source clustering and uncertainty management in XML Matchers  ...  In the past, it was largely investigated especially for classical database models (e.g., E/R schemas, relational databases, etc.).  ...  Acknowledgments We thank the Editor and anonymous Reviewers for their thorough review and highly appreciate the comments and suggestions, which significantly contributed to improving the quality of our  ... 
doi:10.1016/j.knosys.2014.04.044 fatcat:bnoqg7u4g5dvtoa5mq4hsqjxci

XML schema clustering with semantic and hierarchical similarity measures

Richi Nayak, Wina Iryadi
2007 Knowledge-Based Systems  
We present a schema clustering process by organising heterogeneous XML schemas into groups.  ...  The methods are required to manage and discover the useful information from them for improved document handling.  ...  Moreover, several database tools that are developed to deliver, store, integrate and query XML data [5, 12, 21, 33], require indexing based on structural similarity to support an effective document  ... 
doi:10.1016/j.knosys.2006.08.006 fatcat:ufwnk45e7jezldm3qai5q4fnpi

A PROGRESSIVE CLUSTERING ALGORITHM TO GROUP THE XML DATA BY STRUCTURAL AND SEMANTIC SIMILARITY

RICHI NAYAK, TIEN TRAN
2007 International journal of pattern recognition and artificial intelligence  
Since the emergence in the popularity of XML for data representation and exchange over the Web, the distribution of XML documents has rapidly increased.  ...  We present a novel clustering algorithm PCXSS that keeps the heterogeneous XML documents into various groups according to the similar structural and semantic representations.  ...  Authors of XClust [18] have defined a cardinality table (Table 2) for DTD constraints.  ... 
doi:10.1142/s0218001407005648 fatcat:zyg3rmvjbbcqhbepauror7h74e

Ontology-Alignment Techniques: Survey and Analysis

Fatima Ardjani, Djelloul Bouchiha, Mimoun Malki
2015 International Journal of Modern Education and Computer Science  
They can see the insufficiency, so that they can propose new approaches for stronger alignment.  ...  They can also adapt or reuse alignment techniques for specific research issues, such as semantic annotation, maintenance of links between entities, etc.  ...  , Data types Iterative fixed point computation XClust [50] DTD AUTO Cardinality, WordNet Paths, Children, Leaves, Clustering Constraint-based Automatch [4] Relational schema  ... 
doi:10.5815/ijmecs.2015.11.08 fatcat:3xpytvmlpra7rbb2bxr3zwf43i

Peer-to-peer management of XML data

Georgia Koloniari, Evaggelia Pitoura
2005 SIGMOD record  
In this paper, we focus on data management issues for processing XML data in a p2p setting, namely indexing, replication, clustering and query routing and processing.  ...  The widespread use of XML as a standard for representing and exchanging data in the Internet suggests using XML for describing data shared in a p2p system.  ...  While structured queries work effectively with the inherent structure of XML data and can convey complex semantic meaning, they require from the user to know the schema (or part of the schema) of the XML  ... 
doi:10.1145/1083784.1083788 fatcat:smphtwg5gvfzzeztmkdembnuii

Building an XML document warehouse

Jamel Feki, Ines Ben Messaoud, Gilles Zurfluh
2013 Journal of Decision Systems  
In Lee et al. (2002) an integration strategy called XClust was proposed; it is based on clustering DTDs of XML data sources. In this strategy, similarity degrees between DTDs are computed.  ...  After that, clusters of similar DTDs are determined based on similarity degrees. Finally, an integrated DTD is generated for each cluster.  ...  Interface of extracted dimensions for the galaxy: Clef galaxy Extracted hierarchies for the D_Casimage_Case dimension  ... 
doi:10.1080/12460125.2013.780322 fatcat:uqkfgbf6vbavbfymtgqbylwdyq

Schema Extraction on Semi-structured Data [article]

Panpan Li, Yikun Gong, Chen Wang
2021 arXiv   pre-print
Moreover, we also investigate tools and systems for schema extraction.  ...  Schema extraction tools are mainly used for Spark or NoSQL databases, and are suitable for small datasets or simple application environments.  ...  XClust is an integration strategy that involves clustering DTDs [23]. XClust's processing is divided into two steps: DTD similarity computation and DTD clustering.  ... 
arXiv:2012.08105v2 fatcat:hco64wxnrfawrk3twlff2xia3i

A matching algorithm for measuring the structural similarity between an XML document and a DTD and its applications

Elisa Bertino, Giovanna Guerrini, Marco Mesiti
2004 Information Systems  
In this paper we propose a matching algorithm for measuring the structural similarity between an XML document and a DTD.  ...  Specifically, the matching algorithm is exploited for the classification of XML documents against a set of DTDs, the evolution of the DTD structure, the evaluation of structural queries, the selective  ...  In [19] the authors propose XClust, an integration strategy that involves the clustering of DTDs.  ... 
doi:10.1016/s0306-4379(03)00031-0 fatcat:rpcb5v5aovep3pf54p3zcnm46u

User Oriented clustering of news articles using Tweets Heterogeneous Information Network
트위트 이형 정보 망을 이용한 뉴스 기사의 사용자 지향적 클러스터링

Muhammad Shoaib, Wang-Cheol Song
2013 Journal of Internet Computing and services  
In order to overcome the issue of zero-participation in the process of clustering news articles in this paper we have proposed a framework for clustering news articles by combining users' judgments that  ...  However these techniques are totally machine oriented techniques and lack users' participation in the process of decision making for membership of clustering.  ...  XClust algorithm is one of structure distance based clustering algorithms for clustering the XML document.  ... 
doi:10.7472/jksii.2013.14.6.85 fatcat:a7pg45tfjjb5hkb655njiy4kbi

Xproj

Charu C. Aggarwal, Na Ta, Jianyong Wang, Jianhua Feng, Mohammed Zaki
2007 Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '07  
In this paper, we propose an effective clustering algorithm for XML data which uses substructures of the documents in order to gain insights about the important underlying structures.  ...  One of the reasons for the popularity of XML has been its ability to encode structural information about data records.  ...  RELATED WORK One of the earliest works on clustering tree structured data is the XClust algorithm [8], which was designed to cluster XML schemas in order for efficient integration of large numbers of  ... 
doi:10.1145/1281192.1281201 dblp:conf/kdd/AggarwalTWFZ07 fatcat:xra2dwbul5dubbh7pqxw22zbse

A Better Approach to Ontology Integration using Clustering Through Global Similarity Measure

Ashwin Makwana, Amit Ganatra
2018 Journal of Computer Science  
Output of ontology matching tool is mapping between two ontologies and is used for generating clusters of ontology. We use Jaccard Similarity Index as a global similarity measure for clustering.  ...  Ontology integration or merging is necessary in order to solve this problem of mixed knowledge. Finding similarity between two ontologies is crucial to achieve integration or merging of ontology.  ...  Acknowledgment The Authors would like to thank management and staff of Charotar University of Science and Technology, CHARUSAT, Changa, India for supporting this research and providing resources for same  ... 
doi:10.3844/jcssp.2018.854.867 fatcat:ja5mlgfu3jdqpczwarjtnd4lkm