A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Filters
XClust
2002
Proceedings of the eleventh international conference on Information and knowledge management - CIKM '02
Our experiments to integrate real world DTDs demonstrate the effectiveness of the XClust approach. ...
We introduce XClust, a novel integration strategy that involves the clustering of DTDs. ...
We implement XClust in Java, and run the
Effectiveness of XClust In this experiment, we investigate how XClust facilitates the integration process and produces good quality integrated schema. ...
doi:10.1145/584792.584841
dblp:conf/cikm/LeeYHY02
fatcat:rsaclpqrcja5djmpdkkygaje2m
XClust
2002
Proceedings of the eleventh international conference on Information and knowledge management - CIKM '02
Our experiments to integrate real world DTDs demonstrate the effectiveness of the XClust approach. ...
We introduce XClust, a novel integration strategy that involves the clustering of DTDs. ...
We implement XClust in Java, and run the
Effectiveness of XClust In this experiment, we investigate how XClust facilitates the integration process and produces good quality integrated schema. ...
doi:10.1145/584838.584841
fatcat:wlojjqf6ubdmzhlbojfy2vm63a
XML data clustering
2011
ACM Computing Surveys
In the last few years we have observed a proliferation of approaches for clustering XML documents and schemas based on their structure and content. ...
We aim at introducing an integrated view that is useful when comparing XML data clustering approaches, when developing a new clustering algorithm, and when implementing an XML clustering component. ...
We wish to thank Stefanie Quade for improving the quality of the paper. ...
doi:10.1145/1978802.1978804
fatcat:zgparleb6nbkdnoxlcxn3vyrhm
Extensible User-Based XML Grammar Matching
[chapter]
2009
Lecture Notes in Computer Science
In this paper, we provide an approach for automatic XML grammar matching and comparison aiming to minimize the amount of user effort required to perform the match task. ...
XML grammar matching has found considerable interest recently due to the growing number of heterogeneous XML documents on the web and the increasing need to integrate, and consequently search and retrieve ...
Acknowledgements We are grateful to Phil Bernstein and Sabine Maßmann for providing us with their test schemas in order to conduct our matching experiments. ...
doi:10.1007/978-3-642-04840-1_23
fatcat:ptuf2lzuprc5tlrsalqry7qefi
XML Matchers: Approaches and challenges
2014
Knowledge-Based Systems
Finally, we analyze commercial tools implementing XML Matchers and introduce two challenging issues strictly related to this topic, namely XML source clustering and uncertainty management in XML Matchers ...
In the past, it was largely investigated especially for classical database models (e.g., E/R schemas, relational databases, etc.). ...
Acknowledgments We thank the Editor and anonymous Reviewers for their thorough review and highly appreciate the comments and suggestions, which significantly contributed to improving the quality of our ...
doi:10.1016/j.knosys.2014.04.044
fatcat:bnoqg7u4g5dvtoa5mq4hsqjxci
XML schema clustering with semantic and hierarchical similarity measures
2007
Knowledge-Based Systems
We present a schema clustering process by organising heterogeneous XML schemas into groups. ...
The methods are required to manage and discover the useful information from them for improved document handling. ...
Moreover, several databases tools that are developed to deliver, store, integrate and query XML data [5, 12, 21, 33] , require indexing based on structural similarity to support an effective document ...
doi:10.1016/j.knosys.2006.08.006
fatcat:ufwnk45e7jezldm3qai5q4fnpi
A PROGRESSIVE CLUSTERING ALGORITHM TO GROUP THE XML DATA BY STRUCTURAL AND SEMANTIC SIMILARITY
2007
International journal of pattern recognition and artificial intelligence
Since the emergence in the popularity of XML for data representation and exchange over the Web, the distribution of XML documents has rapidly increased. ...
We present a novel clustering algorithm PCXSS that keeps the heterogeneous XML documents into various groups according to the similar structural and semantic representations. ...
Authors of XClust [18] have defined a cardinality table (Table 2) for DTD constraints. ...
doi:10.1142/s0218001407005648
fatcat:zyg3rmvjbbcqhbepauror7h74e
Ontology-Alignment Techniques: Survey and Analysis
2015
International Journal of Modern Education and Computer Science
They can see the insufficiency, so that they can propose new approaches for stronger alignment. ...
They can also adapt or reuse alignment techniques for specific research issues, such as semantic annotation, maintenance of links between entities, etc. ...
,
Data types
Iterative fixed
point
computation
XClust
[50]
DTD
AUTO
Cardinality,
WordNet
Paths,
Children,
Leaves,
Clustering
Constraint-
based
Automatch
[4]
Relational
schema ...
doi:10.5815/ijmecs.2015.11.08
fatcat:3xpytvmlpra7rbb2bxr3zwf43i
Peer-to-peer management of XML data
2005
SIGMOD record
In this paper, we focus on data management issues for processing XML data in a p2p setting, namely indexing, replication, clustering and query routing and processing. ...
The widespread use of XML as a standard for representing and exchanging data in the Internet suggests using XML for describing data shared in a p2p system. ...
While structured queries work effectively with the inherent structure of XML data and can convey complex semantic meaning, they require from the user to know the schema (or part of the schema) of the XML ...
doi:10.1145/1083784.1083788
fatcat:smphtwg5gvfzzeztmkdembnuii
Building an XML document warehouse
2013
Journal of Decision Systems
In Lee et al. (2002) an integration strategy called XClust was proposed; it is based on clustering DTDs of XML data sources. In this strategy, similarity degrees between DTDs are computed. ...
After that, clusters of similar DTDs are determined based on similarity degrees. Finally, an integrated DTD is generated for each cluster. ...
Interface of extracted dimensions for the galaxy: Clef galaxy Extracted hierarchies for the D_Casimage_Case dimension ...
doi:10.1080/12460125.2013.780322
fatcat:uqkfgbf6vbavbfymtgqbylwdyq
Schema Extraction on Semi-structured Data
[article]
2021
arXiv
pre-print
Moreover, we also investigate tools and systems for schemas extraction. ...
Schema extraction tools are mainly used for spark or NoSQL databases, and are suitable for small datasets or simple application environments. ...
XClust is a integration strategy that involves clustering DTDs [23] . The XClust's processing is divided into two steps: DTD similarity computation and DTD clustering. ...
arXiv:2012.08105v2
fatcat:hco64wxnrfawrk3twlff2xia3i
A matching algorithm for measuring the structural similarity between an XML document and a DTD and its applications
2004
Information Systems
In this paper we propose a matching algorithm for measuring the structural similarity between an XML document and a DTD. ...
Specifically, the matching algorithm is exploited for the classification of XML documents against a set of DTDs, the evolution of the DTD structure, the evaluation of structural queries, the selective ...
In [19] the authors propose XClust, an integration strategy that involves the clustering of DTDs. ...
doi:10.1016/s0306-4379(03)00031-0
fatcat:rpcb5v5aovep3pf54p3zcnm46u
User Oriented clustering of news articles using Tweets Heterogeneous Information Network
트위트 이형 정보 망을 이용한 뉴스 기사의 사용자 지향적 클러스터링
2013
Journal of Internet Computing and services
트위트 이형 정보 망을 이용한 뉴스 기사의 사용자 지향적 클러스터링
In order to overcome the issue of zero-participation in the process of clustering news articles in this paper we have proposed a framework for clustering news articles by combining users' judgments that ...
However these techniques are totally machine oriented techniques and lack users' participation in the process of decision making for membership of clustering. ...
XClust algorithm is one of structure distance based clustering algorithms for clustering the XML document. ...
doi:10.7472/jksii.2013.14.6.85
fatcat:a7pg45tfjjb5hkb655njiy4kbi
In this paper, we propose an effective clustering algorithm for XML data which uses substructures of the documents in order to gain insights about the important underlying structures. ...
One of the reasons for the popularity of XML has been its ability to encode structural information about data records. ...
RELATED WORK One of the earliest work on clustering tree structured data is the XClust algorithm [8] , which was designed to cluster XML schemas in order for efficient integration of large numbers of ...
doi:10.1145/1281192.1281201
dblp:conf/kdd/AggarwalTWFZ07
fatcat:xra2dwbul5dubbh7pqxw22zbse
A Better Approach to Ontology Integration using Clustering Through Global Similarity Measure
2018
Journal of Computer Science
Output of ontology matching tool is mapping between two ontologies and is used for generating clusters of ontology. We use Jaccard Similarity Index as a global similarity measure for clustering. ...
Ontology integration or merging is necessary in order to solve this problem of mixed knowledge. Finding similarity between two ontologies is crucial to achieve integration or merging of ontology. ...
Acknowledgment The Authors would like to thank management and staff of Charotar University of Science and Technology, CHARUSAT, Changa, India for suporting this research and providing resources for same ...
doi:10.3844/jcssp.2018.854.867
fatcat:ja5mlgfu3jdqpczwarjtnd4lkm
« Previous
Showing results 1 — 15 out of 25 results