1,080 Hits in 6.2 sec

Schema and Design Free Keyword Search Interfaces for XML Databases

Ramesh Eluri, M.Rammohan Rao, M Brahmaiah, V Rajesh
2020 Figshare  
(IR) style keyword search on the web, keyword search on XML has emerged recently, which is schema independent approach and allows all the user to query XML databases by using keyword combination without  ...  Abstract:- Now a days XML is becoming a standard in data representation and data exchanging.  ...  Definition 3.2.2: (Data Node) The String values that are contained in the leaf node of XML data and have no tag name isdefined as a data node.  ... 
doi:10.6084/m9.figshare.12236582.v1 fatcat:nxy6cygb55bh7nqhwutlbfd3m4

Effective Keyword Search in XML Documents Based on MIU [chapter]

Jianjun Xu, Jiaheng Lu, Wei Wang, Baile Shi
2006 Lecture Notes in Computer Science  
We first analyze the problems caused by the refinement of result granularity during XML keyword search and then propose to partition an XML document into XML fragments with the granularity of Minimal Information  ...  This paper focuses on effective keyword search in XML documents which are modeled as labeled trees.  ...  Because there is no or weak relationship among these parts, such search results mean little to the user.  ... 
doi:10.1007/11733836_49 fatcat:lnrr2tujyvh2dk7e4a72hni6ou

The SphereSearch Engine for Unified Ranked Retrieval of Heterogeneous XML and Web Documents

Jens Graupmann, Ralf Schenkel, Gerhard Weikum
2005 Very Large Data Bases Conference  
The benefits of the SphereSearch engine are demonstrated by experiments with a large and richly tagged but non-schematic open encyclopedia extended with external documents.  ...  For Web data the XML-oriented query engine is leveraged to provide very rich search options that cannot be expressed in traditional Web search engines: concept-aware and link-aware querying that takes  ...  Our heuristic rules "promote" the text within the opening and closing headline tags into a "semantic" XML tag, and construct a properly nested structure.  ... 
dblp:conf/vldb/GraupmannSW05 fatcat:de4blj4r5zatzmgmsfaq3knl3u

A Decade of XML Data Management: An Industrial Experience Report from Oracle

Zhen Hua Liu, Ravi Murthy
2009 Proceedings / International Conference on Data Engineering  
XML and its related technologies have now been in use for almost a decade.  ...  This paper also provides a timely checkpoint of XML data management from industrial perspective with experience of developing and supporting Oracle XML products.  ...  Furthermore, the use of keyword search within XPath/XQuery [21] is a unique strength of XML. Traditional keyword search can only search for keywords within a document.  ... 
doi:10.1109/icde.2009.18 dblp:conf/icde/LiuM09 fatcat:oecsthn7q5g5bp2y3x7vbnoq5i

defoe: A Spark-Based Toolbox for Analysing Digital Historical Textual Data

Rosa Filgueira, Mariona Coll Ardanuy, Giovanni Colavizza, James Hetherington, Melissa Terras, Michael Jackson, Anna Roubickova, Amrey Krause, Ruth Ahnert, Tessa Hauswedell, Julianne Nyhan, David Beavan (+1 others)
2019 2019 15th International Conference on eScience (eScience)  
ACKNOWLEDGEMENTS This work was funded by Scottish Enterprise as part of the Alan Turing Institute-Scottish Enterprise Data Engineering Programme, and by AHRC as part of the Living with Machines via the  ...  • target_and_keywords_count_by_year: Searches for occurrences of a target word (occurring with any word in a list of keywords) and returns counts of occurrences of each target word and these keywords  ...  However, due to the nested nature of the historical XML schemas (such as ALTO, METS and British-Library specific XML), the spark-xml package is not able to infer them automatically, requiring a lot of  ... 
doi:10.1109/escience.2019.00033 dblp:conf/eScience/FilgueiraACHTJR19 fatcat:sbw5w5dv55dzlkoccet3wgvbbq

Enabling Schema-Free XQuery with meaningful query focus

Yunyao Li, Cong Yu, H. V. Jagadish
2006 The VLDB journal  
The default is to use keyword-based search and we are all too familiar with how difficult it is to obtain precise answers by these means.  ...  However, users may have only a limited knowledge of the XML structure, and may be unable to produce a correct XQuery expression, especially in the context of a heterogeneous information collection.  ...  Acknowledgements This work was supported in part by the United States National Science Foundation (NSF) under grants NSF IIS-0438909, IIS-0219513, by NIH under grant number LM08106-01, and by a gift from  ... 
doi:10.1007/s00778-006-0003-4 fatcat:2xclu6l5zvas7lmyyfnrkydldq

KEMB: A Keyword-Based XML Message Broker

Guoliang Li, Jianhua Feng, Jianyong Wang, Lizhu Zhou
2011 IEEE Transactions on Knowledge and Data Engineering  
This paper studies the problem of XML message brokering with user subscribed profiles of keyword queries and presents a KEyword-based XML Message Broker (KEMB) to address this problem.  ...  In contrast to traditional-path-expressions-based XML message brokers, KEMB stores a large number of user profiles, in the form of keyword queries, which capture the data requirement of users/applications  ...  To support metadata search, the tag names (e.g., "b" in Fig. 1 ) are also taken as keywords.  ... 
doi:10.1109/tkde.2010.159 fatcat:tnwk2jch2jeqjph4hsdpdt4cni

Data mining and the Web

Minos N. Garofalakis, Rajeev Rastogi, S. Seshadri, Kyuseok Shim
1999 Proceedings of the second international workshop on Web information and data management - WIDM '99  
Recently, there has been a great deal of interest in XML which requires documents to store tags along with the data to convey semantic information.  ...  These approaches only take into account hyperlink information and pay little or no attention to the content of Web pages.  ... 
doi:10.1145/319759.319781 dblp:conf/widm/GarofalakisRSS99 fatcat:zttixd2fajcsvbiibc43g563vq

Publishing Relational Data in XML: the SilkRoute Approach

Mary F. Fernandez, Atsuyuki Morishima, Dan Suciu, Wang Chiew Tan
2001 IEEE Data Engineering Bulletin  
Its scope includes the design, implementation, modelling, theory and application of database systems and their technology.  ...  Letters, conference information, and news should be sent to the Editor-in-Chief. Papers for each issue are solicited by and should be sent to the Associate Editor responsible for the issue.  ...  Acknowledgements Funding for this work was provided by DARPA through NAVY/SPAWAR Contract No. N66001-99-1-8908 and by NSF Awards CDA-9623632 and ITR 0086002.  ... 
dblp:journals/debu/FernandezMST01 fatcat:zvdgpmia2zen7lhy2pkhnfg6da

On the integration of structure indexes and inverted lists

Raghav Kaushik, Rajasekar Krishnamurthy, Jeffrey F. Naughton, Raghu Ramakrishnan
2004 Proceedings of the 2004 ACM SIGMOD international conference on Management of data - SIGMOD '04  
Several methods have been proposed to evaluate queries over a native XML DBMS, where the queries specify both path and keyword constraints.  ...  Our technique is general and applicable for a wide range of choices of structure indexes and inverted list join algorithms.  ...  As described in [15] , XML search tasks can be divided into Content-Only (CO) tasks where XML documents are searched only using keywords, and Content-and-Structure (CAS) tasks where both struc-ture and  ... 
doi:10.1145/1007568.1007656 dblp:conf/sigmod/KaushikKNR04 fatcat:d43ndxjl4nakdljdsgzn73df5u

Report on INEX 2009

T. Beckers, S. Geva, W.-C. Huang, T. Iofciu, J. Kamps, G. Kazai, M. Koolen, S. Kutty, M. Landoni, M. Lehtonen, V. Moriceau, P. Bellot (+17 others)
2010 SIGIR Forum  
Book Track Investigating techniques to support users in reading, searching, and navigating full texts of digitized books.  ...  This paper reports on the INEX 2009 evaluation campaign, which consisted of a wide range of tracks: Ad hoc, Book, Efficiency, Entity Ranking, Interactive, QA, Link the Wiki, and XML Mining.  ...  The track should also contain type C topics, high-dimensional, structure-oriented retrieval settings over a DB-style set of content-andstructure queries with deeply nested structure but only a few keyword  ... 
doi:10.1145/1842890.1842897 fatcat:46evgkszirdm3grr6fqrtuyqtm

Microformats: the next (small) thing on the semantic Web?

R. Khare
2006 IEEE Internet Computing  
Users started choosing tags that weren't just keywords but also labeled groups and roles ("to-read").  ...  The "Web of HTML" was poised to give way to a "Web of XML" in which each publisher used its own tags and presentation logic to empower a new generation of browsers.  ... 
doi:10.1109/mic.2006.13 fatcat:aibwg3gvnfchrikacz6tumhdnq

Report on INEX 2008

Gianluca Demartin, Gabriella Kazai, Marijn Koolen, Monica Landoni, Ragnar Nordlie, Nils Pharo, Ralf Schenkel, Martin Theobald, Andrew Trotman, Arjen P. de Vries, Alan Woodley, Ludovic Denoye (+8 others)
2009 SIGIR Forum  
This paper reports on the INEX 2008 evaluation campaign, which consisted of a wide range of tracks: Ad hoc, Book, Efficiency, Entity Ranking, Interactive, QA, Link the Wiki, and XML Mining. • Link-the-Wiki  ...  INEX investigates focused retrieval from structured documents by providing large test collections of structured documents, uniform evaluation measures, and a forum for organizations to compare their results  ...  are mostly using either none or only very little structural information and only a few keyword conditions over the target element of the query.  ... 
doi:10.1145/1670598.1670603 fatcat:zoxbecrybrf63fg54g4kt7w7na

Advanced Information Access to Parliamentary Debates

Maarten Marx
2009 Journal of Digital Information  
In this paper, we analyze the structure of parliamentary proceedings and sketch a widely applicable DTD. We show how proceedings in PDF format can be transformed into deeply nested XML.  ...  Having the proceedings in XML makes a wide range of applications possible.  ...  a safe place, it is in principle always possible to recreate the XML versions we have described here.  ... 
dblp:journals/jodi/Marx09 fatcat:54d7film5jcrzbt72bcimewpom

Distribution of immunodeficiency fact files with XML – from Web to WAP

Jouni Väliaho, Pentti Riikonen, Mauno Vihinen
2005 BMC Medical Informatics and Decision Making  
A XML-based language enables creation of open source databases for storage, maintenance and delivery for different platforms.  ...  Methods: Here we present a new data model called fact file and an XML-based specification Inherited Disease Markup Language (IDML), that were developed to facilitate disease information integration, storage  ...  Acknowledgements Financial support from the European Union, the National Technology Agency of Finland and the Medical Research Fund of Tampere University Hospital is gratefully acknowledged.  ... 
doi:10.1186/1472-6947-5-21 pmid:15978138 pmcid:PMC1184081 fatcat:hzeteza7yvhwdpenx6ep4brwta
« Previous Showing results 1 — 15 out of 1,080 results